AIRaML | ML Engineer Portfolio

About

Building ML — end to end

From raw data to deployed prediction APIs — every system built, tested, and live.

Ramakrishnasai Wuppalapati

ML Engineer · Data Scientist · AI Builder

Available for ML roles

Achievement-driven ML professional with a PG Diploma in Data Science from IIIT-Bangalore (3.7/4). I build complete systems — not just notebooks — covering data ingestion, EDA, feature engineering, model training, evaluation, and deployment via REST APIs.

My work spans classical ML, deep learning (CNN/RNN/Transfer Learning), NLP, computer vision, and Generative AI (RAG, Agents, LangChain). Every project here is live and interactive.

WRamakrishnasai github.com/ramleo hub.docker.com/u/wram Hyderabad, India

Core Competencies

Machine LearningDeep LearningGenerative AIComputer VisionNatural Language ProcessingModel DeploymentData VisualizationStatistical AnalysisBusiness IntelligenceWeb ScrapingDocker & ContainersCloud (GCP)

96.7%Accuracy

4+Live Apps

2+Years ML

3.7/4GPA

CNNDL Expert

RAGGen AI

Key Metrics

Drag to rotate · each face shows a
live project stat

Education

PG Diploma in Data Science

Specialization in Deep Learning

IIIT-Bangalore × upGrad

3.7 / 4.0

2021

Bachelor of Commerce

Accounts & Economics

Mumbai University

62%

2005

Projects

Live ML Apps — click to predict

Platform

ML Unified Platform

Models

One app, four models. Select Iris classifier, Titanic survival predictor, Diabetes risk model, or Insurance premium estimator from a sidebar — all served from a single schema-driven FastAPI backend with dynamic forms.

ModelMulti-Model

Features26

4 Datasets · 26 Features

PlatformFastAPISchema-DrivenClassificationRegression

Exploratory Analysis

EDA Explorer

∞

Datasets

Upload any CSV dataset and instantly explore it — shape, dtypes, missing value heatmap, per-column distributions (histograms for numeric, bar charts for categorical), descriptive statistics, outlier counts, and a full Pearson correlation heatmap. No code required.

ModelPandas · NumPy

Features0

Any CSV

EDAStatisticsCorrelationDistributionsData Profiling

Vision

ML Vision Platform

150

Seg Classes

Three vision tasks in one app: classify images across 1000 ImageNet categories (MobileNetV2 · ResNet50 · SqueezeNet · GoogLeNet), detect objects with TinyYOLOv3 (COCO 80 classes), and segment scenes pixel-by-pixel with SegFormer-B0 (ADE20K 150 classes). All models run as ONNX on a FastAPI microservice.

ModelSegFormer-B0 · YOLOv3 · MobileNetV2

Features3

ImageNet · COCO · ADE20K

VisionONNXSegmentationDetectionClassification

All projects are hosted on Render free tier — first load may take ~15s to spin up.

ML Capabilities

What powers every prediction

Clean Before You Train

Data Preprocessing

Steps

Deduplicate rows, impute missing values with 8+ numeric strategies (Mean, Median, KNN, MICE) and 4 categorical strategies, remove outliers via IQR / Z-score / Winsorize, fix skewness, and apply Yeo-Johnson power transform. Download a clean CSV or hand off directly to AutoML.

ModelSimpleImputer · KNN · MICE

Any CSV

ImputationOutliersEncodingPower Transform

No-Code Transforms

Feature Engineering

10+

Transforms

log1p, sqrt, Yeo-Johnson, percentile rank, outlier flag, and missing flag per numeric column. Plus binning, polynomial pairs, interaction terms, date extraction, and cyclical encoding (sin / cos). All transforms are fit on training data only — no leakage.

Modelscikit-learn · pandas

Any CSV

TransformsInteractionsDate FeaturesCyclical

Keep Only What Matters

Feature Selection

Methods

Four methods — Variance Threshold, Correlation Filter (drop >0.9 correlated), RFE (Random Forest), and SelectKBest (Mutual Info) — automatically prune irrelevant or redundant columns before training. Configurable top-K cutoff.

ModelRFE · SelectKBest · Variance

Any CSV

RFESelectKBestVarianceCorrelation

4-Model Competition

AutoML Pipeline

Models

RF, XGBoost, LightGBM, and CatBoost compete via 5-fold cross-validation. The winner is selected automatically by F1 (classification) or MAE (regression). Optional Optuna tuning and SHAP explanation run on the winner.

ModelRF · XGB · LGB · CatBoost

Any CSV

scikit-learnXGBoostLightGBMCatBoost

Post-Winner Hyperparameter Search

Optuna Tuning

Max Trials

TPE sampler runs up to 30 trials on the AutoML winner to find optimal hyperparameters. Tuning is optional and runs after model selection — not before — so it never inflates the competition score.

ModelTPE Sampler · 5-fold CV

AutoML winner

OptunaTPE Sampler5-fold CV

Per-Prediction Feature Impact

SHAP Explainability

100%

Explainable

Every prediction comes with a SHAP bar chart showing which features drove the result and by how much. FE-derived columns are grouped back to their originals so you see source-feature influence, not transform noise.

ModelSHAP · TreeExplainer

AutoML winner

SHAPFeature ImpactClassificationRegression

Combine Top-N Models

Ensemble Methods

Strategies

Simple voting (VotingClassifier / VotingRegressor) or stacking with a meta-learner on top of the AutoML winners. Reduces variance and improves generalization over any single model.

ModelVoting · Stacking

AutoML winners

VotingStackingMeta-Learnerscikit-learn

Monitor Production Data

Data Drift Detection

PSI

+ KS Test

Upload a new production batch CSV and compare it against the training baseline. PSI, KS test, and distribution histograms for numeric columns; category frequency shifts for categoricals. Trend sparkline tracks drift score across multiple batches.

ModelStatistical tests

Trained model + batch CSV

PSIKS TestDistribution ShiftMonitoring

End-to-End ML Canvas

Pipeline Builder

Stages

Visual card canvas that orchestrates all 7 ML stages — Preprocessing, Feature Engineering, Feature Selection, AutoML, Optuna, SHAP, and Ensemble — into one sequential pipeline.

ModelFull Pipeline

Any labeled CSV

PipelineAutoMLOptunaSHAPEnsembleEnd-to-End

Animated ML Showcase

Pipeline Cinema

Stages

Watch your data transform in real time — chibi scientist characters process each ML stage with fluid animations. A cinematic walkthrough of the full pipeline.

ModelVisual Demo

No upload needed

AnimationPipelineVisualCinematicDemo

Live Event Dashboard

Real-Time Analytics

∞

Live Events

Track every page view and tool interaction on this portfolio in real time. Events flow from the browser into a PostgreSQL database via a FastAPI ingestion API, then Supabase Realtime pushes each row to the dashboard the moment it lands — no polling, no refresh.

ModelSupabase Realtime · asyncpg

Browser events (page views, tool opens)

Real-TimeWebSocketPostgreSQLFastAPISupabase

Natural Language → Database Queries

Text-to-SQL Agent

LLM Providers

Ask questions in plain English and get executable SQL instantly. The agent generates SQL, runs it against a real database, explains results, and retries automatically on errors. Supports Chinook demo DB, SQLite upload, and PostgreSQL.

ModelGroq / Gemini / Cohere

Natural language question

SQLLLMAgentDatabaseNLP

AI-Powered Document Data Extraction

Document Intelligence

Document Types

Upload invoices, contracts, resumes, medical reports, bank statements, and more. AI classifies the document type, extracts structured fields with confidence scores, and highlights each field's location with bounding box overlays.

ModelGroq / Gemini / Cohere

PDF, PNG, JPG, JPEG, WEBP

OCRLLMPDFExtractionNLP

Tables & Figures as Citable Knowledge

Multimodal RAG

Chunk Types

Upload a PDF mixing prose, tables, and charts. Tables are read as structured data and figures get an AI-written caption, so questions whose answer lives in a number or a chart — not just a paragraph — get a grounded, page-cited answer.

ModelGroq / Mistral / Gemini

PDF (text, tables, figures)

RAGMultimodalPDFCitationsLLM

Discrepancy Report Across Documents

Contract/Invoice Reconciliation Assistant

Doc Roles

Upload a contract, then one or more invoices. Flags amounts, dates, and terms that disagree across documents, each with the two source passages and an explanation — never comparing invoices against each other, since they're expected to differ.

ModelGroq (llama-3.1-8b-instant)

PDF, PNG, JPG (contract + invoices)

RAGReconciliationContractsInvoicesLLM

Experience

Career timeline

A journey from financial services to full-stack ML engineering.

Consultant B2

Capgemini

Now

Dec 2025 – Present

Current employer.

Support Engineer

JoulestoWatts Business Solutions

Nov 2024 – Nov 2025

Technical support and ML project development. Built and deployed ML systems end-to-end.

Business Development Executive

SBI Life Insurance

May 2014 – Jul 2017

Business development, client management, and analytics-driven sales strategy.

Junior Executive

Veenus Cybersoft

Oct 2013 – May 2014

Technical operations and client support in a software environment.

Business Development Executive

Valuegain Distributors

Apr 2012 – Oct 2013

Business development and distribution operations.

Associate

Statestreet Syntel Services

Jun 2006 – Jun 2010

Financial services operations, process execution, and data management in a global enterprise environment.

Building ML — end to end

The full AI/ML stack

Live ML Apps — click to predict

ML Unified Platform

EDA Explorer

ML Vision Platform

What powers every prediction

Data Preprocessing

Feature Engineering

Feature Selection

AutoML Pipeline

Optuna Tuning

SHAP Explainability

Ensemble Methods

Data Drift Detection

Pipeline Builder

Pipeline Cinema

Real-Time Analytics

Text-to-SQL Agent

Document Intelligence

Multimodal RAG

Contract/Invoice Reconciliation Assistant

From raw data to live prediction

What's happening in AI & ML

Career timeline

Let's work together