Projects

A collection of infrastructure projects, platform engineering solutions, and ML systems I've built to solve real-world problems at scale.

Filter by Technology & Domain

Click any label to filter projects

Atlas

LLM Gateway Production Ready Enterprise Scale

A sophisticated LLM traffic and quota management gateway built with Redis, FastAPI, and Prometheus. Enables intelligent model routing, request limiting, and comprehensive observability for AI applications at scale.

Key Features:

  • Real-time quota management and rate limiting
  • Intelligent model routing based on load and cost
  • Comprehensive metrics and monitoring
  • High-performance async architecture

Atlas Dashboard

$ atlas status
✓ Redis connection: OK
✓ Model endpoints: 3 active
✓ Rate limiter: 1000 req/min
📊 Current load: 23%

Hyperion Performance Monitor

LIVE
GPU Inference
NVIDIA A100
28ms
avg latency
CPU Inference
Intel Xeon
312ms
avg latency
Throughput 347 req/s
Batch Size 4.2 avg
GPU Utilization 78%
11.1x
Performance Boost

Hyperion

ML Platform Production Ready Enterprise Scale

High-performance ML inference platform with GPU acceleration and intelligent request batching. Achieves 10-50ms inference times with 10x+ throughput improvements through dynamic batching and Kubernetes-native autoscaling.

Key Performance Features:

  • GPU acceleration: 10-50ms inference times (10x faster than CPU)
  • Intelligent batching: 10x+ throughput with dynamic batch sizes
  • Advanced Kubernetes scaling: HPA, VPA, and KEDA support
  • Production monitoring: Prometheus metrics and real-time observability

MonitorX

ML Platform Observability Production Ready

Comprehensive ML/AI infrastructure observability platform with zero-code monitoring, intelligent alerting, and real-time drift detection. Provides complete visibility into production ML systems with enterprise-grade dashboards and automated model health monitoring.

Key Features:

  • Real-time model performance monitoring and drift detection
  • Intelligent multi-channel alerting with automated remediation
  • Interactive dashboards with A/B testing and model comparison
  • Cost optimization insights and resource utilization tracking

MonitorX Dashboard

GPT-4 Inference Healthy
245ms
avg latency
Image Classification Warning
87%
GPU usage
Model Drift Detection
Active

AerialView

Analytics

Interactive stock market analytics dashboard with real-time visualizations, candlestick charts, and technical indicators powered by Streamlit.

FairTune

AI Ethics Research Tool

LLM fine-tuning and fairness evaluation platform with interactive Streamlit dashboards. Helps researchers identify, measure, and mitigate bias in language models.

Technology Stack

Technologies and tools I use to build scalable, reliable systems

Languages

Python
Go
Rust
TypeScript

Infrastructure

Kubernetes
Docker
Terraform
AWS/GCP

Data & ML

PyTorch
Apache Spark
Kafka
Redis

Monitoring

Prometheus
Grafana
Jaeger
ELK Stack