Product Engineer • Backend Engineer — GenAI Specialist
Django/DRF • PostgreSQL • Redis • LangChain • RAG • Agents
🔗 Portfolio • 💼 LinkedIn • 🧩 LeetCode • ✉️ muazzamali741@gmail.com
- ✅ Own backend delivery for high-traffic workflows serving 10,000+ daily users.
- ⚡ Reduced API latency by 25% via DRF endpoint tuning + PostgreSQL optimizations.
- 🗄️ Built Redis caching strategies (hot-path caching + safe invalidation) to reduce DB pressure.
- 🚦 Implemented Redis sliding-window rate limiting to protect critical endpoints and stabilize p95 during spikes.
- 🧠 Built AI-ready backend interfaces: stable schemas, deterministic error contracts, and clean service boundaries to support LLM/agent integrations without breaking consumers.
- 🔎 Improved reliability for heavy workloads using background-job patterns (timeouts, retries, idempotency principles) plus observability (latency + failure reason logs).
- 🧩 Delivered 12 production features in 3 months (Agile/Scrum), improving sprint velocity by 15%.
- 🗄️ Reduced direct DB query load by 30% using multi-layer Redis caching.
- 🧱 Standardized validation + error handling across endpoints to reduce regressions and integration friction.
- 🤝 Built integration-friendly service layers that make it easier to attach ML/LLM inference later (classification/extraction/ranking) without refactoring core APIs.
- 📊 Built ML pipelines (TensorFlow + scikit-learn), achieving 85% validation accuracy.
- 📈 Improved model performance by 20% via transfer learning + iterative experimentation.
- 🔁 Designed repeatable train/eval workflows: clean splits, metric tracking, and error analysis to guide improvements.
- 📦 Packaged inference into clean interfaces (single + batch prediction) to support product/backend integration.
I build production-grade systems where backend engineering meets AI/ML:
- 🏗️ Scalable APIs — Django/DRF + PostgreSQL serving thousands of users
- 🤖 AI Integration — RAG pipelines, LLM orchestration with LangChain
- ⚡ Performance — Redis caching, Celery task queues, rate limiting at scale
- 🔒 Security — JWT auth, OAuth2, domain/IP whitelisting
- Deployed 15+ REST APIs reducing latency by 25% (800ms → 600ms)
- Built RAG chatbot with dynamic LLM swapping using LangChain + Ollama
- Implemented caching & rate limiting to handle 5× traffic spikes
- Reduced DB load by 30% through query optimization
Backend & APIs
Django DRF FastAPI Flask PostgreSQL Redis Celery
AI/ML & LLMs
LangChain LlamaIndex OpenAI API TensorFlow Scikit-learn YOLOv8 OpenCV
DevOps & Cloud
AWS (EC2, Lambda, CloudWatch) Docker Firebase CI/CD Git
Additional
JWT OAuth2 Rate Limiting Vector Databases Microservices
RAG pipeline using LangChain & LlamaIndex with local LLM swapping via Ollama
Python LangChain RAG Vector DB Hugging Face
Real-time helmet violation detection using YOLOv8 + OpenCV with automated logging
YOLOv8 OpenCV Streamlit Computer Vision
NLP system with 92% accuracy using TensorFlow & Scikit-learn
NLP TensorFlow Scikit-learn Streamlit
Content-based filtering using TF-IDF and cosine similarity
Machine Learning NLP NLTK Streamlit
RESTful API with JWT authentication and SQLAlchemy ORM
Flask JWT SQLite REST API
- ✅ Machine Learning Specialization – Stanford University (Coursera)
- ✅ Computational Thinking & Data Science – MIT (6.00.2x)
- ✅ Introduction to CS with Python – MIT (6.00.1x)
- ✅ CS50: Introduction to Computer Science – Harvard University
- 📧 muazzamali741@gmail.com
- 🐦 Open to collaboration on AI/ML and backend projects!
⚡ "Building systems that scale, one API at a time"