|
i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic. before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks. honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old. what i'm working on right now:
|
|
|
Agentic AI Workflows |
RAG Enterprise Search |
|
Voice AI Platform |
LLM Fine-Tuning LoRA |
|
RLHF LLM Optimization |
Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
Automl For Time Series Forecasting
|
Automated Machine Learning For Time Series Forecas
|
Production Scale Retrieval Augmented Generation R
|
Fine Tuning Large Language Models With Parameter E
|
💬 Commented on Streaming tool_call deltas with duplicate indexes in first c in openai/openai-python (2026-05-11)
💬 Commented on Gemma4-E2B/E4B: passing inputs_embeds triggers an extremel in huggingface/transformers (2026-05-11)
💬 Commented on OpenReward binding only discovers shared /tools, misses ta in huggingface/trl (2026-05-11)
💬 Commented on Documentation (at least Google-related) is an outdated mess. in 567-labs/instructor (2026-05-11)
💬 Commented on YI:9b在长上下下回答异常 in 01-ai/Yi (2026-05-11)
💬 Commented on Привет, ребят. Очень прошу добавить долговременную память ме in deepseek-ai/DeepSeek-V3 (2026-05-11)
💬 Commented on The full dataset viewer is not available (click to read why) in huggingface/datasets (2026-05-11)
⭐ Starred XGenerationLab/XiYan-SQL (2026-05-11)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 AutoML for Time Series Forecasting
🔬 Edge AI for Real-Time Computer Vision
🔬 Automated Machine Learning for Time Series Forecasting
🔬 Real-Time Multimodal AI Applications (Vision + Language)
🔬 Fine-Tuning Large Language Models with Parameter-Efficient Techniques (LoRA/QLoRA)
🔬 Production-Scale Retrieval-Augmented Generation (RAG) for Enterprise Search
📌 Real-Time Feature Store Client — Production Pattern (Python) (2026-05-11)
📌 Agent Tool Registry with Dynamic Discovery — Production Pattern (Python) (2026-05-11)
📌 Multi-Provider LLM Router with Fallback — Production Pattern (Python) (2026-05-09)
🤖 Profile auto-updated on 2026-05-11 19:57 UTC


