|
i'm an AI/ML engineer based in the US, currently building production AI systems at Reallytics.ai and Verticiti. most of my work revolves around getting large language models to do useful things in production — not toy demos, actual systems handling real traffic. before this, i spent years at Afiniti and Cloud Kinetics doing the grunt work of making ML models reliable at scale. fraud detection, voice analytics, enterprise search — the kind of stuff that breaks at 3am and you have to fix. what keeps me going: that moment when an AI agent you built actually solves a problem you didn't explicitly program it for. still hits different every time. right now i'm deep into:
|
|
|
Agentic AI Workflows — Production AI Agents |
RAG Enterprise Search — Retrieval-Augmented Generation |
|
Voice AI Platform — Real-Time Speech AI |
LLM Fine-Tuning (LoRA/QLoRA) — Parameter-Efficient Fine-Tuning |
|
RLHF LLM Optimization — Reinforcement Learning from Human Feedback |
Sentinel Fraud Detection — Explainable AI |
i'm not going to pretend i use everything equally. here's what i actually reach for day-to-day:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM & GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| vector & data | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud & MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i commit a lot. sometimes it's good code, sometimes it's "fix: typo in typo fix".
i publish research notes daily — not polished papers, just honest writeups of what i'm learning and building. think of it as a public lab notebook for generative AI, LLM fine-tuning, RAG, and agentic systems.
|
Streaming Model Inference For Real Time Applicatio
|
Fine Tuned Llms For Enterprise Retrieval Augmented
|
|
Explainable Ai Xai For Trustworthy Models
|
Edge Ai For Real Time Inference
|
📝 Opened issue [Feature] Automatic LoRA rank recommendation based on datase in axolotl-ai-cloud/axolotl (2026-04-24)
💬 Commented on Integrate SAM3-LiteText to Ultralytics in ultralytics/ultralytics (2026-04-24)
💬 Commented on Crazy Logging I want to shut it down in NVIDIA-NeMo/NeMo (2026-04-24)
💬 Commented on Regression in 1.1.7 (#7498): Second regenerate from latest c in langchain-ai/langgraph (2026-04-24)
💬 Commented on Issue with the custom nodes. in modal-labs/modal-examples (2026-04-24)
💬 Commented on [Model Request] Support Gemma4 in mlc-ai/mlc-llm (2026-04-24)
💬 Commented on AuxiliaryTrainingWrapper.forward requires positional x, br in huggingface/peft (2026-04-24)
💬 Commented on Error: Gemini 3 Pro - Unknown error in continuedev/continue (2026-04-24)
topics discovered daily by a multi-model AI research engine (GPT-4.1, Grok-3, DeepSeek R1, Llama-4)
🔬 Efficient Model Serving with Quantization and Distillation
🔬 Streaming Model Inference for Real-Time Applications
🔬 Fine-Tuned LLMs for Enterprise Retrieval-Augmented Generation (RAG)
🔬 Synthetic Data Generation for ML Training
🔬 Explainable AI (XAI) for Trustworthy Models
🔬 Edge AI for Real-Time Inference
📌 Async Retry Pattern with Exponential Backoff — Production Pattern (Python) (2026-04-24)
📌 Async Retry Pattern with Exponential Backoff — Production Pattern (Python) (2026-04-24)
📌 RAG Relevance Scorer using Cross-Encoder — Production Pattern (Python) (2026-04-23)
🤖 Profile auto-updated on 2026-04-24 19:03 UTC