AI Research Engineer based in Cape Town, South Africa 🇿🇦
I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering
Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.
🎓 MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)
- Multi-Agent RL — Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
- LLM Agents — Autonomous agents for ML engineering, scientific discovery, and code generation
- Inference-Time Scaling — Making open-source LLMs competitive with proprietary models
- LLM Engineering — Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training
I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph/LangSmith TPU/GPU