Multi-Agent Systems
How multiple LLM and VLM agents work together to solve problems no single model can handle alone. My research explores agent collaboration, role specialization, debate-based verification, and orchestration frameworks that enable reliable, scalable multi-agent workflows — from simple pipelines to complex reasoning chains.
Key Research Topics
Multi-Agent Collaboration
How multiple LLM/VLM agents collaborate, debate, verify, and refine each other's outputs. Research on emergent communication protocols, consensus-building, and multi-agent self-play for improving reasoning and task completion quality.
Agent Orchestration
Building scalable frameworks for multi-agent pipelines — task routing, tool use, memory systems, and self-correction loops. How to design agent architectures that are reliable, composable, and can scale from single tasks to complex workflows.
Role Specialization
Training and prompting agents for distinct roles — critic, coder, researcher, planner — and studying how role assignment affects team performance. Research on when specialization outperforms generalist agents and how to dynamically allocate roles.
Debate & Verification
Using adversarial debate and cross-agent verification to improve output quality. Research on how agents can catch each other's mistakes, reduce hallucination through mutual critique, and produce more reliable final outputs.
Self-Improving Agents
Systems that generate their own training signal through synthetic data, self-reflection, and iterative refinement. Investigating feedback loops where agents evaluate their own outputs, generate preference pairs, and continuously improve without human annotation.
Evaluation & Benchmarking
Developing evaluation frameworks for multi-agent systems — measuring coordination efficiency, task decomposition quality, communication overhead, and emergent capabilities that arise from agent interaction at scale.
Related Work
PRISM: Multi-Agent Synthetic Data Pipeline
A multi-agent pipeline for generating persona-diverse synthetic data, combining intent-based routing with role-specialized agents for high-quality data curation.