Research Direction

Multi-Agent Systems

How multiple LLM and VLM agents work together to solve problems no single model can handle alone. My research explores agent collaboration, role specialization, debate-based verification, and orchestration frameworks that enable reliable, scalable multi-agent workflows — from simple pipelines to complex reasoning chains.

Key Research Topics

Multi-Agent Collaboration

How multiple LLM/VLM agents collaborate, debate, verify, and refine each other's outputs. Research on emergent communication protocols, consensus-building, and multi-agent self-play for improving reasoning and task completion quality.

Agent Orchestration

Building scalable frameworks for multi-agent pipelines — task routing, tool use, memory systems, and self-correction loops. How to design agent architectures that are reliable, composable, and can scale from single tasks to complex workflows.

Role Specialization

Training and prompting agents for distinct roles — critic, coder, researcher, planner — and studying how role assignment affects team performance. Research on when specialization outperforms generalist agents and how to dynamically allocate roles.

Debate & Verification

Using adversarial debate and cross-agent verification to improve output quality. Research on how agents can catch each other's mistakes, reduce hallucination through mutual critique, and produce more reliable final outputs.

Self-Improving Agents

Systems that generate their own training signal through synthetic data, self-reflection, and iterative refinement. Investigating feedback loops where agents evaluate their own outputs, generate preference pairs, and continuously improve without human annotation.

Evaluation & Benchmarking

Developing evaluation frameworks for multi-agent systems — measuring coordination efficiency, task decomposition quality, communication overhead, and emergent capabilities that arise from agent interaction at scale.

Related Work

Active Project

PRISM: Multi-Agent Synthetic Data Pipeline

A multi-agent pipeline for generating persona-diverse synthetic data, combining intent-based routing with role-specialized agents for high-quality data curation.

Ongoing