
AI Researcher · PhD Student at USC · GLAMOUR Lab · MINDS · MOVE Fellow
“Understanding how LLMs remember and forget—then using that knowledge to build faster, leaner inference and better training data.”
— Zizhao Hu, PhD Student at USC · GLAMOUR Lab & MINDS Group
How LLMs remember & forget — unlearning, KV-cache management, continual learning, and reasoning under memory constraints
Efficient attention, KV-cache compression, sparse & low-rank methods for faster, leaner LLM serving at scale
Generate-validate pipelines, quality filtering, model-collapse prevention, and safety-oriented data curation for LLM training
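As a concrete illustration of the generate-validate theme, here is a minimal sketch of a synthetic-data curation loop: generate candidate samples, score them with a quality and safety check, and deduplicate to guard against the dataset collapsing onto a few repeated generations. The generator and scorer below (generate_candidates, quality_score) are hypothetical stand-ins, not code from any specific paper or pipeline.

```python
# Toy generate-validate data curation loop (illustrative only).
# generate_candidates() and quality_score() are hypothetical stand-ins
# for a generator model and a validator / reward / safety model.
from __future__ import annotations

import hashlib
import random


def generate_candidates(prompt: str, n: int) -> list[str]:
    """Hypothetical generator: stands in for sampling an LLM n times."""
    return [f"{prompt} :: draft answer {random.randint(0, 9)}" for _ in range(n)]


def quality_score(sample: str) -> float:
    """Hypothetical validator: stands in for a quality / safety scorer."""
    return random.random()


def curate(prompts: list[str],
           n_per_prompt: int = 4,
           min_score: float = 0.5) -> list[str]:
    """Generate, validate, and deduplicate synthetic training samples.

    Deduplication by content hash is one simple guard against the
    training set collapsing onto a few repeated generations.
    """
    seen: set[str] = set()
    kept: list[str] = []
    for prompt in prompts:
        for sample in generate_candidates(prompt, n_per_prompt):
            digest = hashlib.sha256(sample.encode()).hexdigest()
            if digest in seen:
                continue                      # drop exact duplicates
            if quality_score(sample) < min_score:
                continue                      # drop low-quality / unsafe samples
            seen.add(digest)
            kept.append(sample)
    return kept


if __name__ == "__main__":
    print(curate(["What is a KV cache?"], n_per_prompt=8))
```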
Natural selection shapes the newborn brain's wiring and topology
Architecture & pretraining shape the model's initial weights
Guided learning forms task-based memory and skills
SFT builds task-specific skills through curated instruction
Sleep consolidates memory — replaying, pruning, strengthening
KV-cache management consolidates context — evicting, compressing, retaining (sketched below)
Short-term and long-term memory store and retrieve knowledge
KV cache as dynamic context-based weights; model weights as permanent storage
Real-world feedback refines intuition and adapts behavior
Continual learning updates both KV (context) and weights (parameters)
Build tools — books, calculators — to extend cognition
RAG & tool use augment models with external knowledge
Diverse attempts, verified by outcomes, drive evolution
Diversity + verification: generate, verify, and improve
Decompose goals into subgoals and plan multi-step actions
Chain-of-thought & agentic planning decompose complex tasks
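To make the KV-consolidation analogy above concrete, here is a toy sketch of one common family of cache-eviction heuristics: keep a window of the most recent tokens plus the older positions that have received the most attention. The shapes, budgets, and scoring rule are illustrative assumptions, not a particular published method.

```python
# Toy KV-cache eviction: keep a recency window plus the most-attended
# older positions. Shapes and the importance score are illustrative
# assumptions, not a specific published method.
from __future__ import annotations

import numpy as np


def evict_kv(keys: np.ndarray,
             values: np.ndarray,
             attn_weights: np.ndarray,
             recent: int = 64,
             heavy: int = 64) -> tuple[np.ndarray, np.ndarray]:
    """Compress a KV cache down to at most `recent + heavy` positions.

    keys, values:  (seq_len, head_dim) cached projections
    attn_weights:  (num_queries, seq_len) attention each position has received
    """
    seq_len = keys.shape[0]
    if seq_len <= recent + heavy:
        return keys, values                        # nothing to evict yet

    # Accumulated attention mass per cached position ("importance").
    importance = attn_weights.sum(axis=0)

    recent_idx = np.arange(seq_len - recent, seq_len)          # always keep the tail
    older = np.arange(seq_len - recent)
    heavy_idx = older[np.argsort(importance[older])[-heavy:]]  # top-k older tokens

    keep = np.sort(np.concatenate([heavy_idx, recent_idx]))
    return keys[keep], values[keep]


if __name__ == "__main__":
    L, d = 512, 8
    k, v = np.random.randn(L, d), np.random.randn(L, d)
    attn = np.random.rand(32, L)
    k2, v2 = evict_kv(k, v, attn, recent=64, heavy=64)
    print(k2.shape)   # (128, 8): compressed cache
```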
Specialized AI will develop distinct memory profiles — just as human experts develop domain intuition. Diversity with verification is how both human societies and AI systems evolve.
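And a minimal sketch of the diversity-plus-verification loop itself: sample several diverse attempts, verify each by its outcome, keep the best, and feed a short summary back into the next round. propose and verify are hypothetical stand-ins for an LLM sampler and an outcome checker (unit tests, a reward model, human review).

```python
# Toy "diversity + verification" loop: sample diverse attempts, verify
# each, keep the best, and refine in later rounds. propose() and
# verify() are hypothetical stand-ins, not any specific system.
from __future__ import annotations

import random


def propose(task: str, feedback: str | None = None) -> str:
    """Hypothetical generator: one diverse attempt at the task."""
    suffix = f" (revised after: {feedback})" if feedback else ""
    return f"attempt-{random.randint(0, 999)} for {task}{suffix}"


def verify(attempt: str) -> float:
    """Hypothetical verifier: score an attempt by its outcome, 0..1."""
    return random.random()


def generate_verify_improve(task: str, n: int = 8, rounds: int = 3) -> str:
    best, best_score = "", -1.0
    feedback: str | None = None
    for _ in range(rounds):
        attempts = [propose(task, feedback) for _ in range(n)]   # diversity
        scored = [(verify(a), a) for a in attempts]              # verification
        score, attempt = max(scored)
        if score > best_score:
            best, best_score = attempt, score
        feedback = f"best so far scored {best_score:.2f}"        # improve next round
    return best


if __name__ == "__main__":
    print(generate_verify_improve("prove the lemma"))
```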