Agentic Memory
Continual learning of AI agents — in-context learning, continual fine-tuning, and unlearning.
Continual learning of AI agents — in-context learning, continual fine-tuning, and unlearning.
In-context world models, adaptation to post-training task worlds, and adapting agents in evolving envs.
Efficient attention architectures, KV-cache compression, latent segmentation, and recurrent transformers.
Synthetic data training, risks of multi-agent interaction, post-training guardrails, and AI behavioral study.
A document-level unlearning method that combines self-distillation on retain data with entropy demotion on the forget set. Removes targeted knowledge from LLMs without catastrophic damage to unrelated capabilities.
viewPersona effectiveness is task-type dependent: expert prompts consistently improve alignment-dependent tasks (safety, preference) but reliably damage pretraining-dependent knowledge retrieval. PRISM teaches models when to invoke a persona via intent-based self-modeling, preserving accuracy while keeping alignment gains.
viewA two-pass attention scheme that dynamically selects which KV-cache tokens to attend to per query, enabling long-context inference at a fraction of the standard memory footprint.
Studies how vision-language models degrade when fine-tuned on AI-generated multimodal data. Characterizes the collapse dynamics specific to the multimodal regime and proposes mitigation strategies that preserve diversity across modalities.
viewA brain-inspired MLP architecture with hemispheric lateralization applied to diffusion models. Shows competitive sample quality at reduced parameter count, suggesting structured asymmetry as an inductive bias for generative modeling.
viewA more efficient attention variant for vision transformers that pre-computes a static key projection, reducing per-token compute while maintaining downstream task performance.
view