publications
All publications.
2026
- TimeSage-EV: A Live Benchmark for Agentic Time Series Analysis in Evolving EnvironmentsIn Under review at EMNLP 2026, 2026
- PlaySuite: A Large-Scale Benchmark for Interactive Visual IntelligenceIn Under review at NeurIPS 2026, 2026
- TimeSage-MT: A Multi-Turn Benchmark for Evaluating Agentic Time Series ReasoningIn Under review at NeurIPS 2026, 2026
- Route, Reuse, Repurpose: Continual Adaptation of LLMs with Bounded Adapter PoolsIn Continual Adaptation at Scale Workshop (ICML 2026), 2026
- CaLLM: Continual Adaptation of LLMs via Non-Parametric Adapter RoutingIn Under review at CoLLAs 2026, 2026