news | Kaiyan Zhang (张开颜)

Jan 26, 2026	Five papers are accepted to ICLR 2026, congrats to the collaborators.
Sep 19, 2025	TTRL was accepted to NeurIPS 2025, Congratulations!
Sep 11, 2025	Excited to share our new survey paper on RL for Large Reasoning Models .
Aug 21, 2025	One paper is accepted to EMNLP 2025 (see ReviewRL).
Aug 15, 2025	We investigate agentic search RL without reliance on external search engine while maintaining strong sim2real generalization. (see SSRL ).
Jun 26, 2025	Two papers are accepted to ICCV 2025, congrats to the collaborators.
May 27, 2025	We are very excited to release MARTI: A framework for LLM-based Multi-Agent Reinforced Training and Inference. (see MARTI ).
May 16, 2025	Two papers are accepted to ACL 2025 Main, congrats to the collaborators.
May 14, 2025	Just shared our latest work on TTS, RL and TTRL at QingkeTalk.
May 02, 2025	Four papers are accepted to ICML 2025, congrats to the collaborators.
Apr 23, 2025	We release Test-time Reinforcement Learning (TTRL), which investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in LLMs. (see TTRL ).
Mar 31, 2025	We release collections of RL recipes (see Awesome-RL-Reasoning-Recipes ).
Mar 24, 2025	Video-T1 is released, which firstly evaluate TTS on video generation (see Video-T1 ).
Feb 10, 2025	We explore compute-optimal test-time scaling (see compute-optimal-tts ).
Jan 23, 2025	One first-author paper is accepted to ICLR 2025 (see OpenPRM).
Dec 24, 2024	One paper is accepted to AAAI 2025 (Congrats to Xinwei).
Sep 27, 2024	One first-author paper is accepted to NeurIPS 2024 D&B Track (see UltraMedical ).
Sep 20, 2024	One paper is accepted to EMNLP 2024 (see LPA).
Jul 10, 2024	One co-first author paper is accepted to COLM 2024 (see LLM4BioHypoGen).
May 16, 2024	Two papers are accepted to ACL 2024 (One first-author, see CoGenesis).
Mar 13, 2024	One paper is accepted to NAACL 2024 (see PAD).
Oct 06, 2023	One first-author paper is accepted to EMNLP 2023 (see CRaSh).