news

May 27, 2025 We are very excited to release MARTI: A framework for LLM-based Multi-Agent Reinforced Training and Inference (see MARTI).
May 16, 2025 Two papers are accepted to ACL 2025 Main. Congrats to the collaborators.
May 14, 2025 Just shared our latest work on TTS, RL and TTRL at QingkeTalk.
May 02, 2025 Four papers are accepted to ICML 2025. Congrats to the collaborators.
Apr 23, 2025 We release Test-time Reinforcement Learning (TTRL), which investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in LLMs (see TTRL).
Mar 31, 2025 We release collections of RL recipes (see Awesome-RL-Reasoning-Recipes).
Mar 24, 2025 Video-T1 is released, which is the first to evaluate TTS on video generation (see Video-T1).
Feb 10, 2025 We explore compute-optimal test-time scaling (see compute-optimal-tts).
Jan 23, 2025 One first-author paper is accepted to ICLR 2025 (see OpenPRM).
Dec 24, 2024 One paper is accepted to AAAI 2025 (Congrats to Xinwei).
Sep 27, 2024 One first-author paper is accepted to NeurIPS 2024 D&B Track (see UltraMedical).
Sep 20, 2024 One paper is accepted to EMNLP 2024 (see LPA).
Jul 10, 2024 One co-first-author paper is accepted to COLM 2024 (see LLM4BioHypoGen).
May 16, 2024 Two papers are accepted to ACL 2024 (one first-author; see CoGenesis).
Mar 13, 2024 One paper is accepted to NAACL 2024 (see PAD).
Oct 06, 2023 One first-author paper is accepted to EMNLP 2023 (see CRaSh).