May 27, 2025 | We are very excited to release MARTI: A framework for LLM-based Multi-Agent Reinforced Training and Inference. (see MARTI ). |
May 16, 2025 | Two papers are accepted to ACL 2025 Main, congrats to the collaborators. |
May 14, 2025 | Just shared our latest work on TTS, RL and TTRL at QingkeTalk. |
May 02, 2025 | Four papers are accepted to ICML 2025, congrats to the collaborators. |
Apr 23, 2025 | We release Test-time Reinforcement Learning (TTRL), which investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in LLMs. (see TTRL ). |
Mar 31, 2025 | We release collections of RL recipes (see Awesome-RL-Reasoning-Recipes ). |
Mar 24, 2025 | Video-T1 is released, which firstly evaluate TTS on video generation (see Video-T1 ). |
Feb 10, 2025 | We explore compute-optimal test-time scaling (see compute-optimal-tts ). |
Jan 23, 2025 | One first-author paper is accepted to ICLR 2025 (see OpenPRM). |
Dec 24, 2024 | One paper is accepted to AAAI 2025 (Congrats to Xinwei). |
Sep 27, 2024 | One first-author paper is accepted to NeurIPS 2024 D&B Track (see UltraMedical ). |
Sep 20, 2024 | One paper is accepted to EMNLP 2024 (see LPA). |
Jul 10, 2024 | One co-first author paper is accepted to COLM 2024 (see LLM4BioHypoGen). |
May 16, 2024 | Two papers are accepted to ACL 2024 (One first-author, see CoGenesis). |
Mar 13, 2024 | One paper is accepted to NAACL 2024 (see PAD). |
Oct 06, 2023 | One first-author paper is accepted to EMNLP 2023 (see CRaSh). |