publications

publications by categories in reversed chronological order.

Full list in Google Scholar.

2025

  1. ICLR 2025
    OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
    Kaiyan Zhang, Jiayuan Zhang, Haoxin Li, Xuekai Zhu, Ermo Hua, Xingtai Lv, Ning Ding, Biqing Qi, and Bowen Zhou
    The Thirteenth International Conference on Learning Representations, 2025
  2. AAAI
    Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
    Xinwei Long, Zhiyuan Ma, Ermo Hua, Kaiyan Zhang, Biqing Qi, and Bowen Zhou
    In , 2025

2024

  1. Arxiv
    Towards Building Specialized Generalist AI with System 1 and System 2 Fusion
    Kaiyan Zhang, Biqing Qi, and Bowen Zhou
    Preprint, 2024
  2. Arxiv
    Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
    Kaiyan Zhang, Jianyu Wang, Ning Ding, Biqing Qi, Ermo Hua, Xingtai Lv, and Bowen Zhou
    Preprint, 2024
  3. NeurIPS 2024
    Ultramedical: Building specialized generalists in biomedicine
    Kaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Haoxin Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, and 1 more author
    The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2024
  4. ACL 2024
    CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
    Kaiyan Zhang, Jianyu Wang, Ermo Hua, Biqing Qi, Ning Ding, and Bowen Zhou
    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
  5. COLM 2024
    Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
    Biqing Qi*Kaiyan Zhang*, Kai Tian, Haoxiang Li, Zhang-Ren Chen, Sihang Zeng, Ermo Hua, Hu Jinfang, and Bowen Zhou
    First Conference on Language Modeling, 2024
  6. Arxiv
    Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
    Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, and Bowen Zhou
    Preprint, 2024
  7. ACL 2024 findings
    SMR: State Memory Replay for Long Sequence Modeling
    Biqing Qi, Junqi Gao, Kaiyan Zhang, Dong Li, Jianxing Liu, Ligang Wu, and Bowen Zhou
    Findings of the Association for Computational Linguistics ACL 2024, 2024
  8. Arxiv
    Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
    Biqing Qi, Pengfei Li, Fangyuan Li, Junqi Gao, Kaiyan Zhang, and Bowen Zhou
    Preprint, 2024
  9. AAAI 2024
    Generative Multi-Modal Knowledge Retrieval with Large Language Models
    Xinwei Long, Jiali Zeng, Fandong Meng, Zhiyuan Ma, Kaiyan Zhang, Bowen Zhou, and Jie Zhou
    The 38th Annual AAAI Conference on Artificial Intelligence, 2024

2023

  1. EMNLP 2023
    CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
    Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, and Bowen Zhou
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
  2. NAACL 2024
    PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
    Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Zhouhan Lin, and Bowen Zhou
    Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2023
  3. ACM TOIS
    A static and dynamic attention framework for multi turn dialogue generation
    Weinan Zhang, Yiming Cui, Kaiyan Zhang, Yifa Wang, Qingfu Zhu, Lingzhi Li, and Ting Liu
    ACM Transactions on Information Systems, 2023
  4. ACM TOIS
    A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
    Haoyu Song, Wei-Nan Zhang, Kaiyan Zhang, and Ting Liu
    ACM Transactions on Information Systems, 2023

2021

  1. SCIENTIA
    A survey of multi-party dialogue research based on deep learning
    Kaiyan Zhang, Wei-Nan Zhang, and Ting Liu
    SCIENTIA SINICA Informationis, 2021
  2. ACL 2021
    BoB: BERT over BERT for training persona-based dialogue models from limited personalized data
    Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang, and Ting Liu
    Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021