publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2024

  1. From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline
    Tianle Li, Wei-Lin Chiang, Evan Frick, Lisa Dunlap, and 4 more authors
    Under Review, Jun 2024
  2. Athene-70B: Redefining the Boundaries of Post-Training for Open Models
    Evan Frick*, Peter Jin*, Tianle Li*, Karthik Ganesan, and 3 more authors
    Jul 2024
  3. How to Evaluate Reward Models for RLHF
    Evan Frick, Tianle Li, Connor Chen, Wei-Lin Chiang, and 5 more authors
    Under Review, Nov 2024
  4. Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
    Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael Jordan, Joseph E. Gonzalez, Ion Stoica
    ICML, Mar 2024

2023

  1. LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
    Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang
    ICLR Spotlight, Sep 2023