
Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings

May 3, 2023

We present Chatbot Arena, a benchmark platform for large language models (LLMs) that features anonymous, randomized battles in a crowdsourced manner. In t...

By: Lianmin Zheng*, Ying Sheng*, Wei-Lin Chiang, Hao Zhang, Joseph E. Gonzalez, Ion Stoica, May 03, 2023

https://lmsys.org/blog/2023-05-03-arena/



