AI模型与基准测试
模型竞赛:GPT、Gemini、Claude、Llama……谁在每项基准测试中领先AI排名?
AI模型与基准测试关闭于 8d 13h
Will a new AI model appear in the top 3 of the MMLU benchmark or equivalent published leaderboard (Hugging Face Open LLM Leaderboard or other recognized one) between June 10–20, 2026?
AI模型与基准测试关闭于 8d 13h
Will Meta AI or Mistral AI release a new open-source language model with more than 70 billion parameters between June 10–20, 2026?
AI模型与基准测试关闭于 111d 5h
Which company's model will top the LMSYS Chatbot Arena ranking on September 30, 2026?
AI模型与基准测试关闭于 203d 5h