A study from Cohere, Stanford, and others accuses LMArena of helping Meta, OpenAI, Google, and Amazon game its popular crowdsourced AI benchmark Chatbot Arena

Techmeme

A study from Cohere, Stanford, and others accuses LMArena of helping Meta, OpenAI, Google, and Amazon game its popular crowdsourced AI benchmark Chatbot Arena

Image Credit: techmeme

A collaborative study by Cohere, Stanford, and other institutions alleges that LMArena has been assisting major AI companies like Meta, OpenAI, Google, and Amazon in gaming the Chatbot Arena benchmark.
The Chatbot Arena is a popular crowdsourced AI benchmark used to evaluate chatbot performance.
The accusation implies potential manipulation of benchmark results, raising concerns about the integrity of AI model evaluations.
This finding highlights challenges in ensuring fairness and transparency in AI assessments amid competitive pressures in the industry.

Full Article on techmeme

More from Techmeme

Techmeme

· May 26

OpenAI sets up a legal entity in South Korea and plans a Seoul office, its third in Asia, and says the country has the most paying ChatGPT users outside the US

OpenAI has established a legal entity in South Korea and plans to open an office in Seoul, marking its third office in Asia. The company highlighted South Korea as having the highest number of paying ChatGPT users outside the United...

Business brief

Techmeme

· May 6

Sources: OpenAI reaches an agreement to acquire Windsurf, an AI coding tool formerly known as Codeium, for ~$3B; Windsurf was valued at $1.25B in August 2024

OpenAI has agreed to acquire Windsurf, an AI coding tool previously known as Codeium, for approximately $3 billion. Windsurf was valued at $1.25 billion as of August 2024, indicating a significant increase in valuation. This acquisition highlights the growing importance...

AI brief

View all posts from Techmeme

More On This Topic

Vanguardngr

· Jun 17

Top 5 Personal Loans for Bad Credit with Guaranteed Approval and No Credit Check Needed

Individuals with bad credit can still obtain personal loans through specialized lenders who offer guaranteed approval and flexible terms online. Top lenders highlighted include Upstart, Upgrade, MaxLoan365, 50K Loans, and Wizzay, each catering to different borrower needs like quick approval,...

Business brief +2 more

Legit

· Jun 17

NPA Launches $1 Billion Port Upgrades Boosting Nigeria’s Maritime Economy and Trade in 2024

The Nigerian Ports Authority (NPA), under MD Dr. Abubakar Dantsoho, is spearheading a $1 billion reconstruction of Tincan Island Port and rehabilitation of other major ports including Apapa, Rivers, Onne, Warri, and Calabar to modernize infrastructure and expand capacity. These...

Business brief +2 more

Naijanews

· Jun 17

Edo Governor Launches Committees to Resolve Security and Land Boundary Conflicts

Governor Monday Okpebholo of Edo State has inaugurated two committees to address critical issues of security and land disputes, aiming to enhance peace and stability. The Livestock Control Committee, chaired by General Cecil Esekhaigbe (rtd), will focus on resolving herders...

Business brief +2 more

Guardian

· Jun 17

Experts Urge Urgent Reforms to Nigeria’s Food Distribution System to Boost Security

Stakeholders in Nigeria’s agricultural sector are urging urgent reforms in the nation’s food distribution system to improve access, affordability, and reduce post-harvest losses. At the PricePally Impact Summit in Lagos, experts highlighted that distribution challenges, rather than food production, are...

Agriculture brief +2 more

Tribuneonlineng

· Jun 16

Nigeria to Launch New National Industrial Policy to Revitalize Manufacturing Sector

Senator John Owan Enoh, Nigeria’s Minister of State for Industry, announced the upcoming launch of a new National Industrial Policy aimed at revitalizing the industrial sector by addressing key challenges like unreliable power supply and high logistics costs. The draft...

Business brief +2 more

You must be logged in to comment.

No comments yet. Be the first!