Google's Alphabet And Amazon-Backed Anthropic Lead Effort To Redefine AI Evaluation Standards

Benzinga Neuro, Benzinga Staff Writer

July 02, 2024 12:45am

Anthropic, a prominent AI startup backed by Alphabet Inc (NASDAQ:GOOG) (NASDAQ:GOOGL) search arm Google and by Amazon.com Inc (NASDAQ:AMZN), has announced a new initiative to fund the development of advanced AI benchmarks. The program aims to address the shortcomings of current AI benchmarking and provide a more comprehensive evaluation of AI models.

What Happened: Anthropic’s new program, revealed on Monday, will allocate funds to third-party organizations capable of creating benchmarks that can effectively evaluate the performance and impact of AI models, including generative models like Anthropic’s own Claude. Interested parties can submit applications, which will be evaluated on an ongoing basis.

Anthropic’s official blog post stated, “Our investment in these evaluations is intended to elevate the entire field of AI safety, providing valuable tools that benefit the whole ecosystem. Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply.”

The current AI benchmarks are criticized for their inability to accurately represent how the average person uses the systems being tested. There are also concerns about the relevance of older benchmarks in measuring modern generative AI.

Anthropic’s solution is to create new, more challenging benchmarks that focus on AI security and societal implications. The company is calling for tests that evaluate a model’s ability to carry out tasks such as cyberattacks, weapon enhancement, and manipulation or deception of people through deepfakes or misinformation.

The company also aims to develop an “early warning system” for AI risks related to national security and defense. Anthropic’s program will also support research into benchmarks and “end-to-end” tasks that explore AI’s potential for aiding in scientific study, conversing in multiple languages, mitigating ingrained biases, and self-censoring toxicity.

Why It Matters: The launch of this funding program follows Anthropic’s recent unveiling of its most advanced AI model, Claude 3.5 Sonnet. This model, which outperforms its predecessor, demonstrates Anthropic’s commitment to pushing the boundaries of AI technology.

Moreover, Dario Amodei, CEO of Anthropic, has been vocal about the broader implications of AI on society. In an interview with Time Magazine in June, Amodei emphasized the need for a more comprehensive solution than Universal Basic Income to tackle AI-induced inequality. This reflects Anthropic’s focus on ensuring that AI advancements benefit the wider public.

Additionally, Anthropic has been involved in controversies, such as the alleged disregard for web scraping rules. This controversy highlights the ethical challenges that AI companies face in their quest for data to train their models.


This story was generated using Benzinga Neuro and edited by Kaustubh Bagalkote


