ability to evaluate the performance and impact of artificial intelligence and make use of the resulting data.
Anthropic is an AI research company whose primary purpose is to find ways in which artificial intelligence can function safely. The company is based in San Francisco and is made up of teams with experience across ML, physics, product, and more. Its goal is to discover the right techniques and practices to keep AI use safe.
To that end, Anthropic will launch a program meant to fund the development of these new benchmarks. In addition, the program will allocate payments to third-party organizations that can help sustain this effort.
“A robust, third-party evaluation ecosystem is essential for assessing AI capabilities and risks, but the current evaluation landscape is limited. Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply,” Anthropic wrote in a blog post on its website.
The company also points to a benchmarking shortcoming: the way AI is actually used by everyday users. How do people interact with the software, and how can those interactions be tested appropriately?
Anthropic proposes to direct funding toward its highest-priority areas, such as AI safety level assessments, advanced capability and safety metrics, and “infrastructure, tools, and methods for developing evaluations”. These tests aim to provide a higher level of protection for all users and minimize the risks for everyone.
“We offer a range of funding options tailored to the needs and stage of each project,” Anthropic writes in the blog post, further stating that “Teams will have the opportunity to interact directly with Anthropic’s domain experts from the frontier red team, fine-tuning, trust and safety, and other relevant teams.”
In the same post, Anthropic writes that it hopes to be “a catalyst for progress towards a future where comprehensive AI evaluation is an industry standard”, expressing hope for a future in which AI does not cause harm or problems for our society.