Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

Whereas Claude Opus 4 will probably be restricted to paying Anthropic clients, a second mannequin, Claude Sonnet 4, will probably be accessible for each paid and free tiers of customers. Opus 4 is being marketed as a strong, giant mannequin for advanced challenges, whereas Sonnet 4 is described as a sensible, environment friendly mannequin for on a regular basis use.

Each of the brand new fashions are hybrid, which means they will supply a swift reply or a deeper, more reasoned response relying on the character of a request. Whereas they calculate a response, each fashions can search the net or use different instruments to enhance their output.

AI firms are presently locked in a race to create actually helpful AI agents which might be capable of plan, purpose, and execute advanced duties each reliably and free from human supervision, says Stefano Albrecht, director of AI on the startup DeepFlow and coauthor of Multi-Agent Reinforcement Studying: Foundations and Trendy Approaches. Typically this includes autonomously utilizing the web or different instruments. There are nonetheless security and safety obstacles to beat. AI brokers powered by giant language fashions can act erratically and perform unintended actions—which turns into much more of an issue after they’re trusted to behave with out human supervision.

“The extra brokers are capable of go forward and do one thing over prolonged intervals of time, the extra useful they are going to be, if I’ve to intervene much less and fewer,” he says. “The brand new fashions’ potential to make use of instruments in parallel is fascinating—that might save a while alongside the best way, in order that’s going to be helpful.”

For instance of the types of issues of safety AI firms are nonetheless tackling, brokers can find yourself taking sudden shortcuts or exploiting loopholes to achieve the targets they’ve been given. For instance, they could e-book each seat on a aircraft to make sure that their consumer will get a seat, or resort to creative cheating to win a chess game. Anthropic says it managed to cut back this habits, referred to as reward hacking, in each new fashions by 65% relative to Claude Sonnet 3.7. It achieved this by extra carefully monitoring problematic behaviors throughout coaching, and enhancing each the AI’s coaching atmosphere and the analysis strategies.

Source link

OpenAI is throwing everything into building a fully automated researcher

DataRobot + Nebius: An enterprise-ready AI Factory optimized for agents

The Pentagon is making plans for AI companies to train on classified data, defense official says

The US may be heading toward a drone-filled future

Mechanistic interpretability: 10 Breakthrough Technologies 2026

At MIT, a continued commitment to understanding intelligence | MIT News

Microsoft har förvandlat Edge till en AI-webbläsare med Copilot-läge

The Machine Learning “Advent Calendar” Day 4: k-Means in Excel

Most Popular

A Tale of Two Variances: Why NumPy and Pandas Give Different Answers

MIT researchers propose a new model for legible, modular software | MIT News

3 Questions: Using computation to study the world’s best single-celled chemists | MIT News

Our Picks

The Math That’s Killing Your AI Agent

Building Robust Credit Scoring Models (Part 3)

How to Measure AI Value

Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

Related Posts