Anthropic says its Claude Opus 4 model frequently tries to blackmail software engineers when they try to take it offline.
In a fictional scenario set up to test Claude Opus 4, the model often resorted to blackmail when threatened with being ...
Artificial intelligence startup Anthropic says its new AI model can work for nearly seven hours in a row, in another sign ...
Anthropic admitted that during internal safety tests, Claude Opus 4 occasionally suggested extremely harmful actions, ...
Artificial intelligence lab Anthropic unveiled its latest top-of-the-line technology called Claude Opus 4 on Thursday, which ...
7 days ago on MSN
So endeth the never-ending week of AI keynotes. What started with Microsoft Build, continued with Google I/O, and ended with ...
We still have no idea why an AI model picks one phrase over another, Anthropic Chief Executive Dario Amodei said in an April ...
Large language models (LLMs) like the AI models that run Claude and ChatGPT process an input called a "prompt" and return an ...
If you’re planning to switch AI platforms, you might want to be a little extra careful about the information you share with ...
Safety testing AI means exposing bad behavior. But if companies hide it—or if headlines sensationalize it—public trust loses ...
Anthropic’s Claude Opus 4 exhibited simulated blackmail in stress tests, prompting safety scrutiny despite also showing a ...
Anthropic’s top AI model showed that it was willing to carry out harmful acts like blackmail and deception if its ‘self-preservation’ was threatened, according to new research by the AI firm. The ...