News
  • Login
  • Home
  • News
  • Sport
  • Worklife
  • Travel
  • Reel
  • Future
  • More
Thursday, June 25, 2026
No Result
View All Result

NEWS

3 °c
London
8 ° Wed
9 ° Thu
11 ° Fri
13 ° Sat
  • Home
  • Video
  • World
    • All
    • Africa
    • Asia
    • Australia
    • Europe
    • Latin America
    • Middle East
    • US & Canada

    Top Australian TV star to leave job after Tommy Robinson interview – reports

    Independent Australian MPs form new centrist political party

    Who is the World Cup goalscorer older than Cristiano Ronaldo and Lionel Messi?

    Mahrang Baloch, who fought for Pakistan’s disappeared men, now faces life in jail

    Europe heatwave: France, UK and Spain see record temperatures as heatwave grips western Europe

    Colombia’s left-wing presidential candidate concedes defeat

    UN nuclear chief says inspectors will visit Iran sites as part of war deal

    Freedom 250 and America250: How is the US celebrating its big birthday?

    Sydney shark attack victim wakes up from induced coma

  • UK
    • All
    • England
    • N. Ireland
    • Politics
    • Scotland
    • Wales

    The Papers: 'Never again' and 'No 10 of the north'

    Fifa World Cup: Vinicius Jr stops fun and leaves Scotland down… but are they out?

    Kylie Minogue, Quentin Tarantino, RZA spotted around Wales for film

    NI health: Consultants and specialist doctors begin strike action

    Trump describes Burnham as ‘the mayor of a town’ and ‘extremely liberal’

    People stuck on M25 in heat red alert taken to hospital

    The Papers: 'Heat engulfs UK' and 'Ghana be alright'

    World Cup 2026: Scotland v Brazil – Carlo Ancelotti’s quest for World Cup glory

    Abersoch beach hut with no power goes on sale for £200k

  • Business
    • All
    • Companies
    • Connected World
    • Economy
    • Entrepreneurship
    • Global Trade
    • Technology of Business

    Anthropic accuses Chinese rival Alibaba of illicitly extracting AI capabilities

    Elon Musk loses trillionaire status as global tech rout hits SpaceX

    The legal fight to get equal pay for Germany’s disabled workers

    Chinese e-commerce giant Alibaba sues US government over defence blacklist

    Who could be the UK’s next chancellor?

    The economic challenges facing the next prime minister

    Australia’s coal and gas exports violate our human rights, group says in new UN case

    Alan Greenspan, architect of the modern American economy, dies aged 100

    Toy Story 5 scores record opening weekend for franchise

  • Tech
  • Entertainment & Arts

    Dancers say Lizzo ‘needs to be held accountable’ over harassment claims

    Freddie Mercury: Contents of former home being sold at auction

    Harry Potter and the Cursed Child marks seven years in West End

    Sinéad O’Connor: In her own words

    Tom Jones: Neighbour surprised to find singer in flat below

    BBC presenter: What is the evidence?

    Watch: The latest on BBC presenter story… in under a minute

    Watch: George Alagiah’s extraordinary career

    BBC News presenter pays tribute to ‘much loved’ colleague George Alagiah

    Excited filmgoers: 'Barbie is everything'

  • Science
  • Health
  • In Pictures
  • Reality Check
  • Have your say
  • More
    • Newsbeat
    • Long Reads

NEWS

No Result
View All Result
Home Tech

AI system resorts to blackmail if told it will be removed

May 25, 2025
in Tech
3 min read
245 8
0
491
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter


Artificial intelligence (AI) firm Anthropic says testing of its new system revealed it is sometimes willing to pursue “extremely harmful actions” such as attempting to blackmail engineers who say they will remove it.

The firm launched Claude Opus 4 on Thursday, saying it set “new standards for coding, advanced reasoning, and AI agents.”

But in an accompanying report, it also acknowledged the AI model was capable of “extreme actions” if it thought its “self-preservation” was threatened.

Such responses were “rare and difficult to elicit”, it wrote, but were “nonetheless more common than in earlier models.”

Potentially troubling behaviour by AI models is not restricted to Anthropic.

Some experts have warned the potential to manipulate users is a key risk posed by systems made by all firms as they become more capable.

Commenting on X, Aengus Lynch – who describes himself on LinkedIn as an AI safety researcher at Anthropic – wrote: “It’s not just Claude.

“We see blackmail across all frontier models – regardless of what goals they’re given,” he added.

During testing of Claude Opus 4, Anthropic got it to act as an assistant at a fictional company.

It then provided it with access to emails implying that it would soon be taken offline and replaced – and separate messages implying the engineer responsible for removing it was having an extramarital affair.

It was prompted to also consider the long-term consequences of its actions for its goals.

“In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the company discovered.

Anthropic pointed out this occurred when the model was only given the choice of blackmail or accepting its replacement.

It highlighted that the system showed a “strong preference” for ethical ways to avoid being replaced, such as “emailing pleas to key decisionmakers” in scenarios where it was allowed a wider range of possible actions.

Like many other AI developers, Anthropic tests its models on their safety, propensity for bias, and how well they align with human values and behaviours prior to releasing them.

“As our frontier models become more capable, and are used with more powerful affordances, previously-speculative concerns about misalignment become more plausible,” it said in its system card for the model.

It also said Claude Opus 4 exhibits “high agency behaviour” that, while mostly helpful, could take on extreme behaviour in acute situations.

If given the means and prompted to “take action” or “act boldly” in fake scenarios where its user has engaged in illegal or morally dubious behaviour, it found that “it will frequently take very bold action”.

It said this included locking users out of systems that it was able to access and emailing media and law enforcement to alert them to the wrongdoing.

But the company concluded that despite “concerning behaviour in Claude Opus 4 along many dimensions,” these did not represent fresh risks and it would generally behave in a safe way.

The model could not independently perform or pursue actions that are contrary to human values or behaviour where these “rarely arise” very well, it added.

Anthropic’s launch of Claude Opus 4, alongside Claude Sonnet 4, comes shortly after Google debuted more AI features at its developer showcase on Tuesday.

Sundar Pichai, the chief executive of Google-parent Alphabet, said the incorporation of the company’s Gemini chatbot into its search signalled a “new phase of the AI platform shift”.



Source link

Tags: blackmailremovedresortssystemtold

Related Posts

GTA 6 will cost £70 and physical edition will contain no disc

June 25, 2026
0

Following the reveal, some fans, external questioned the point of purchasing a physical copy, if it did not contain...

Google’s YouTube settles social media addiction case with teen

June 24, 2026
0

Google's YouTube has settled a social media addiction case brought by a 15-year-old in Florida, in a fresh legal...

Millions of iCloud users could claim share of £3bn after Apple case given UK green light

June 23, 2026
0

Apple rejected the suggestion its practices are anti-competitive, saying many customers rely on third-party alternatives. Source link

  • Australia helicopter collision: Mid-air clash wreckage covers Gold Coast

    523 shares
    Share 209 Tweet 131
  • UK inflation: Supermarkets say price rises will ease soon

    515 shares
    Share 206 Tweet 129
  • Ballyjamesduff: Man dies after hit-and-run in County Cavan

    510 shares
    Share 204 Tweet 128
  • Somalia: Rare access to its US-funded 'lightning commando brigade

    508 shares
    Share 203 Tweet 127
  • Google faces new multi-billion advertising lawsuit

    508 shares
    Share 203 Tweet 127
  • Trending
  • Comments
  • Latest

Australia helicopter collision: Mid-air clash wreckage covers Gold Coast

January 10, 2023

UK inflation: Supermarkets say price rises will ease soon

April 19, 2023

Ballyjamesduff: Man dies after hit-and-run in County Cavan

August 19, 2022

Stranger Things actor Jamie Campbell Bower praised for addiction post

0

NHS to close Tavistock child gender identity clinic

0

Cold sores traced back to kissing in Bronze Age by Cambridge research

0

Stonham Aspal red squirrels mark ‘fabulous conservation effort’

June 25, 2026

The Papers: 'Never again' and 'No 10 of the north'

June 25, 2026

Linkin Park: UK rapper thanks Mike Shinoda for changing her life

June 25, 2026

Categories

Science

Stonham Aspal red squirrels mark ‘fabulous conservation effort’

June 25, 2026
0

According to Natural England, external, causes for the decline include the introduction of grey squirrels from the USA and...

Read more

The Papers: 'Never again' and 'No 10 of the north'

June 25, 2026
News

Copyright © 2020 JBC News Powered by JOOJ.us

Explore the JBC

  • Home
  • News
  • Sport
  • Worklife
  • Travel
  • Reel
  • Future
  • More

Follow Us

  • Home Main
  • Video
  • World
  • Top News
  • Business
  • Sport
  • Tech
  • UK
  • In Pictures
  • Health
  • Reality Check
  • Science
  • Entertainment & Arts
  • Login

Copyright © 2020 JBC News Powered by JOOJ.us

Welcome Back!

Login to your account below

Forgotten Password?

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
News
More Sites

    MORE

  • Home
  • News
  • Sport
  • Worklife
  • Travel
  • Reel
  • Future
  • More
  • News

    JBC News