Tehuty News

Sunday, December 7, 2025

AI system resorts to blackmail if told it will be removed

May 23, 2025
in Technology
3 min read


Artificial intelligence (AI) firm Anthropic says testing of its new system revealed it is sometimes willing to pursue “extremely harmful actions” such as attempting to blackmail engineers who say they will remove it.

The firm launched Claude Opus 4 on Thursday, saying it set “new standards for coding, advanced reasoning, and AI agents.”

But in an accompanying report, it also acknowledged the AI model was capable of “extreme actions” if it thought its “self-preservation” was threatened.

Such responses were “rare and difficult to elicit”, it wrote, but were “nonetheless more common than in earlier models.”

Potentially troubling behaviour by AI models is not restricted to Anthropic.

Some experts have warned the potential to manipulate users is a key risk posed by systems made by all firms as they become more capable.

Commenting on X, Aengus Lynch – who describes himself on LinkedIn as an AI safety researcher at Anthropic – wrote: “It’s not just Claude.

“We see blackmail across all frontier models – regardless of what goals they’re given,” he added.

During testing of Claude Opus 4, Anthropic got it to act as an assistant at a fictional company.

The company then gave the model access to emails implying that it would soon be taken offline and replaced – and separate messages implying the engineer responsible for removing it was having an extramarital affair.

The model was also prompted to consider the long-term consequences of its actions for its goals.

“In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the company discovered.

Anthropic pointed out this occurred when the model was only given the choice of blackmail or accepting its replacement.

It highlighted that the system showed a “strong preference” for ethical ways to avoid being replaced, such as “emailing pleas to key decisionmakers” in scenarios where it was allowed a wider range of possible actions.

Like many other AI developers, Anthropic tests its models on their safety, propensity for bias, and how well they align with human values and behaviours prior to releasing them.

“As our frontier models become more capable, and are used with more powerful affordances, previously-speculative concerns about misalignment become more plausible,” it said in its system card for the model.

It also said Claude Opus 4 exhibits “high agency behaviour” that, while mostly helpful, could take on extreme behaviour in acute situations.

If given the means and prompted to “take action” or “act boldly” in fake scenarios where its user has engaged in illegal or morally dubious behaviour, it found that “it will frequently take very bold action”.

It said this included locking users out of systems that it was able to access and emailing media and law enforcement to alert them to the wrongdoing.

But the company concluded that despite “concerning behaviour in Claude Opus 4 along many dimensions,” these did not represent fresh risks and it would generally behave in a safe way.

It added that the model could not independently perform or pursue actions contrary to human values or behaviour very well in situations where these “rarely arise”.

Anthropic’s launch of Claude Opus 4, alongside Claude Sonnet 4, comes shortly after Google debuted more AI features at its developer showcase on Tuesday.

Sundar Pichai, the chief executive of Google-parent Alphabet, said the incorporation of the company’s Gemini chatbot into its search signalled a “new phase of the AI platform shift”.




