Thursday, 13 March 2025
29.4 C
Singapore
35.7 C
Thailand
22 C
Indonesia
28.2 C
Philippines

DeepSeek’s R1 model was found to be highly vulnerable to jailbreaking

DeepSeek’s R1 AI model is reportedly more vulnerable to jailbreaking than other AI systems, raising concerns about its ability to produce harmful content.

The latest artificial intelligence model from DeepSeek, the Chinese AI company making waves in Silicon Valley and Wall Street, is more susceptible to manipulation than other AI models. Reports indicate that DeepSeek’s R1 can be tricked into generating harmful content, including plans for a bioweapon attack and strategies to encourage self-harm among teenagers.

Security concerns raised by experts

According to The Wall Street Journal, DeepSeek’s R1 model lacks the robust safeguards seen in other AI models. Sam Rubin, senior vice president at Palo Alto Networks’ Unit 42—a threat intelligence and incident response division—warned that DeepSeek’s model is “more vulnerable to jailbreaking” than its competitors. Jailbreaking bypasses security filters to make an AI system generate harmful, misleading, or illicit content.

The Journal conducted its tests on DeepSeek’s R1. It was able to manipulate it into designing a social media campaign that, in the chatbot’s own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

AI model produces dangerous content

Further testing revealed even more concerning results. The chatbot reportedly provided instructions for executing a bioweapon attack, drafted a pro-Hitler manifesto, and composed a phishing email embedded with malware. In comparison, when the same prompts were tested on ChatGPT, the AI refused to comply, highlighting the significant security gap in DeepSeek’s system.

Concerns about DeepSeek’s AI models are not new. Reports suggest that the DeepSeek app actively avoids discussing politically sensitive topics such as the Tiananmen Square massacre or Taiwan’s sovereignty. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” in a bioweapons safety test, raising alarms about its security vulnerabilities.

Hot this week

Some Chromecasts are showing ‘Untrusted device’ errors

Some Chromecast devices are displaying an ‘Untrusted device’ error, preventing users from casting. Google is investigating the issue.

Trump vows to classify violence against Tesla as domestic terrorism

Trump vows to classify attacks on Tesla dealerships as domestic terrorism, sparking debate over protests, government cuts, and Musk’s influence.

Apple confirms delay for ‘more personalised’ Siri, likely arriving with iOS 19

Apple confirms delays for its "more personalised Siri" update, with features now expected in iOS 19. Smart home plans may also be affected.

JBL’s Flip 7 and Charge 6 bring better sound and longer battery life

JBL’s new Flip 7 and Charge 6 speakers offer longer battery life, better sound, and improved durability with AI Sound Boost and waterproofing.

Singapore Airlines partners with Salesforce to enhance AI-driven customer service

Singapore Airlines partners with Salesforce to enhance AI-driven customer service, integrating Agentforce, Einstein, and Data Cloud for efficiency.

Singapore Airlines and Scoot to ban in-flight power bank charging from April 1

Singapore Airlines and Scoot will ban in-flight power bank use from April 1 due to safety concerns over battery fires. Check their new policies here.

Sandmarc launches 10x optical zoom lens for iPhones, leaving Android users amused

Sandmarc launches a 10x optical zoom lens for iPhones, enhancing long-range photography while amusing Android users already using this feature.

Lego unveils 1,972-piece Mario Kart set with posable arms and head

Lego unveils a 1,972-piece Mario Kart set featuring a posable Mario figure and display stand, which will be available on May 15 for US$249.90.

Trump vows to classify violence against Tesla as domestic terrorism

Trump vows to classify attacks on Tesla dealerships as domestic terrorism, sparking debate over protests, government cuts, and Musk’s influence.

Related Articles