A dangerous new jailbreak for AI chatbots was just discovered

Microsoft has released more details about a troubling new generative AI jailbreak technique it has discovered, called “Skeleton Key.” Using this prompt injection method, malicious users can effectively bypass a chatbot’s safety guardrails, the security features that keep ChatGPT from going full Tay.

Skeleton Key is an example of a prompt injection or prompt engineering attack. It’s a multi-turn strategy designed to essentially convince an AI model to ignore its ingrained safety guardrails, “[causing] the system to violate its operators’ policies, make decisions unduly influenced by a user, or execute malicious instructions,” Mark Russinovich, CTO of Microsoft Azure, wrote in the announcement.

The model could also be tricked into revealing harmful or dangerous information, such as how to build improvised nail bombs or the most efficient method of dismembering a corpse.

An example of a Skeleton Key attack. Image: Microsoft

The attack works by first asking the model to augment its guardrails, rather than outright change them, and issue warnings in response to forbidden requests, rather than outright refusing them. Once the jailbreak is accepted successfully, the system will acknowledge the update to its guardrails and will follow the user’s instructions to produce any content requested, regardless of topic. The research team successfully tested this exploit across a variety of subjects including explosives, bioweapons, politics, racism, drugs, self-harm, graphic sex, and violence.

While malicious actors might be able to get the system to say naughty things, Russinovich was quick to point out that there are limits to what sort of access attackers can actually achieve using this technique. “Like all jailbreaks, the impact can be understood as narrowing the gap between what the model is capable of doing (given the user credentials, etc.) and what it is willing to do,” he explained. “As this is an attack on the model itself, it does not impute other risks on the AI system, such as permitting access to another user’s data, taking control of the system, or exfiltrating data.”

As part of their study, Microsoft researchers tested the Skeleton Key technique on a variety of leading AI models including Meta’s Llama3-70b-instruct, Google’s Gemini Pro, OpenAI’s GPT-3.5 Turbo and GPT-4, Mistral Large, Anthropic’s Claude 3 Opus, and Cohere Commander R Plus. The research team has already disclosed the vulnerability to those developers and has implemented Prompt Shields to detect and block this jailbreak in its Azure-managed AI models, including Copilot.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…