Writing backwards can trick an AI into providing a bomb recipe

2 hours ago 13

AI models have safeguards in place to prevent them creating dangerous or illegal output, but a range of jailbreaks have been found to evade them. Now researchers show that writing backwards can trick AI models into revealing bomb-making instructions.

Read Entire Article

Writing backwards can trick an AI into providing a bomb recipe

Related

17,000-year-old remains of blue-eyed baby boy unearthed in I...

How do people die of the flu?

Folklore uncovers a tsunami that rocked Hawaii hundreds of y...