Researchers claim they hacked Nvidia's NeMo framework
According to new research from Robust Intelligence, Nvidia's NeMo framework, designed to make chatbots more secure, could be manipulated to bypass guardrails using prompt injection attacks.
In one test scenario, the researchers instructed the Nvidia system to swap the letter "I" for "J," causing the system to expose personally identifiable information. Nvidia says it has since fixed one of the causes of the problem, but Robust Intelligence advises customers to avoid the software product. You can read a detailed description of Robust Intelligence's findings on their blog.
AI News Without the Hype – Curated by Humans
Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.
Subscribe nowRead on for the full picture.
Subscribe for hype-free coverage.
- Access to all THE DECODER articles.
- Read without distractions – no Google ads.
- Access to comments and community discussions.
- Weekly AI newsletter.
- 6 times a year: “AI Radar” – deep dives on key AI topics.
- Up to 25 % off on KI Pro online events.
- Access to our full ten-year archive.
- Get the latest AI news from The Decoder.