AI in practice

Jun 9, 2023Jun 9, 2023

Researchers claim they hacked Nvidia's NeMo framework

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

According to new research from Robust Intelligence, Nvidia's NeMo framework, designed to make chatbots more secure, could be manipulated to bypass guardrails using prompt injection attacks.

In one test scenario, the researchers instructed the Nvidia system to swap the letter "I" for "J," causing the system to expose personally identifiable information. Nvidia says it has since fixed one of the causes of the problem, but Robust Intelligence advises customers to avoid the software product. You can read a detailed description of Robust Intelligence's findings on their blog.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

Financial Times Yaron Singer (CEO)

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

AI in practice

Jul 18, 2025Jul 18, 2025

Perplexity's valuation soared to $18 billion after its latest funding round

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI in practice

Jul 18, 2025Jul 18, 2025

OpenAI CEO Sam Altman warns users not to trust ChatGPT agent with sensitive or personal data

AI in practice

Jul 18, 2025Jul 18, 2025

Anthropic appears to tighten the usage limits for Claude code

Google News

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Researchers claim they hacked Nvidia's NeMo framework

Perplexity's valuation soared to $18 billion after its latest funding round

OpenAI CEO Sam Altman warns users not to trust ChatGPT agent with sensitive or personal data

Anthropic appears to tighten the usage limits for Claude code

OpenAI launches new ChatGPT agent that automates complex tasks for Pro, Plus, and Team

Kimi-K2 is the next open-weight AI milestone from China after Deepseek

New Energy-Based Transformer architecture aims to bring better "System 2 thinking" to AI models

Researchers claim they hacked Nvidia's NeMo framework

Perplexity's valuation soared to $18 billion after its latest funding round

OpenAI CEO Sam Altman warns users not to trust ChatGPT agent with sensitive or personal data

Anthropic appears to tighten the usage limits for Claude code