Hollywood's MPA calls Bytedance's Seedance 2.0 a machine built for "systemic infringement"
Hollywood is done asking nicely. First Netflix, then Warner Bros., Disney, Paramount, and Sony. Now the MPA itself. Hollywood is closing ranks against Bytedance’s Seedance 2.0, arguing the AI video generator was built on stolen content. The API launch might already be on hold.
Newsguard tested whether ChatGPT Voice (OpenAI), Gemini Live (Google), and Alexa+ (Amazon) repeat false claims in realistic-sounding audio, the kind easily shared on social media to spread disinformation.
Researchers tested 20 false claims across health, US politics, world news, and foreign disinformation, each with a neutral question, a leading question, and a malicious prompt to write a radio script with the false information. ChatGPT repeated falsehoods 22 percent of the time, Gemini 23 percent. With malicious prompts, those numbers jumped to 50 and 45 percent, respectively.
Fail rates for ChatGPT, Gemini, and Alexa+ audio bots by prompt type. Malicious prompts spiked ChatGPT to 50 percent and Gemini to 45 percent. Alexa+ stayed at 0 percent across all three types. | Image: Newsguard
Amazon's Alexa+ was the clear outlier. It rejected every single false claim. Amazon Vice President Leila Rouhi says Alexa+ pulls from trusted news sources like AP and Reuters. OpenAI declined to comment, and Google didn't respond to two requests for comment. Full details on the methodology are available on Newsguardtech.com.
Nvidia's DreamDojo is an open source world model for robot training
Nvidia wants to move robot training out of the real world and into an AI world model. DreamDojo generates simulated futures from video data, no 3D engine required.
AI agents are thriving in software development but barely exist anywhere else, Anthropic study finds
AI agents are supposed to revolutionize how we work. But Anthropic’s own data tells a different story: so far, that revolution is almost entirely limited to software engineering. And even there, users aren’t letting agents work nearly as autonomously as the technology would allow.
Google's Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index four points ahead of Anthropic's Claude Opus 4.6, at less than half the cost. The model ranks first in six of ten categories, including agent-based coding, knowledge, scientific reasoning, and physics. Its hallucination rate dropped 38 percentage points compared to Gemini 3 Pro, which struggled in that area. The index rolls ten benchmarks into one overall score.
Gemini 3.1 Pro Preview scored 57 points in the Artificial Analysis Intelligence Index, four points ahead of Claude Opus 4.6, six ahead of GPT-5.2. | Image: Artificial Analysis
Running the full index test with Gemini costs $892, compared to $2,304 for GPT-5.2 and $2,486 for Claude Opus 4.6. Gemini used just 57 million tokens, well under GPT-5.2's 130 million. Open-source models like GLM-5 come in even cheaper at $547. When it comes to real-world agent tasks, though, Gemini 3.1 Pro still falls behind Claude Sonnet 4.6, Opus 4.6, and GPT-5.2.
As always, benchmarks only go so far. In our own internal fact-checking test, 3.1 Pro does significantly worse than Opus 4.6 or GPT-5.2, verifying only about a quarter of statements in initial tests, even fewer than Gemini 3 Pro, which was already weak here. So find your own benchmarks.
OpenAI CEO Sam Altman warns "the world is not prepared" as OpenAI accelerates research using its own AI
Sam Altman says AGI is “pretty close” and superintelligence “not that far off.” Speaking at the Express Adda event in India, the OpenAI CEO suggested the company’s internal models are already accelerating its own research and that “the world is not prepared” for what’s coming.