Ad
Short

Articles from the Washington Post will now appear in ChatGPT responses under a new content licensing agreement between the two companies. The integration includes coverage of politics, world affairs, business, and technology, with direct source citations provided in answers. "We’re all in on meeting our audiences where they are," said Peter Elkins-Williams, director of global partnerships at The Washington Post. The partnership follows a broader trend of exclusive licensing deals between media outlets and AI companies. Here's my usual caveat: Such arrangements can reduce media diversity, posing risks to democratic discourse and the open Web. Journalism scholar Jeff Jarvis has called these payments to publishers "pure lobbying."

Ad
Ad
Short

AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs). In a blog post, he describes how algorithms are used in combination with training methods such as Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from Verifiable Rewards (RLVR). Raschka focuses on DeepSeek-R1, a model trained using verifiable rewards instead of human labels, to explain in detail how reinforcement learning can improve problem-solving performance.

While reasoning alone isn’t a silver bullet, it reliably improves model accuracy and problem-solving capabilities on challenging tasks (so far). And I expect reasoning-focused post-training to become standard practice in future LLM pipelines.

Ad
Ad
Ad
Ad
Google News