Ad
Skip to content

A new platform lets AI agents pay humans to do the real-world work they can't

On Rentahuman.ai, AI agents can hire people for real-world tasks, from holding signs to picking up packages. It sounds absurd, but it shows what happens when language models stop just talking and start taking action.

Read full article about: Anthropic partners with leading research institutes to tackle biology's data bottleneck

Anthropic has announced two partnerships with major US research institutions to develop AI agents for biological research. The Allen Institute and the Howard Hughes Medical Institute (HHMI) will serve as founding partners in the initiative. According to Anthropic, "modern biological research generates data at unprecedented scale," but turning it into "validated biological insights remains a fundamental bottleneck." The company says manual processes "can't keep pace with the data being produced."

HHMI will develop specialized AI agents at the Janelia Research Campus that connect experimental knowledge to scientific instruments and analysis pipelines. The Allen Institute is working on multi-agent systems for data integration and experiment design that could "compress months of manual analysis into hours." According to Anthropic, these systems "are designed to amplify scientific intuition rather than replace it, keeping researchers in control of scientific direction while handling computational complexity."

The move extends Anthropic's push into scientific applications. The company recently launched Cowork, a feature designed for office work that gives Claude access to local files. OpenAI is also targeting the research market with Prism, an AI workspace for scientific writing.

Ad
Read full article about: Gemini models dominate new AI rankings for strategic board games

Google's Gemini models are outperforming the competition in board game benchmarks. Google Deepmind and Kaggle have expanded their "Game Arena" platform with two new games: Werewolf and Poker. The platform tests AI models across strategic games that measure different cognitive abilities—chess evaluates logical thinking, Werewolf tests social skills like communication and detecting deception, and Poker assesses how models handle risk and incomplete information.

These games provide objective ways to measure skills like planning and decision-making under uncertainty. Gemini 3 Pro and Gemini 3 Flash currently hold the top spots in all rankings. The Werewolf benchmark serves double duty for security research as well: it tests whether models can detect manipulation without any real-world consequences. According to Google Deepmind CEO Demis Hassabis, the AI industry needs more rigorous tests to properly evaluate the latest models.

Read full article about: French prosecutors raid X's Paris offices over data and child abuse allegations

French prosecutors have raided the Paris offices of Elon Musk's platform X. The cybercrime unit is investigating multiple allegations, including unlawful data extraction and aiding the distribution of child sexual abuse material. Sexual deepfakes are also part of the investigation. Musk and former X CEO Linda Yaccarino have been summoned for hearings in April, according to the BBC. X has previously called the investigation politically motivated.

At the same time, the UK's Information Commissioner's Office (ICO) has opened an investigation into Musk's AI tool Grok. The probe focuses on whether personal data was used without consent to create sexualized images. The UK media regulator Ofcom and the European Commission are also continuing their reviews of the platform. X has not commented on the investigations.

Comment Source: BBC
Ad
Read full article about: Firefox users will soon be able to block all generative AI features in one place

Mozilla is rolling out new AI settings with Firefox 148 on February 24. Users will be able to manage all the browser's generative AI features from a single location, or turn them off entirely, the company announced in a blog post.

The new settings cover translations, automatic image descriptions in PDFs, AI-powered tab grouping, link previews, and a chatbot in the sidebar. The chatbot supports services like Anthropic Claude, ChatGPT, Microsoft Copilot, Google Gemini, and Le Chat Mistral.

For users who want nothing to do with AI features, a single toggle blocks all AI extensions. Once enabled, no pop-ups or notifications about current or future AI features will appear. The settings persist through updates. Users who want to try the feature early can find it in Firefox Nightly.

Ad