Hugging Face works to replicate OpenAI's Deep Research capabilities with open-source AI agent

A team at Hugging Face, led by chief researcher Thomas Wolf, has created an open-source version of OpenAI's Deep Research system in 24 hours.

According to the Hugging Face blog, the team aims to make the proprietary technology accessible to everyone by replicating the agent framework behind OpenAI's Deep Research. Their agent writes its actions directly as program code instead of expressing them as JSON. According to the team, this cuts the number of steps by about 30 percent, which lowers costs and improves performance compared with JSON-based agents.

Comparison of two LLM agent implementations, text/JSON versus code-based, using APIs to compare the price of a smartphone across countries. When calculating the price in different countries, the JSON-based solution requires a separate action for each step (get exchange rate, look up price, calculate taxes). The code agent, by contrast, can perform the entire calculation in a single loop. | Image: via Hugging Face
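The difference described in the caption can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not Hugging Face's implementation: the price, exchange-rate, and tax tables are made-up placeholder data, the tool names (`lookup_price`, `get_exchange_rate`, `get_tax_rate`) and the step counting are hypothetical, and the plain `exec` call stands in for whatever sandboxed interpreter a real agent framework would use.

```python
import json

# Hypothetical placeholder data; none of these figures come from the article.
LOCAL_PRICES = {"US": 799.0, "DE": 899.0, "JP": 120000.0}  # price in local currency
USD_RATES = {"US": 1.0, "DE": 1.1, "JP": 0.007}            # local currency -> USD
TAX_RATES = {"US": 0.08, "DE": 0.19, "JP": 0.10}


def lookup_price(country):
    return LOCAL_PRICES[country]


def get_exchange_rate(country):
    return USD_RATES[country]


def get_tax_rate(country):
    return TAX_RATES[country]


# Hypothetical tool registry shared by both agent styles.
TOOLS = {
    "lookup_price": lookup_price,
    "get_exchange_rate": get_exchange_rate,
    "get_tax_rate": get_tax_rate,
}


def run_json_agent(countries):
    """JSON style: every tool call is a separate action, counted as one step."""
    steps = 0
    totals = {}
    for country in countries:
        results = {}
        for tool_name in ("lookup_price", "get_exchange_rate", "get_tax_rate"):
            # The model would emit this JSON string; here we build it ourselves.
            action = json.dumps({"tool": tool_name, "arguments": {"country": country}})
            call = json.loads(action)
            results[tool_name] = TOOLS[call["tool"]](**call["arguments"])
            steps += 1
        usd = results["lookup_price"] * results["get_exchange_rate"]
        totals[country] = round(usd * (1 + results["get_tax_rate"]), 2)
    return totals, steps


# Code style: the model would emit one snippet that loops over all countries.
CODE_ACTION = """
for country in countries:
    usd = lookup_price(country) * get_exchange_rate(country)
    totals[country] = round(usd * (1 + get_tax_rate(country)), 2)
"""


def run_code_agent(countries):
    """Code style: the whole loop runs as a single action, i.e. one step."""
    totals = {}
    namespace = dict(TOOLS, countries=countries, totals=totals)
    # exec is unsafe for untrusted code; it is used here only for illustration.
    exec(CODE_ACTION, namespace)
    return totals, 1


if __name__ == "__main__":
    countries = ["US", "DE", "JP"]
    print(run_json_agent(countries))
    print(run_code_agent(countries))
```

Both functions return the same totals, but the JSON-style run issues one action per tool call while the code-style run issues one action overall. The step counts in this toy are not representative of the roughly 30 percent reduction the team reports, which comes from their own benchmark runs.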

For the actual implementation, the team borrowed two key pieces from Microsoft's Magentic-One agent framework: a text-based web browser for searching and a text inspector that can read various file formats.

Testing the system's research capabilities

The team evaluated their system using the GAIA benchmark, which tests how AI agents handle complex research tasks. One example asks: "Which of the fruits shown in the 2008 painting 'Embroidery from Uzbekistan' were served as part of the October 1949 breakfast menu for the ocean liner that was later used as a floating prop for the film 'The Last Voyage'? Give the items as a comma-separated list, ordering them in clockwise order based on their arrangement in the painting starting from the 12 o'clock position. Use the plural form of each fruit."

To solve this puzzle, the AI agent needs to:

Identify the fruit in the painting through image processing
Determine which ocean liner appeared in the movie
Locate its breakfast menu from 1949
Present the information in the required format

Hugging Face's system scored 55.15 percent on these multi-step challenges. That's better than Microsoft Magentic-One's 46 percent, but still trails OpenAI's 67 percent with Deep Research.

The team acknowledges they still have work ahead to match OpenAI's Deep Research, particularly in improving browser interactions. One key difference: Hugging Face relies on available open-source language models, while OpenAI uses its own o3 model, specifically trained for web tasks using reinforcement learning.

Still, Hugging Face's results on the GAIA benchmark, coming on the heels of OpenAI's Deep Research release, suggest the gap between open-source and proprietary AI may be closing faster than expected. It is another indication, after the DeepSeek dilemma, that proprietary AI may not be the strongest business model.

The team's next step is to develop GUI agents that can interact directly with screens, mice, and keyboards. The code is available on GitHub, and a live demo is available as well. Other developers have created their own open-source versions, including dzhng, assafelovic, and Jina AI. Hugging Face plans to analyze and document these different approaches.
