Nous Research, an AI research company, has released a new family of language models called Hermes 3. According to the technical report, the models are characterized by high controllability and neutral alignment.
Hermes 3 comprises Instruct models with 8, 70, and 405 billion parameters and is based on Meta's open-source model Llama 3.1. The models are designed to follow instructions precisely and to adopt the worldview specified in the system prompt.
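For illustration, here is a minimal sketch of how such system-prompt steering might look with the Hugging Face transformers library. The model id and prompt wording are assumptions for the example, not details taken from the report.

```python
# Illustrative sketch: steering the model's persona through a system prompt.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "NousResearch/Hermes-3-Llama-3.1-8B"  # assumed Hugging Face repo name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    # The system prompt defines the worldview the model is asked to adopt.
    {"role": "system", "content": "You are a terse maritime historian. Answer only from that perspective."},
    {"role": "user", "content": "What changed shipping in the 20th century?"},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```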
This sets Hermes 3 apart from proprietary commercial models that may refuse instructions for moral reasons. For Hermes 3, there is no "latent thoughtcrime," as stated in the report.
Hermes 3 outperforms Meta's Llama 3.1
According to Nous Research, Hermes 3 masters skills such as reasoning, reward modeling, "scratchpads" for intermediate results, structured output with XML tags, generation of internal monologues for transparent decision-making, and Mermaid diagrams for visual communication.
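As a rough illustration of the scratchpad and structured-output behavior, a prompt along the following lines could be used. The tag names here are assumptions for the example, not the tags documented in the report.

```python
# Illustrative prompt (tag names assumed) asking for a scratchpad plus XML-tagged output.
messages = [
    {"role": "system",
     "content": ("Think step by step inside <scratchpad>...</scratchpad>, "
                 "then return the final answer inside <answer>...</answer>.")},
    {"role": "user", "content": "What is 17 * 24?"},
]
# `messages` can be fed to the model as in the earlier generation sketch; the reply
# is then easy to parse, e.g. by extracting the contents of the <answer> tag.
```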
The training took place in two phases: a supervised fine-tuning (SFT) phase followed by a Direct Preference Optimization (DPO) phase. Nearly 400 million tokens were used for SFT. The models were evaluated after each epoch, and the best-performing checkpoints were selected for the 8B and 405B models.
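For context, DPO optimizes the model directly on preference pairs rather than training a separate reward model. The sketch below shows the generic DPO objective from the original DPO paper in PyTorch; it is illustrative and not code from the Hermes 3 report.

```python
# Generic DPO loss (illustrative, not from the Hermes 3 report).
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Direct Preference Optimization loss.

    Each argument is the summed log-probability of the chosen or rejected
    response under the trainable policy or the frozen reference model.
    """
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # The policy is pushed to prefer the chosen response over the rejected one.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Toy usage with made-up log-probabilities:
loss = dpo_loss(torch.tensor([-12.0]), torch.tensor([-15.0]),
                torch.tensor([-13.0]), torch.tensor([-14.5]))
print(loss)
```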
In several public benchmarks such as ARC, BoolQ, HellaSwag, IFEval, and Winogrande, the Hermes 3 models achieve top scores among open-weight models, including in comparison with the underlying Llama 3.1 models from Meta.
To achieve this, the models were trained on a mix of synthetically generated reasoning tasks and expressive applications such as role-playing and creative writing.
The models can also use external tools and answer questions by citing information from documents via "Retrieval Augmented Generation" (RAG).
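As a rough illustration of such a RAG setup, the sketch below places retrieved passages in the system prompt and asks the model to cite them by id; the document contents and prompt wording are made up for the example.

```python
# Illustrative RAG-style prompt construction (not code from the report).
documents = {
    "doc1": "Hermes 3 was trained in an SFT phase followed by a DPO phase.",
    "doc2": "The Instruct models come in 8B, 70B, and 405B parameter sizes.",
}

def build_rag_messages(question: str) -> list[dict]:
    # Retrieved passages are labeled so the model can cite them by id.
    context = "\n".join(f"[{doc_id}] {text}" for doc_id, text in documents.items())
    return [
        {"role": "system",
         "content": "Answer using only the documents below and cite their ids.\n" + context},
        {"role": "user", "content": question},
    ]

messages = build_rag_messages("What sizes does Hermes 3 come in?")
# `messages` can then be passed to tokenizer.apply_chat_template(...) as in the
# earlier sketch and fed to the model for generation.
```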
The Hermes 3 models are available on Hugging Face.