summary Summary

Stability AI releases StableVicuna, the first large-scale open-source chatbot trained with human feedback.


Stability AI, the company behind the successful open-source image model Stable Diffusion, releases StableVicuna, an open-source chatbot. The chatbot is based on the Vicuna chatbot released in early April, which is a 13 billion parameter LLaMA model tuned with the Alpaca formula.

What is special about the Vicuna variant of Stability AI and Carper AI is that the model was improved using so-called "Reinformcent Learning with Human Feedback" (RLHF) (see below for explanation).

This was done using datasets from OpenAssistant, Anthropic, and Stanford University, as well as the open-source training framework rlX, also from Carper AI. Stability AI is working with OpenAssistant on larger RLHF datasets for future models.


According to Stability AI, StableVicuna does simple math in addition to text generation and can write code. In common benchmarks, StableVicuna is on par with previously released open-source chatbots. However, benchmarks are only partially indicative of how a model will perform in practice.

Image: Stability AI

According to Stability AI, StableVicuna will be developed further and launched on Discord soon. A demo is now available on HuggingFace. Stability AI also plans to make StableVicuna available through a chat interface soon.

Developers can download the model's weights as a delta to the original LLaMA model at Hugging Face. Those who want to use StableVicuna themselves will need access to the original LLaMA, which can be requested here. Commercial use is not allowed.

The problem with open-source chatbots that are refined with generated chatbot data is the risk of an echo chamber, in which the AI models reinforce their existing errors and biases through ever new training processes. In addition, training data generated for fine-tuning can reinforce hallucinations if it contains information not present in the original model.


Background: With RLHF to a useful chatbot

Reinforcement learning with human feedback (RLHF) was the key to ChatGPT's success: Only through the small-scale feedback work of thousands of people providing detailed ratings on the usefulness of tens of thousands of chat outputs could the chatbot be tuned to always feel it has an appropriate response ready.

RLHF also ensures that the chatbot's output stays within certain social norms. For example, it does not encourage crime. Without RLHF, GPT-4 would be much more difficult to use. It could generate very extreme content, such as detailed suggestions for the systematic annihilation of humanity.

As the drawing in the following tweet humorously illustrates, the RLHF-enhanced chatbot is just a small part of a large language model optimized for human interaction.

Just a few days ago, Stability AI launched the StableLM open-source language model family. More information and downloads are available on Stability AI's Github.

Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
  • Stability AI is known for its successful open-source AI model, Stable Diffusion. The company is now releasing StableVicuna, an open-source chatbot.
  • It is the first open-source chatbot trained with human feedback, the special sauce behind ChatGPT.
  • StableVicuna will soon be launched on Discord and will have its own chat interface. Those who want to use the chatbot themselves will need access to Meta's LLaMA base model.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.