Content
summary Summary

With the publication of the "Model Spec", OpenAI wants to stimulate a public discussion about how AI models should behave. The document defines objectives, rules, and standard behaviors for the design of model behavior.

OpenAI has released the first version of its "Model Spec," a document that specifies desired behavior for AI models in the OpenAI API and ChatGPT, the company announced. The spec contains a set of core objectives, rules, and standard behaviors. Model behavior, or how models respond to user input, is crucial to how people interact with AI, according to OpenAI. However, designing this behavior is still a young science, as models are not explicitly programmed but learn from a variety of data.

The Model Spec reflects OpenAI's documentation, research, and experience in shaping model behavior, as well as ongoing work that will influence the development of future models, the company said.

Objectives, rules, standard behaviors - OpenAI pursues a multi-level approach

The Model Spec serves as a guideline for researchers and "AI trainers" to generate data for Reinforcement Learning from Human Feedback (RLHF). In the long term, OpenAI wants to investigate whether AI models can also learn directly from the Model Spec.

Ad
Ad

The model specification distinguishes between objectives, rules, and standard behaviors:

  • Objectives provide a general direction for desirable behavior, but are often too broad to give specific instructions.
  • Rules resolve conflicts between objectives and ensure safety and legality. They cannot be overridden by developers or users.
  • Standard behaviors outline behaviors that align with the principles but ultimately leave control to developers and users. They also show how to prioritize conflicting goals.

Objectives include supporting developers and end-users, benefiting humanity, and representing OpenAI well. Rules include following instructions by priority, obeying laws, avoiding illegal or harmful content, and protecting copyrights and personality rights.

Standard behaviors include assuming good intentions, asking clarifying questions, being objective, avoiding influencing opinions, expressing uncertainty, and being efficient while respecting length limits.

Image: OpenAI

OpenAI: Model specs will continue to evolve

The company sees the release as part of an ongoing public discussion about how models should behave, how desired model behavior is defined, and how the public can best be involved in these discussions. OpenAI now aims to involve representative stakeholders from around the world, such as policymakers, trusted institutions, and experts.

Over the next two weeks, OpenAI invites the general public to provide feedback on the goals, rules, and standards in the Model Spec. Like the models themselves, the Model Spec will be continuously developed based on the feedback received.

Recommendation

The complete model spec is available in the documentation.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • OpenAI has released the first version of the Model Spec, a document that specifies the desired behavior for AI models in the OpenAI API and ChatGPT.
  • The Model Spec distinguishes between general objectives, rules, and recommended standard behaviors. Objectives include supporting users and benefiting humanity. Rules include following instructions and laws. Standard behaviors include objectivity and expressing uncertainty.
  • OpenAI invites the public to provide feedback. The model specification will be continuously developed according to the company.
Sources
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.