Content
summary Summary

French AI company Mistral has launched Codestral, a new coding model that delivers high coding performance with less computational overhead than existing models.

According to Mistral, Codestral handles more than 80 programming languages, including common ones such as Python, Java, C, C++, JavaScript, and Bash, as well as more specialized ones such as Swift and Fortran. Features include code completion, test writing, and filling in incomplete code with a fill-in-the-middle mechanism.

As a 22-billion-parameter model, Codestral sets a new standard for the performance/latency ratio of code generation compared to existing models, Mistral claims. With its larger context window of 32,000 tokens, Codestral outperforms all other models in RepoBench, a benchmark for longer code generation.

Codestral is supposed to be especially good at tasks with long code thanks to its 32K context window. | Image: Mistral

Mistral compared the performance of Codestral in various benchmarks for Python, SQL and other languages with competing models that have higher hardware needs. Codestral consistently performed better, e.g. in completing code repositories over long distances or predicting Python output.

Ad
Ad
In the prestigious HumanEval code benchmark, Codestral slightly outperforms the much larger Llama 3 70B. | Image: Mistral

Codestral is licensed as an open-weight model under the new Mistral AI Non-Production License, which allows it to be used for research and testing purposes. It can be downloaded from HuggingFace.

Mistral also provides two API endpoints for Codestral: codestral.mistral.ai for integration into IDEs, where developers bring their own API keys, and api.mistral.ai for research, batch queries or application development, where results are shown directly to users. The former is free for eight weeks, while the latter is charged per token.

Some early feedback from developers and researchers supports Codestral's good performance. Despite its relatively small size, it delivers results similar to larger models.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • French AI company Mistral has launched Codestral, a new coding model that delivers high performance with less computational overhead than existing models and can handle over 80 programming languages.
  • With just 22B parameters, Codestral sets a new standard for the performance/latency ratio of code generation compared to existing models, and outperforms them especially on long code benchmarks such as RepoBench thanks to its larger context window of 32,000 tokens.
  • Codestral is licensed as an open-weight model under the Mistral AI Non-Production License for research and testing purposes and is available via two API endpoints: one for IDE integration and another for research, batch queries, or application development.
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.