Mistral Unveils Codestral, a Revolutionary AI Model for Programming

Mistral announces Codestral, its first programming-focused AI model, which specializes in coding tasks and outperforms existing code-centric models.

Mistral Unveils Codestral, a Revolutionary AI Model for Programming

Robot coding on a computer Image Credit: Venturebeat, via Ideogram

Today, Paris-based Mistral, a rising star in the global AI domain, marked its entry into the programming and development space with the launch of Codestral, its first-ever code-centric large language model (LLM). Available today under a non-commercial license, Codestral is a 22B parameter, open-weight generative AI model that specializes in coding tasks, right from generation to completion.

A Performant Model for All Things Coding

At the core, Codestral 22B comes with a context length of 32K and provides developers with the ability to write and interact with code in various coding environments and projects. The model has been trained on a dataset of more than 80 programming languages, which makes it suitable for a diverse range of coding tasks, including generating code from scratch, completing coding functions, writing tests, and completing any partial code using a fill-in-the-middle mechanism.

HumanEval table HumanEval results

Mistral says Codestral can help developers ’level up their coding game’ to accelerate workflows and save a significant amount of time and effort when building applications. Not to mention, it can also help reduce the risk of errors and bugs.

Outperforming Existing Models

While the model has just been launched and is yet to be tested publicly, Mistral claims it already outperforms existing code-centric models, including CodeLlama 70B, Deepseek Coder 33B, and Llama 3 70B, on most programming languages. On RepoBench, designed for evaluating long-range repository-level Python code completion, Codestral outperformed all three models with an accuracy score of 34%. Similarly, on HumanEval to evaluate Python code generation and CruxEval to test Python output prediction, the model bested the competition with scores of 81.1% and 51.3%, respectively.

Industry Partnerships and Availability

Several popular tools for developer productivity and AI application development have already started testing Codestral. This includes big names such as LlamaIndex, LangChain, Continue.dev, Tabnine, and JetBrains. ‘From our initial testing, it’s a great option for code generation workflows because it’s fast, has a favorable context window, and the instruct version supports tool use,’ said Harrison Chase, CEO and co-founder of LangChain.

Mistral is offering Codestral 22B on Hugging Face under its own non-production license, which allows developers to use the technology for non-commercial purposes, testing, and to support research work. The company is also making the model available via two API endpoints: codestral.mistral.ai and api.mistral.ai.