Introduction

Mistral AI officially unveiled Leanstral on March 16, 2026, a notable step for AI-assisted software engineering. The release targets a persistent bottleneck in formal verification, namely human review, and is aimed at developers who need mathematical certainty about their code. Unlike general-purpose code models, whose output is not validated by a proof checker, Leanstral is purpose-built to generate machine-checkable proofs alongside executable code.

The industry has long struggled with the reliability of AI-generated software, particularly in critical systems where bugs are unacceptable. Leanstral addresses this by integrating Lean 4, a proof assistant and programming language, directly into the AI workflow. Because Lean checks every proof mechanically, accepted output is not just syntactically correct but proven to satisfy its stated specification, which reduces the risk of logic errors and security vulnerabilities in production environments.

First open-source code agent for Lean 4 formal proof engineering
Generates code AND machine-checkable mathematical proofs
Apache 2.0 license ensures full commercial freedom
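
To make "code and machine-checkable proofs" concrete, here is a minimal Lean 4 sketch of the pairing. It is a hand-written illustration, not Leanstral output. The names double and double_eq_two_mul are hypothetical, and the omega tactic assumes a recent Lean 4 toolchain.

```lean
-- Hypothetical example: an executable definition plus a proof about it.
def double (n : Nat) : Nat := n + n

-- Specification: doubling a natural number equals multiplying it by two.
-- Lean accepts this file only if the proof below actually checks.
theorem double_eq_two_mul (n : Nat) : double n = 2 * n := by
  unfold double
  omega

#eval double 21 -- 42
```

If the proof were wrong, or the definition did not match the specification, Lean would reject the file. That property is what lets a proof checker stand in for part of the human review step.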

Key Features & Architecture

At its core, Leanstral is a sparse Mixture of Experts (MoE) model designed for hardware efficiency without sacrificing reasoning capability. It has 119 billion total parameters, of which only 6.5 billion are active per token. Sparse activation reduces per-token compute and speeds up inference relative to a dense model of similar size, although all 119 billion parameters must still be held in memory. This makes deployment viable on a wider range of hardware.
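
The parameter counts above translate into some rough arithmetic, sketched here in Python. Only the two parameter counts come from the article. The bytes-per-parameter values and the "about 2 FLOPs per active parameter per token" rule of thumb are illustrative assumptions, not published figures for Leanstral.

```python
# Figures from the article.
TOTAL_PARAMS = 119e9    # total parameters (MoE)
ACTIVE_PARAMS = 6.5e9   # active parameters per token


def active_fraction(total: float, active: float) -> float:
    """Share of the weights that participate in processing one token."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes).

    All experts must be stored, so this uses the total parameter count.
    """
    return params * bytes_per_param / 1e9


def forward_flops_per_token(active_params: float) -> float:
    """Rule-of-thumb forward-pass cost: about 2 FLOPs per active parameter.

    This is an assumption for illustration, not a measured number.
    """
    return 2 * active_params


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"Active share per token: {frac:.1%}")

    # Assumed storage precisions: 16-bit, 8-bit, 4-bit.
    for label, nbytes in [("16-bit", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
        gb = weight_memory_gb(TOTAL_PARAMS, nbytes)
        print(f"Weights at {label}: about {gb:.1f} GB")

    sparse = forward_flops_per_token(ACTIVE_PARAMS)
    dense = forward_flops_per_token(TOTAL_PARAMS)
    print(f"Per-token compute vs. a dense 119B model: {sparse / dense:.1%}")
```

Under these assumptions, the saving from sparsity shows up in per-token compute, while weight storage still scales with the full 119 billion parameters.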

The architecture is tuned for the logical complexity of formal verification. The model handles Lean 4 syntax well enough to construct proofs that the proof checker accepts. This combination of coding and proving is unusual in the open-source landscape and bridges the gap between rapid prototyping and rigorous validation.

119B Total Parameters (MoE)
6.5B Active Parameters per token
Context Window: 128k tokens
Apache 2.0 License

Performance & Benchmarks

Benchmark results show clear gains over general-purpose coding models on formal verification tasks. On the FLTEval benchmark, Leanstral outperforms Claude Sonnet 4.6. This matters for developers working on safety-critical software, where standard code generation is insufficient.

Leanstral by Mistral AI: The Open-Source Proof Agent Revolutionizing Code Verification
