Timeline de Lanzamientos de Modelos IA

Una timeline cronológica de los principales lanzamientos de modelos IA

2026

Moonshot AImultimodal2.8T (MoE)Código AbiertoHito

Kimi K3: El Nuevo Gigante Multimodal de Moonshot AI que Redefine el Open Weights

Lanzado el 16 de julio de 2026

2.8 trillion parameter mixture-of-experts model with 1 million token context window and native multimodal capabilities

Introduces Kimi Delta Attention, enabling up to 6.3x faster decoding in million-token contexts

Attention Residuals technique delivers ~25% higher training efficiency at less than 2% additional cost

Built for long-horizon agentic coding and self-evolving workflows

Demonstrates frontier-level performance across evaluations, outperforming other tested open models while trailing Claude Fable 5 and GPT 5.6 Sol

Live on Kimi.com, Kimi Work, Kimi Code, and the Kimi API

Open weights release planned by July 27, 2026

Official Kimi API achieves above 90% cache hit rate in coding workloads via Mooncake disaggregated inference architecture

Leer el artículo completo

SpaceXAIcoding modelReasoning model (low/medium/high)Cerrado

Grok 4.5: La Nueva Frontera de la Ingeniería de Software y Modelos de Razonamiento

Lanzado el 8 de julio de 2026

Grok 4.5 is SpaceXAI frontier model built for coding, agentic tasks, and knowledge work.

Trained in SpaceXAI Memphis data centers on new datasets spanning science, engineering, and math.

Served at 80 TPS (tokens per second), faster than flash-class models.

Delivers 4.2x fewer output tokens than Claude Opus 4.8 (max) on SWE Bench Pro (15,954 vs 67,020 avg tokens).

Scores 83.3% on Terminal-Bench 2.1 vs 78.9% for Claude Opus 4.8.

Reaches 62% on DeepSWE 1.0 vs 55.8% for Claude Opus 4.8.

Claude Opus 4.8 still leads on SWE-Bench Multilingual (84.4% vs Grok 4.5 78%).

Claude Opus 4.8 also leads SWE-Bench Pro (69.2% vs Grok 4.5 64.7%).

Supports reasoning modes: low, medium, or high (default high).

Available via Responses API and Chat Completions, with function calling, web search, X search, and code execution tools.

Not yet available in the EU; EU availability expected mid-July.

Cursor participated in training and published benchmark results.

Leer el artículo completo

Tencentopen source295B (MoE)Hito

Hy3 de Tencent: El nuevo gigante Open Source que desafía a los modelos de billones de parámetros

Lanzado el 7 de julio de 2026

295B parameter Mixture-of-Experts (MoE) architecture

Best in its size class, rivaling trillion-scale flagship models

Apache 2.0 license, friendly for commercial use

Released as free tier on OpenRouter as tencent/hy3:free

Following the late-April preview, Tencent gathered feedback from 50+ products and scaled up post-training with higher-quality data

Blind evaluation with 270 domain experts scored Hy3 at 2.67/4, outperforming GLM-5.1 at 2.51/4

Largest advantages seen in frontend development, data & storage, and CI/CD tasks

Hallucination rate dropped from 12.5% to 5.4%

Commonsense error rates fell from 25.4% to 12.7%

Used 47.4% fewer tokens than GLM-5.2 for document processing

Used 49% fewer tokens than GLM-5.2 for presentation creation

Reliable and affordable for most agentic use cases

Leer el artículo completo

Poolsidecoding model33B (MoE, 3B activated per token)Código Abierto

Laguna XS 2.1: El Nuevo Estándar de Modelos MoE para Coding Agéntico

Lanzado el 2 de julio de 2026

33B total / 3B activated parameters Mixture-of-Experts model for agentic coding

SWE-bench Multilingual up 5.4 points to 63.1% versus XS.2

Same architecture as XS.2; gains come from a training and data refresh

OpenMDW-1.1 license — fully permissive, aligned with NVIDIA and Linux Foundation direction

DFlash open-weighted speculative decoders roughly double achieved tokens per second

256K context length served on Poolside API and OpenRouter

Quantized checkpoints available: FP8, INT4, NVFP4 (GGUF coming with llama.cpp)

Supported in vLLM, SGLang, NVIDIA TensorRT-LLM, HuggingFace Transformers, and Ollama

XS.2 will be sunset on Poolside API after 1 week; remains available via Baseten Model Library

Leer el artículo completo

Meituancoding model1.6T (MoE, ~48B active)Código AbiertoHito

LongCat-2.0: El Nuevo Rey del Código y la Ventana de Contexto Masiva

Lanzado el 30 de junio de 2026

1.6T parameter MoE model with ~48B active parameters and 1M token context window

The full model behind Owl Alpha available on OpenRouter, now released openly

LongCat Sparse Attention (LSA) scales efficiently for 1M-context tokens

Zero-Compute Experts dynamically activate 33B-56B parameters per token with zero wasted compute

MOPD architecture uses three specialized expert groups (Agent / Reasoning / Interaction), gate-routed per task

Scores 70.8 on Terminal-Bench 2.1

Scores 59.5 on SWE-bench Pro, beating GPT-5.5 at 58.6

Scores 77.3 on SWE-bench Multilingual

Scores 73.2 on FORTE, 78.8 on RWSearch, and 79.9 on BrowseComp

Leer el artículo completo

Liquid AIlanguage model230MCódigo Abierto

LFM2.5-230M: La Revolución de la IA On-Device de Liquid AI

Lanzado el 25 de junio de 2026

LFM2.5-230M is Liquid AI's smallest model yet, built to run fast anywhere (CPUs, NPUs, and GPUs) to enable agentic tasks on phones, robots, home and network automation devices.

It has 230M parameters and is built on the LFM2 architecture.

Pre-trained on 19T tokens, with a 32K context extension.

Post-trained with distillation from LFM2.5-350M.

Achieves 213 tok/s decode speed on a Samsung Galaxy S25 Ultra (CPU) and 42 tok/s on a Raspberry Pi 5 (CPU).

Competes with and often beats models more than twice its size on instruction following, data extraction, and tool use.

On Raspberry Pi 5 and Qualcomm Snapdragon Gen4 (Galaxy S25 Ultra), it delivers the highest prefill and decode throughput in its class while keeping the smallest memory footprint.

Available on all platforms: llama.cpp (GGUF) for edge, MLX for Apple Silicon, vLLM and SGLang for GPU serving, and ONNX for cross-platform deployments.

In an early demo, LFM2.5-230M was deployed on a Unitree G1 robot running entirely on-device on its onboard Jetson Orin, acting as a skill-selection layer that decomposes natural-language instructions into structured multi-step tool-call plans.

For production-grade enterprise deployments, Liquid AI developed an internal GPU inference stack that delivers extremely low-latency serving, with LFM2.5-230M achieving considerably lower end-to-end latency than other small models on SGLang across all concurrency levels.

Well-suited for large-scale data extraction pipelines and lightweight on-device agentic workloads.

LFM2.5-230M and LFM2.5-230M-Base are available now.

Leer el artículo completo

Deep Reinforce AIcoding model9B Dense, 31B Dense, 35B MoE, 397B MoECódigo AbiertoHito

Ornith-1.0: La Nueva Familia de Modelos de Codificación Agentic de Deep Reinforce AI Rompe Récords

Lanzado el 25 de junio de 2026

Ornith-1.0 is a family of agentic coding models spanning four parameter sizes: 9B Dense, 31B Dense, 35B MoE, and 397B MoE.

It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks: Terminal-Bench 2.1 (77.5), SWE-Bench Verified (82.4), SWE-Bench Pro (62.2), SWE-Bench Multilingual (78.9), NL2Repo (48.2), SWE Atlas (41.2 QnA / 42.6 RF / 39.1 TW), and ClawEval (77.1).

The models are post-trained on top of Gemma 4 and Qwen 3.5 base models.

Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts, jointly optimizing scaffold and solution for higher-quality agentic coding.

Ornith-1.0-397B (77.5 on Terminal-Bench 2.1, 82.4 on SWE-Bench Verified) matches the performance of Claude Opus 4.7 (70.3 on TB-2.1, 80.8 on SWE-Bench Verified).

Ornith-1.0-397B outperforms leading open-source models of similar size including MiniMax M3 (66.0 TB-2.1, 80.5 SWE-Bench Verified) and DeepSeek-V4-Pro (67.9 TB-2.1, 80.6 SWE-Bench Verified).

Ornith-1.0-9B, deployable on edge devices, matches or exceeds the performance of much larger models such as Gemma 4-31B and Qwen 3.6 35B.

All models are released under the MIT license, enabling full commercial and research use.

Leer el artículo completo

Sakana AIlanguage modelmulti-agent orchestration system, OpenAI-compatible APICerradoHito

Sakana Fugu: La Revolución de la Orquestación Multi-Agente Llega a la IA

Lanzado el 22 de junio de 2026

Sakana AI launched Sakana Fugu on 22 June 2026 as a multi-agent orchestration system that behaves like a single model behind one OpenAI-compatible API.

Fugu is itself a language model trained to call other LLMs in an agent pool, including instances of itself called recursively; it handles model selection, delegation, verification and synthesis internally.

Instead of hard-coded roles, Fugu learns how to coordinate — deciding when to delegate, how agents communicate, and how to combine their outputs into one answer.

Ships in two variants: Fugu (balanced performance/low latency, everyday coding, review, chatbots, Codex-compatible) and Fugu Ultra (max quality on hard multi-step problems).

Fugu allows opting specific agents out of its pool for data, privacy and compliance needs; Fugu Ultra has a fixed pool with no opt-out.

Current Fugu Ultra model ID is fugu-ultra-20260615.

Fugu Ultra leads most published coding and reasoning benchmarks, and the orchestrator beats the individual models it coordinates.

Sakana frames Fugu as a hedge against single-vendor dependency — if a provider restricts access (the team cites recent export controls on Anthropic Fable and Mythos models), Fugu routes around it.

Routing is proprietary: per-query model selection stays hidden from the caller.

Subscription tiers (monthly, both Fugu and Fugu Ultra included): Standard $20/month, Pro $100/month (10x Standard), Max $200/month (30x Standard); subscribe before end of July 2026 for a free second month.

Leer el artículo completo

Zhipu AI (Z.AI)language modelCódigo AbiertoHito

GLM-5.2 de Zhipu AI: el modelo open source de 1M tokens que cambia la ingeniería con IA

Lanzado el 16 de junio de 2026

Flagship foundation model with truly usable 1M-token context window for project-scale engineering

128K maximum output tokens for long-form generation

Open-source under MIT license with weights available on HuggingFace and ModelScope

Introduces IndexShare architecture reducing per-token FLOPs by 2.9x at 1M context length

Improved speculative decoding with 20% acceptance length increase via MTP with IndexShare and KVShare

Highest-ranked open-source model on FrontierSWE (74.4%), trailing Claude Opus 4.8 by only 1%

Strongest open-source coding model: 81.0 on Terminal-Bench 2.1, 62.1 on SWE-bench Pro

Supports multiple thinking effort levels (High and Max) to balance performance and latency

Achieves 99.2% on AIME 2026 and 91.2% on GPQA-Diamond reasoning benchmarks

Introduces anti-hack module for coding RL training to prevent reward hacking

Supports function calling, context caching, structured output, streaming, and MCP integration

Leer el artículo completo

Moonshot AIcoding modelopen-weightsCódigo Abierto

Kimi K2.7 Code de Moonshot AI: open-weights, 256k contexto y razonamiento profundo para código

Lanzado el 12 de junio de 2026

+21.8% improvement on Kimi Code Bench v2 vs K2.6

+11% improvement on Program Bench vs K2.6

+31.5% improvement on MLS Bench Lite vs K2.6

30% fewer tokens on internal reasoning vs K2.6

256k context window with long thinking and deep reasoning support

Native multimodal architecture supporting text, image, and video input

Supports ToolCalls, JSON Mode, Partial Mode, and automatic context caching

Open-source weights available on HuggingFace

Available via API and in Kimi Code IDE

Separate 6x High-Speed mode coming soon

Beta program open for early access to future updates

Leer el artículo completo

Coherecoding modelopen-weights, Apache 2.0Código Abierto

North Mini Code: El Nuevo Estándar Open-Source para Ingeniería de Software con Cohere

Lanzado el 9 de junio de 2026

North Mini Code is a code generation and reasoning model released by Cohere on 9 June 2026.

Released under the Apache 2.0 license as open source, allowing self-hosting and commercial use.

Supports a 256K token context window with up to 64K tokens of output, suited to large repositories and long technical documents.

Text-in, text-out only (no multimodal inputs or outputs).

Scores 75.7% on GPQA Diamond, placing it among advanced generalist reasoning models.

Additional benchmarks: SciCode 38.2%, TAU2-bench 37.4%, TerminalBench-Hard 31.1%, LiveCodeBench Reasoning 32.3%, IF-Bench 57.6%.

Originally listed at $0/M tokens; Cohere has since moved the model to paid pricing — check the official Cohere pricing page for current rates.

Well suited as a local code assistant via Ollama or llama.cpp for audits, test generation, refactors and automated PR review on large monorepos.

Leer el artículo completo

Anthropiclanguage modelCerradoHito

Claude Fable 5: La Era de los Modelos de Clase Mythos ha Llegado

Lanzado el 9 de junio de 2026

Claude Fable 5 is a Mythos-class model made safe for general use

State-of-the-art on nearly all tested AI capability benchmarks

Exceptional performance in software engineering, knowledge work, vision, and scientific research

The longer and more complex the task, the larger Fable 5 lead over other Anthropic models

Safeguards route some cybersecurity queries to Claude Opus 4.8 instead; triggers in less than 5% of sessions on average

Claude Mythos 5 is the same underlying model with safeguards lifted in some areas

Mythos 5 deployed through Project Glasswing with the US government as an upgrade to Claude Mythos Preview

Mythos 5 has the strongest cybersecurity capabilities of any model in the world

Access to Mythos 5 will expand through a broader trusted access program

Pricing is $10 per million input tokens and $50 per million output tokens — less than half the price of Claude Mythos Preview

Available via the Claude API as claude-fable-5

Leer el artículo completo

MindLab Researchopen source749B (MoL): 744B base + 5 × 1B LoRAHito

Macaron-V1-Preview-749B: La Revolución del Mixture-of-LoRA y el Futuro de los Agentes

Lanzado el 7 de junio de 2026

749B-class Mixture-of-LoRA (MoL) agent model: 744B frozen base (GLM-5.1) + 5 specialist LoRA adapters (~1B each)

Post-trained from GLM-5.1 using MinT post-training infrastructure

Five specialist LoRAs: L0 (default chat/general), L1 (personal-life tasks), L2 (coding), L3 (A2UI Generative UI), L4 (OpenClaw-style agent tasks)

Router Tool design exposes model selection as a standard tool call, routing between LoRAs via an explicit change_model tool — debuggable and compatible with vLLM OpenAI server mode

202,752 token context window, bfloat16 precision

Released under MIT open-source license on Hugging Face as a single repository (base at root, LoRAs under l0/ through l4/)

Co-designed with production Agent Harness via Harness Context Protocol (HCP) — same routing, memory, and tool-call tokenization during training and serving

R3 (Rollout Routing Replay) for provable expert-path alignment during MoE RL training, combined with IcePop-style rollout correction and DSA attention alignment fixes

Self-evolution capability through AutoResearch + Context Learning loop: model improves its own prompts and scaffolds, then distills improved trajectories back into parameters

Macaron LivingBench: in-house dynamic personal-life agent benchmark with coupled dynamic noise, dynamic environment, and dynamic user simulation

Trained on A2UI protocol for Generative UI with 3ms TPOT latency in interactive scenarios via TileRT collaboration

A2UI-Bench evaluates protocol correctness, task construction correctness, and real user-experience lift with visual-side evaluation

MindForge agentic RL training framework brings the production harness directly into the RL loop

Live preview available at macaron.im; 30B and 200B open-source variants planned for V1 non-preview release

Managed inference and post-training coming soon on MinT platform

Leer el artículo completo

NVIDIAopen source550B (MoE, 55B active)Hito

Nemotron 3 Ultra: El Nuevo Hito de NVIDIA en la Era de los Modelos Open-Source

Lanzado el 4 de junio de 2026

550B total parameters with 55B active parameters using Mixture-of-Experts architecture

Hybrid Mamba-Attention architecture with LatentMoE for improved expert routing

Multi-Token Prediction (MTP) layers for faster inference through native speculative decoding

Pretrained in NVFP4 precision, running on Hopper, Blackwell, and Ampere GPUs with a single checkpoint

Post-trained with SFT, Reinforcement Learning, and Multi-Teacher On-Policy Distillation (MOPD) using 10+ specialized teacher models

Achieves 5.9x, 4.8x, and 1.6x higher inference throughput vs GLM-5.1, Kimi-K2.6, and Qwen-3.5 respectively on 8k/64k token setting

Supports context length up to 1M tokens, outperforming state-of-the-art open LLMs on RULER at 1M context

Lowers cost of complex agentic tasks by up to 30% while delivering frontier accuracy

Fully open: weights, training data (173B code tokens, legal data, specialized data), and training recipes all released

Licensed under OpenMDW 1.1, the Linux Foundation permissive license for open AI model distributions

Available in multiple checkpoints: NVFP4, BF16, Base BF16, and GenRM

Deployable on-premise, on-cloud, or at-the-edge via NVIDIA NIM and major providers

Leer el artículo completo

Googlemultimodal12BCódigo Abierto

Gemma 4 12B: La Revolución Multimodal de Google que Corre en tu Laptop

Lanzado el 3 de junio de 2026

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, released under an Apache 2.0 license.

Laptop Ready: Small enough to run locally with just 16GB of VRAM or unified memory.

Unified Architecture: Multimodal tokens flow directly into the LLM backbone. No additional encoders are needed.

Advanced Reasoning: Gemma 4 12B delivers benchmark performance nearing the larger 26B model, but at less than half the memory footprint. Unlocks powerful multi-step reasoning and agentic workflows.

Vision Embedding: The vision encoder is replaced with a lightweight 35M-parameter module. By injecting spatial information directly into the token embeddings, the unified model takes over visual understanding.

Broad Ecosystem Support: Weights available on Hugging Face and Kaggle, compatible with llama.cpp, MLX, LM Studio, vLLM, and SGLang.

Bridges the gap between edge efficiency and advanced reasoning, making it the best model for self/local hardware on a low budget.

Leer el artículo completo

Nex AGIopen source397B total, 17B active (MoE)Hito

Nex-N2-Pro: El Nuevo Rey del Open-Source y la Era de la Inteligencia Agéntica

Lanzado el 2 de junio de 2026

MoE (Mixture of Experts) architecture with 397B total parameters and 17B active parameters

Post-trained on Qwen3.5-397B-A17B base model

262K token context window with up to 256K output tokens

Accepts text and image input, produces text output

Released under Apache-2.0 open-source license

Scores 75.3 on Terminal-Bench 2.1 for coding tasks

Scores 1585 on GDPval for long-running workflows

Achieves SOTA among open-source models on SWE-Verified, SWE-Pro, and DeepSWE benchmarks

Agentic Thinking capability: unifies reasoning, tool use, and environmental execution in a closed loop (comprehension → planning → implementation → feedback → debug → iteration)

Adaptive reasoning depth reduces thinking tokens by 30-50% compared to always-on reasoning

Available on Hugging Face, ModelScope, SiliconFlow (early access serverless), and OpenRouter

Can run locally via llama.cpp, Ollama, and similar tools

Native integration with Claude Code, Cursor, OpenClaw, and other agentic harnesses

Rivals GPT-5.5 and Opus 4.7 performance, reaching top-tier level

Also available as Nex-N2-mini variant

Leer el artículo completo

Alibaba CloudmultimodalCerrado

Qwen3.7-Plus: La Nueva Era de los Agentes Multimodales Híbridos

Lanzado el 1 de junio de 2026

Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks

Versatile coding agent & productivity assistant with full-modality input

Visual Agent: perception, reasoning, grounding, and search-augmented QA

Cross-harness generalization across diverse agent frameworks

Multimodal improvements extend beyond isolated visual understanding gains

Leer el artículo completo

MiniMaxopen sourceopen-weightsHito

MiniMax-M3: El Hito del Open-Source que Desafía a los Modelos Propietarios

Lanzado el 1 de junio de 2026

Achieves top-tier performance on coding and agentic benchmarks with autonomous task decomposition, tool use, and multi-step reasoning capabilities

Powered by proprietary MiniMax Sparse Attention (MSA) architecture supporting 1M token context window with guaranteed minimum of 512K tokens

Natively multimodal model

On BrowseComp, scores 83.5 surpassing Opus 4.7 (79.3)

First open model to simultaneously achieve frontier coding capabilities, million-token context, and multimodal support

API pricing: input $0.60/M tokens (≤ 512k), $1.20/M tokens (> 512k); output $2.40/M tokens (≤ 512k), $4.80/M tokens (> 512k); prompt caching $0.12/M tokens (≤ 512k), $0.24/M tokens (> 512k)

Leer el artículo completo

StepFunmultimodal198B (sparse MoE, ~11B active)Código Abierto

Step-3.7-Flash: La Nueva Frontera de la IA Multimodal de Alta Eficiencia

Lanzado el 29 de mayo de 2026

#1 on ClawEval-1.1 with a score of 67.1

#1 on SimpleVQA Search with a score of 79.2

#2 on SWE-PRO with a score of 56.3

95.3 on V* Python benchmark

400 tokens per second throughput

198B total parameters, ~11B active (sparse MoE architecture)

256K context window with 3 reasoning levels

Native multimodal — understands UIs, charts, documents, and images, then writes code or calls tools

Web + visual search with more sources and deeper follow-up

98%+ on τ²-bench across all difficulty levels for reliable tool use

Open weights released under Apache 2.0 license

Leer el artículo completo

Anthropiclanguage modelclosed-weightsCerradoHito

Claude Opus 4.8: El Nuevo Estándar de Oro en Agentes de IA y Codificación Profesional

Lanzado el 28 de mayo de 2026

Builds on Opus 4.7 with stronger performance across coding, agentic tasks, and professional work

Around 4x less likely than its predecessor to allow flaws in code it has written to pass unremarked

Higher honesty — more likely to flag uncertainties and less likely to make unsupported claims

Only model to complete every case end-to-end on the Super-Agent benchmark, beating prior Opus models and GPT-5.5 at parity on cost

Scored 84% on Online-Mind2Web, making it the strongest computer-use and browser-agent model tested

Highest score recorded on the Legal Agent Benchmark and first model to break 10% on the all-pass standard

Rates of misaligned behavior substantially lower than Opus 4.7, similar to Claude Mythos Preview

Launches alongside dynamic workflows in Claude Code for running hundreds of parallel subagents

New effort control feature in claude.ai lets users choose how much effort Claude puts into a response

Fast mode runs at 2.5x speed and is now 3x cheaper than for previous Opus models

API model ID is claude-opus-4-8; Messages API now accepts system entries inside the messages array

Leer el artículo completo

Alibaba Qwenlanguage modelCerradoHito

Qwen3.7-Max: La Revolución de los Agentes Autónomos de Alibaba

Lanzado el 20 de mayo de 2026

Agentic flagship model designed for autonomous agents capable of coding, orchestrating workflows via MCP, and sustaining long action chains on multi-step tasks

35-hour autonomous kernel optimization test with over 1,000 tool calls without breaking the reasoning chain

Good cross-framework generalization including Claude Code, OpenClaw, and Qwen Code

Currently offers a pure text-only interface for public experimentation

Deep thinking capabilities for complex reasoning tasks

Deployed via Aliyun Bailian API at $2.5 input / $7.5 output per million tokens

Leer el artículo completo

GooglemultimodalCerradoHito

Gemini 3.5 Flash: El Nuevo Estándar de Oro para Agentes de IA y Codificación de Alta Velocidad

Lanzado el 19 de mayo de 2026

Google high-efficiency multimodal model delivering near-Pro level coding and reasoning at Flash-tier cost and speed

Supports text, image, video, audio, and PDF inputs natively

Defaults to medium thinking effort with full support for thinking levels: minimal, low, medium, high for fine-grained cost/performance tuning

Surpasses Gemini 3.1 Pro on coding and agentic benchmarks: Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), MCP Atlas (83.6%)

Leads multimodal understanding with 84.2% on CharXiv Reasoning benchmark

4x faster output tokens per second compared to other frontier models

Ranked in the upper-right quadrant of the Artificial Analysis Intelligence Index for top intelligence at exceptional speed

Ideal for long-horizon agentic tasks at less than half the cost of competing frontier models

Integrates with Antigravity for collaborative sub-agent deployment at enterprise scale

1M token context window

Leer el artículo completo

Baidulanguage modelMoE (compressed to ~1/3 total params, ~1/2 active params of ERNIE-5.0)Cerrado

ERNIE-5.1-Preview: El Nuevo Líder en Eficiencia de Baidu

Lanzado el 9 de mayo de 2026

Tops LMArena Search leaderboard as #4 globally and #1 among Chinese models with a score of 1,223

Math: #9 globally on LMArena category leaderboards

Legal & Government: #1 globally on LMArena category leaderboards

Business, Management & Financial Ops: #4 globally on LMArena category leaderboards

Software & IT Services: #7 globally on LMArena category leaderboards

Compresses total parameters to approximately 1/3 and active parameters to approximately 1/2 of ERNIE-5.0

Achieves leading performance using only about 6% of the pre-training cost of comparable models

Uses scaled agentic post-training with Multi-Teacher On-Policy Distillation (MOPD)

Scores 99.6 on AIME26 with tool use, second only to Gemini 3.1 Pro

Surpasses DeepSeek-V4-Pro on tau-cubed-bench and SpreadsheetBench-Verified agent evaluation tasks

Four-stage post-training pipeline: SFT, Domain Expert Model Training, On-Policy Distillation, and General Online RL

Based on Once-For-All elastic training framework with elastic depth, width, and sparsity dimensions

Rolling out on 10+ creative production agent platforms including ISEKAI ZERO and Mulan AI

Creative writing capabilities approach those of Gemini 3.1 Pro

Leer el artículo completo

SpaceXAIreasoningCerrado

xAI Grok 4.3: Nuevo Modelo de Razonamiento Agéntico

Lanzado el 30 de abril de 2026

Reasoning model from xAI with configurable effort levels (none/low/medium/high, default low)

Accepts text and image inputs with text output

Suited for agentic workflows, instruction-following tasks, and high factual accuracy applications

1 million token context window with no output token limit

Well-suited for long-document analysis, deep research, and multi-step agentic tasks

Tiered pricing: requests exceeding 200k total tokens are billed at a higher rate

Leer el artículo completo

Mistral AIopen source128B denseHito

Mistral Medium 3.5: La Revolución del Open Source en 2026

Lanzado el 29 de abril de 2026

New flagship model merging instruction-following, reasoning, and coding into a single 128B dense architecture

Released as open weights under a modified MIT license

Runs self-hosted on as few as four GPUs

API pricing at $1.50/mtok input and $7.50/mtok output

Powers the new Mistral Vibe remote agents for async cloud coding sessions

Drives Work mode in Le Chat for multi-step agentic task execution with parallel tool calling

Sessions can be spawned from CLI or Le Chat, and local CLI sessions can be teleported to the cloud

Leer el artículo completo

NVIDIAmultimodal30B-A3B (MoE)Código Abierto

NVIDIA Nemotron 3 Nano Omni: El Futuro de los Agentes Multimodales

Lanzado el 28 de abril de 2026

Multimodal model unifying video, audio, image, and text understanding in a single architecture

Hybrid Mixture-of-Experts (MoE) 30B-A3B architecture with 30B total and 3B active parameters

Up to 9x higher throughput compared to similar open omnimodal models

256K unified context window with single-pass perception

Hybrid architecture combining Mamba layers for memory efficiency and transformers for precise reasoning

Integrates vision encoders (C3D for video) and audio encoders (Paraquet), eliminating need for separate models

Supports FP8/NVFP4 quantization with optimized inference on NVIDIA Ampere, Hopper, and Blackwell GPUs

Designed for enterprise multimodal agents: document intelligence (OCR, tables), GUI navigation, audio-video reasoning

Runs locally with 25-36GB RAM in 4/8-bit quantization via Unsloth or vLLM

Available on Hugging Face, Ollama, OpenRouter, and NVIDIA NIM

Leer el artículo completo

poolsidecoding model225B total (MoE), 23B activated per tokenCerradoHito

Laguna-M.1: El Nuevo Estándar en Modelado de Código

Lanzado el 28 de abril de 2026

225B total parameter Mixture-of-Experts model with 23B activated parameters per token

Poolside most capable model to date, completed pre-training at end of 2025

Trained from scratch on 30T tokens using Muon optimizer

Trained on 6,144 interconnected NVIDIA Hopper GPUs entirely in-house

Achieves 72.5% on SWE-bench Verified, 67.3% on SWE-bench Multilingual, 46.9% on SWE-bench Pro, 40.7% on Terminal-Bench 2.0

128K context window with up to 8K output tokens

Agentic coding model built for long-horizon software engineering tasks

Foundation for the entire Laguna model family

Uses custom async on-policy RL system with Agent Client Protocol (ACP) server

Free to use for a limited time via poolside API and OpenRouter

Weights available on request for startups, institutions, and universities

Leer el artículo completo

poolsidecoding model33B total (MoE), 3B activated per tokenCódigo AbiertoHito

Laguna-XS.2: El Nuevo Estándar en Modelos de Código Abierto

Lanzado el 28 de abril de 2026

33B total parameter Mixture-of-Experts model with 3B activated parameters per token

First open-weight release from poolside, licensed under Apache 2.0

Trained on 30T tokens using Muon optimizer

Supports native reasoning with interleaved thinking between tool calls

Uses Sliding Window Attention with per-head gating in 30 of 40 layers

KV cache quantized to FP8 for reduced memory per token

Compact enough to run locally on a Mac with 36 GB RAM

128K context window with up to 8K output tokens

Achieves 68.2% on SWE-bench Verified, 62.4% on SWE-bench Multilingual, 44.5% on SWE-bench Pro, 30.1% on Terminal-Bench 2.0

Supports vLLM, Transformers, TRT-LLM, and Ollama

Agentic coding model built for long-horizon software engineering tasks

Free to use for a limited time via poolside API and OpenRouter

Leer el artículo completo

DeepSeek AIopen sourceV4-Pro: 1.6T total / 49B active (MoE) | V4-Flash: 284B total / 13B active (MoE)Hito

DeepSeek-V4: El Nuevo Estándar en Modelos Abiertos de IA (2026)

Lanzado el 24 de abril de 2026

Deux modèles : DeepSeek-V4-Pro (1.6T total / 49B active params) et DeepSeek-V4-Flash (284B total / 13B active params)

Context length de 1M tokens, output max de 384K tokens

Support thinking mode (par défaut) et non-thinking mode

Pricing ultra-agressif : Flash à $0.14/M input tokens (cache miss), $0.028/M (cache hit), $0.28/M output — soit ~7x moins cher que Claude Opus 4.7

Pro à $1.74/M input tokens (cache miss), $0.145/M (cache hit), $3.48/M output

Modèles open-source, poids disponibles sur HuggingFace

Compatible format API OpenAI et Anthropic (https://api.deepseek.com et https://api.deepseek.com/anthropic)

Support JSON output, Tool Calls, Chat Prefix Completion (Beta), FIM Completion (Beta)

Performance rivalisant avec les meilleurs modèles closed-source mondiaux

Leer el artículo completo

OpenAIlanguage modelUndisclosed (frontier model)CerradoHito

GPT-5.5: La Nueva Era de la Inteligencia Artificial de OpenAI

Lanzado el 23 de abril de 2026

GPT-5.5 is OpenAI smartest and most intuitive to use model yet, described as the next step toward a new way of getting work done on a computer

Achieves 82.7% on Terminal-Bench 2.0, 73.1% on Expert-SWE (Internal), and 84.9% on GDPval — all state-of-the-art scores

Matches GPT-5.4 per-token latency while performing at a much higher level of intelligence

Significantly more token efficient — uses fewer tokens to complete the same tasks compared to GPT-5.4

Scores 78.7% on OSWorld-Verified for real computer environment operation and 81.8% on CyberGym

GPT-5.5 Pro achieves 90.1% on BrowseComp and 52.4% on FrontierMath Tier 1-3

On SWE-Bench Pro, reaches 58.6% solving more tasks end-to-end in a single pass than previous models

Proactively deployed with industry-leading cybersecurity safeguards, classified as High under OpenAI Preparedness Framework

Helped discover a new proof about Ramsey numbers in combinatorics, later verified in Lean

Scores 25.0% on GeneBench for multi-stage scientific data analysis in genetics

API pricing: $5/1M input tokens and $30/1M output tokens with 1M context window

GPT-5.5 Pro API pricing: $30/1M input tokens and $180/1M output tokens

Co-designed, trained with, and served on NVIDIA GB200 and GB300 NVL72 systems

Rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex

GPT-5.5 Thinking unlocks faster help for harder problems with smarter, more concise answers

Outperforms Claude Opus 4.7 and Gemini 3.1 Pro on most coding and professional benchmarks

More than 85% of OpenAI now uses Codex every week across all company functions

Leer el artículo completo

Xiaomilanguage model1T+ total (42B active, MoE)CerradoHito

Xiaomi MiMo-V2.5-Pro: El Nuevo Estándar de IA con 1 Trillón de Parámetros

Lanzado el 22 de abril de 2026

Multimodal Mixture-of-Experts (MoE) architecture with 1T+ total parameters (42B active)

Extended context window up to 1M tokens

Native multimodal perception supporting text, images, video, and audio

Advanced autonomous agent capabilities handling 1000+ tool calls

40-60% better token efficiency compared to Claude Opus and GPT-5.x

ClawEval benchmark: 64% Pass@3 score

SWE-bench Pro: 57.2% task resolution rate

Surpasses Claude 4.6 Sonnet in coding tasks, approaches Claude Opus in agentic performance

Part of the MiMo-V2.5 family alongside MiMo-V2.5 and MiMo-V2.5-TTS

Available via mimo.mi.com with affordable token plans (monthly/annual subscriptions)

Leer el artículo completo

Alibaba Cloudlanguage model27BCódigo AbiertoHito

Qwen3.6-27B: El Nuevo Estándar en IA Abierta y Código

Lanzado el 22 de abril de 2026

27B dense open-source model with Apache 2.0 license

Surpasses Qwen3.5-397B-A17B on all major agentic coding benchmarks

SWE-bench Verified: 77.2 vs 76.2, Terminal-Bench 2.0: 59.3 vs 52.5, SkillsBench: 48.2 vs 30.0

Supports both multimodal thinking and non-thinking modes natively

Native vision-language support for images and video understanding

GPQA Diamond: 87.8, competitive with models several times its size

Compatible with OpenClaw, Claude Code, and Qwen Code coding assistants

Available on Hugging Face, ModelScope, and Alibaba Cloud Model Studio API

Leer el artículo completo

Moonshot AIopen sourceopen-weightsHito

Kimi K2.6 de Moonshot AI: El Nuevo Líder Open-Source en Agentes y Código

Lanzado el 20 de abril de 2026

Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python (86.7), Math Vision w/ python (93.2)

Long-horizon coding: 4,000+ tool calls, over 12 hours continuous execution

Generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization)

300 parallel sub-agents x 4,000 steps per run (up from K2.5: 100 / 1,500)

Proactive Agents: powers OpenClaw, Hermes Agent for 24/7 autonomous ops

Claw Groups research preview: bring your own agents, command friends bots & humans in the loop

API PRICING (use these EXACT values): kimi-k2.6 — Input $0.16/M tokens (cache hit), Input $0.95/M tokens (cache miss), Output $4.00/M tokens, Context Window 262,144 tokens

Sources: https://platform.moonshot.ai, https://kimi.com/blog/kimi-k2-6, https://huggingface.co/moonshotai/Kimi-K2.6

Live on kimi.com in chat and agent mode, plus Kimi Code at https://kimi.com/code for production-grade coding

Leer el artículo completo

AnthropicreasoningCerradoHito

Claude Opus 4.7: El Nuevo Estándar en Razonamiento y Desarrollo de Software

Lanzado el 16 de abril de 2026

Most capable generally available Anthropic model for complex reasoning and agentic coding

High-resolution image support: 2576px / 3.75MP (up from 1568px / 1.15MP) with 1:1 pixel mapping

New "xhigh" effort level for coding and agentic use cases

Task budgets (beta) — advisory token budget across full agentic loops

128K max output tokens, 1M context window at standard pricing

+12 points on CursorBench coding benchmarks vs Opus 4.6

New tokenizer (up to ~35% more tokens per text, improved performance)

Adaptive thinking only — extended thinking budgets removed

Sampling parameters (temperature, top_p, top_k) removed

Pricing: $5/$25 per MTok input/output, batch $2.50/$12.50 per MTok

Leer el artículo completo

Zhipu AIreasoning744B MoE (40B active)Código AbiertoHito

GLM-5.1: El Nuevo Estándar de Reasoning Abierto Fuente

Lanzado el 7 de abril de 2026

#1 on SWE-Bench Pro (58.4%), beating GPT-5.4 and Claude Opus 4.6

Post-training upgrade to GLM-5 — same 744B MoE architecture (40B active)

Trained entirely on Huawei Ascend chips — no NVIDIA hardware

MIT license, compatible with Claude Code and OpenClaw

202K context window, strong on cybersecurity (CyberGym 68.7%)

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude Opus 4.6 Fast: Análisis Técnico y Despliegue

Lanzado el 7 de abril de 2026

Faster variant of Claude Opus 4.6 with comparable intelligence

Leer el artículo completo

AnthropicreasoningCerradoHito

Claude Mythos Preview: El Nuevo Límite de la IA de Anthropic

Lanzado el 7 de abril de 2026

New Capybara tier above Opus — the most powerful Anthropic model

93.9% on SWE-bench Verified, 77.8% on SWE-bench Pro

97.6% on USAMO 2026, 94.5% on GPQA Diamond

1M context window, limited preview for ~50 partner organizations

Leer el artículo completo

Google DeepMindopen source31BHito

Google DeepMind lanza Gemma 4: El Futuro del Open Source en IA

Lanzado el 2 de abril de 2026

Google's most capable open models, built from Gemini 3 research

Four sizes: E2B, E4B, 26B MoE (3.8B active), 31B Dense

First Gemma release under Apache 2.0 license

Native multimodal, 140+ languages, up to 256K context

Agent-ready with function calling and structured JSON output

Leer el artículo completo

Zhipu AImultimodalCerrado

GLM-5V Turbo de Zhipu AI: El Nuevo Estándar en Modelos Multimodales para Agentes

Lanzado el 1 de abril de 2026

Vision + Code model from Z.ai

Multimodal coding capabilities

API only

Leer el artículo completo

Alibaba Cloudlanguage modelCerrado

Qwen 3.6 Plus: El Nuevo Estándar en Razonamiento Agente y Código

Lanzado el 31 de marzo de 2026

1M token context window with always-on chain-of-thought reasoning

78.8% on SWE-bench Verified — competitive with Claude Opus 4.6

2-3x faster output speed than Claude Opus 4.6

Free preview via OpenRouter, successor to Qwen 3.5

Leer el artículo completo

Mistral AImultimodalCódigo Abierto

Mistral AI presenta Voxtral TTS: El modelo de voz abierto que desafía a ElevenLabs

Lanzado el 23 de marzo de 2026

Mistral's first audio model — direct competitor to ElevenLabs

Zero-shot voice cloning with multilingual support

Real-time streaming capabilities

Open weights under CC BY-NC 4.0 (non-commercial)

Leer el artículo completo

Xiaomireasoning309B MoECódigo Abierto

Xiaomi MiMo-V2-Pro: El Nuevo Estándar en Razonamiento IA Open Source

Lanzado el 18 de marzo de 2026

Xiaomi reasoning model with strong math and code performance

309B MoE architecture

Leer el artículo completo

MiniMaxcoding model230B MoE (10B active)Código Abierto

MiniMax M2.7: El Modelo de Autoevolución que Rivaliza con GPT-5

Lanzado el 18 de marzo de 2026

Self-evolving agent model — first to participate in its own development

56.22% on SWE-Pro, matching GPT-5.3-Codex

57.0% on Terminal Bench 2, GDPval-AA ELO 1495 (highest open-source)

230B MoE (10B active), 200K context, open weights on HuggingFace

Agent Teams for native multi-agent collaboration

Leer el artículo completo

OpenAIlanguage modelCerrado

OpenAI GPT-5.4 Mini: Eficiencia y Uso Nativo de Computadora

Lanzado el 17 de marzo de 2026

Efficient variant of GPT-5.4 with native computer use

Lower cost while maintaining strong reasoning capabilities

Leer el artículo completo

Mistral AIcoding model119B MoE (6.5B active)Código Abierto

Leanstral Mistral: Primer Agente Open Source para Lean 4

Lanzado el 16 de marzo de 2026

First open-source code agent for Lean 4 formal proof engineering

Generates code AND machine-checkable mathematical proofs

119B MoE with 6.5B active, outperforms Claude Sonnet 4.6 on FLTEval

Apache 2.0 license, 15x cheaper than Claude Opus for formal verification

Leer el artículo completo

Mistral AIopen source119B MoE (6.5B active)

Mistral Small 4: El Nuevo Estándar Open Source de 2026

Lanzado el 16 de marzo de 2026

Unifies instruct, reasoning, coding, and multimodal in a single model

119B MoE with 6.5B active parameters, 256K context window

Replaces Magistral (reasoning), Pixtral (vision), and Devstral (coding)

Apache 2.0 license, configurable reasoning parameter

Leer el artículo completo

SpaceXAIlanguage modelCerrado

Grok 4.20: El Nuevo Líder en Agentes y Contexto Masivo

Lanzado el 12 de marzo de 2026

Beta release with parallel agents architecture

500K context window

Iterative improvement via user feedback

Leer el artículo completo

NVIDIAopen source120B MoE (12B active)

NVIDIA Nemotron 3 Super: El Nuevo Estándar para Agentes IA Abiertos

Lanzado el 11 de marzo de 2026

Open MoE model from NVIDIA

120B total parameters with 12B active

Strong enterprise performance

Leer el artículo completo

OpenAIlanguage modelCerrado

OpenAI GPT-5.4: Análisis Técnico y Lanzamiento 2026

Lanzado el 6 de marzo de 2026

Latest OpenAI flagship with 1M token context window

Available in Standard, Mini, and Nano variants

Supports reasoning effort with 4 effort levels

128K max output tokens

Prompt caching with $0.02-$0.25/M cached read

Leer el artículo completo

Google DeepMindlanguage modelCerrado

Gemini 3.1 Flash Lite Preview: El Nuevo Estándar de Eficiencia en 2026

Lanzado el 3 de marzo de 2026

Google's high-efficiency model optimized for high-volume use cases

1M token context window, 65.5K max output

Supports prompt caching, reasoning effort, and reasoning budget

Native tool calling and vision capabilities

Leer el artículo completo

Google DeepMindmultimodalCerrado

Gemini 3.1 Pro: La Revolución del Razonamiento Multimodal

Lanzado el 19 de febrero de 2026

Google's latest flagship model

More than doubles reasoning performance over Gemini 3 Pro

Released in preview via Gemini API, AI Studio, and Vertex AI

Leer el artículo completo

SpaceXAIlanguage modelCerrado

Grok 4.2 de xAI: Análisis Técnico y Despliegue

Lanzado el 17 de febrero de 2026

Beta release with rapid learning architecture — improves weekly via user feedback

256K context window

4-agent parallel reasoning

Medical document analysis added

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude Sonnet 4.6: La Nueva Era del Razonamiento y Código en 2026

Lanzado el 17 de febrero de 2026

Most capable Sonnet yet with full upgrade across coding, computer use, long-context reasoning

1M token context window in beta

200K token context window, 64K max output

Supports prompt caching, reasoning effort, and reasoning budget

Native tool calling and vision capabilities

Leer el artículo completo

Alibaba Cloudlanguage model397B MoE (17B active)Cerrado

Qwen 3.5: El Nuevo Estándar en IA Agente y MoE Eficiente

Lanzado el 14 de febrero de 2026

Agentic AI model with built-in tools for web search and code execution

1M token context window

Qwen3.5-Plus hosted; open weights planned

Leer el artículo completo

MiniMaxcoding model230B MoE (10B active)Código Abierto

MiniMax M2.5: El Nuevo Estándar en Eficiencia y Código Abierto

Lanzado el 12 de febrero de 2026

Frontier MoE model with 80.2% on SWE-Bench Verified

Strong coding and agentic capabilities

230B total parameters, 10B activated per token

Leer el artículo completo

DeepSeek AIopen source671B MoE

DeepSeek V3.2: El Nuevo Gigante Open Source que Rivaliza con GPT-5

Lanzado el 12 de febrero de 2026

Major update to the V3 series with 1M token context

671B MoE focused on code generation and reasoning improvements

Open weights on HuggingFace, MIT license

Leer el artículo completo

Zhipu AIreasoningCódigo Abierto

GLM-5 de Zhipu AI: El Nuevo Líder en Razonamiento y Agentes Abiertos

Lanzado el 11 de febrero de 2026

China's first public AI company frontier model

Targets complex systems engineering and long-horizon agentic tasks

Leer el artículo completo

OpenBMBmultimodal9BCódigo Abierto

MiniCPM-o 4.5: El Nuevo Estándar en IA Multimodal de 9B

Lanzado el 8 de febrero de 2026

On-device multimodal LLM with full-duplex real-time audio, image, video

Built on Qwen3-8B architecture

Gemini 2.5 Flash level performance at only 9B parameters

Leer el artículo completo

OpenAIcoding modelCerrado

GPT-5.3-Codex: El Nuevo Estándar en Ingeniería de Software con IA

Lanzado el 5 de febrero de 2026

Most capable agentic coding model from OpenAI

Available via Codex app, CLI, IDE extensions

Optimized for software engineering workflows

Leer el artículo completo

AnthropicreasoningCerradoHito

Claude Opus 4.6: El Nuevo Estándar en IA de Razonamiento

Lanzado el 5 de febrero de 2026

Huge leap for agentic planning with parallel subtask execution

Tool and subagent orchestration capabilities

Terminal-Bench record holder

1M token context window, 32K max output

State-of-the-art agentic AI behaviors

Leer el artículo completo

StepFunreasoning196B MoE (11B active)Código Abierto

Step-3.5-Flash: Razonamiento de Frontera a Bajo Costo

Lanzado el 1 de febrero de 2026

Open-source sparse MoE with 3-way Multi-Token Prediction

100-350 tok/s generation speed

Frontier reasoning at low cost

Leer el artículo completo

Arcee AIopen source400B MoE (13B active)

Arcee AI lanza Trinity Large: El Gigante Abierto de 400B Parámetros

Lanzado el 27 de enero de 2026

400B sparse MoE with only 13B active parameters

Built in the US with open weights

One of the largest open-source foundation models

Apache 2.0 license

Leer el artículo completo

Alibaba CloudreasoningCerrado

Qwen3-Max-Thinking: El Nuevo Estándar en Razonamiento Lógico para 2026

Lanzado el 27 de enero de 2026

Top-tier reasoning model with adaptive tool use

Retrieves information and runs code during inference

Rivals leading frontier models

Leer el artículo completo

Moonshot AIopen source1T MoE (32B active)

Kimi K2: El Gigante Open Source de 1T Parámetros de Moonshot AI

Lanzado el 20 de enero de 2026

Massive 1T MoE with 32B active parameters

First open-weight model to rank #1 on LMSYS Chatbot Arena

2M token context window, 200+ language support

$0.15/$2.50 per 1M tokens, Modified MIT license

Leer el artículo completo

Sarvam AIlanguage model2BCódigo Abierto

Sarvam-2B: El Modelo Soberano de IA India para Desarrolladores

Lanzado el 15 de enero de 2026

India's multilingual LLM — part of sovereign AI initiative

Supports 10+ Indian languages natively

Leer el artículo completo

2025

Upstageopen source102B MoE (12B active)

SOLAR 102B: La Revolución Coreana del Open Source

Lanzado el 31 de diciembre de 2025

Korea's answer to open frontier models

102B MoE model with 12B active parameters

Leer el artículo completo

Google DeepMindlanguage modelCerrado

Gemini 3 Flash: El Nuevo Estándar de Velocidad y Eficiencia de Google

Lanzado el 17 de diciembre de 2025

Fast frontier-class model rivaling larger models at a fraction of the cost

Default model in the Gemini app

Leer el artículo completo

Allen AImultimodal8BCódigo Abierto

Molmo 2: El Nuevo Estándar Multimodal Abierto de Allen AI

Lanzado el 16 de diciembre de 2025

Multimodal model from AI2

Fully open weights, data, and code

Leer el artículo completo

Xiaomireasoning309B MoECódigo Abierto

Xiaomi MiMo V2 Flash: El Nuevo Estándar de Razonamiento Abierto

Lanzado el 16 de diciembre de 2025

Xiaomi large reasoning model

309B MoE architecture

Strong on math and code

Leer el artículo completo

OpenAIlanguage modelCerradoHito

OpenAI GPT-5.2: El Nuevo Estándar para Ingeniería de IA y Desarrollo

Lanzado el 11 de diciembre de 2025

Improved reasoning and multimodal capabilities over GPT-5.1

Enhanced mental health protections

128K max output tokens

Available on Plus ($20/month), Pro ($200/month), and API

Expert-level performance on 44 knowledge work tasks

Leer el artículo completo

Mistral AIcoding model24BCódigo Abierto

Devstral Small 2: El Nuevo Estándar en Modelos de Código Open Source

Lanzado el 9 de diciembre de 2025

Successor to Devstral Small 1, derived from Mistral Small 3.1

Portable coding agent

Apache 2.0 license

Leer el artículo completo

Mistral AIcoding model123BCódigo Abierto

Mistral AI lanza Devstral 2: El Nuevo Estándar en Código Open Source

Lanzado el 9 de diciembre de 2025

Next-gen coding model with top SWE-Bench score

Modified MIT license (free unless high revenue)

Leer el artículo completo

Mistral AImultimodal14BCódigo Abierto

Ministral 3 14B: El Nuevo Gigante Multimodal de Mistral AI

Lanzado el 2 de diciembre de 2025

Largest Ministral 3 model with vision

Best-in-class text and vision capabilities

Apache 2.0 license

Leer el artículo completo

Mistral AIlanguage model8BCódigo Abierto

Ministral 3 8B: El Nuevo Estándar en Modelos Abiertos Multimodales

Lanzado el 2 de diciembre de 2025

Powerful and efficient model with vision

Best-in-class text and vision at this size

Apache 2.0 license

Leer el artículo completo

Mistral AIlanguage model3BCódigo Abierto

Ministral 3 3B: Potencia de Borde con Visión y Apache 2.0

Lanzado el 2 de diciembre de 2025

Tiny and efficient edge model with vision

Runs on phones, drones, and laptops

Apache 2.0 license

Leer el artículo completo

Amazonlanguage modelCerrado

Amazon Nova 2: El Nuevo Estándar en AWS Bedrock

Lanzado el 2 de diciembre de 2025

Amazon next-gen foundation model

Available via AWS Bedrock

Announced at re:Invent

Leer el artículo completo

Mistral AIlanguage model41B active (MoE)Código Abierto

Mistral Large 3: El Nuevo Estándar Abierto en IA (2025)

Lanzado el 2 de diciembre de 2025

Sparse MoE with 41B active parameters

Open weights

Strong reasoning and multilingual capabilities

Leer el artículo completo

Zhipu AIcoding modelCódigo Abierto

GLM-4.7 de Zhipu: El Nuevo Líder en Código Abierto y Razonamiento

Lanzado el 1 de diciembre de 2025

Open-weights model topping global coding and reasoning leaderboards

Includes GLM-4.7 Flash variant

Cost-effective compared to Western competitors

Leer el artículo completo

MiniMaxcoding model230B MoE (10B active)Código Abierto

MiniMax M2.1: El Nuevo Estándar en Modelos de Código Abiertos

Lanzado el 1 de diciembre de 2025

Fully open-source SOTA coding model

230B params MoE architecture, 10B activated per token

SWE-bench score of 74.0%

92% cheaper than Western alternatives

Leer el artículo completo

AnthropicreasoningCerradoHito

Anthropic Lanza Claude Opus 4.5: Nuevo Estándar en Reasoning

Lanzado el 24 de noviembre de 2025

Exceeds Sonnet 4.5 by 4.3% using 48% fewer tokens at max effort

200K token context, 64K max output

Hybrid reasoning with instant or extended thinking

Multimodal: text, image, and audio support

20% accuracy gain, Excel and financial modeling breakthrough

Leer el artículo completo

Allen AIopen source32B

Allen AI presenta OLMo 3: Nuevo Estándar Open Source 32B

Lanzado el 20 de noviembre de 2025

Fully open model with weights, data, and training code

From AI2 research lab

Leer el artículo completo

Deep Cogitoreasoning671B MoECódigo Abierto

Deep Cogito v2.1: El Nuevo Estándar en Razonamiento Abierto

Lanzado el 19 de noviembre de 2025

Large 671B MoE reasoning model

Strong on complex reasoning tasks

Leer el artículo completo

Google DeepMindreasoningCerrado

Gemini 3 Deep Think: El Nuevo Estándar en Razonamiento Avanzado

Lanzado el 18 de noviembre de 2025

Reasoning variant of Gemini 3

Deep chain-of-thought for complex scientific problems

Leer el artículo completo

Google DeepMindmultimodalCerradoHito

Gemini 3 Pro: El Nuevo Estándar Multimodal de Google DeepMind

Lanzado el 18 de noviembre de 2025

Over 50% improvement over Gemini 2.5 Pro

Most powerful Google model — replaces 2.5 series

1M token context window

Advanced multimodal: text, image, video, audio, code

Leer el artículo completo

OpenAIlanguage modelCerrado

OpenAI lanza GPT-5.1: Más rápido, conversacional y listo para todos

Lanzado el 12 de noviembre de 2025

Family of four models with adaptive reasoning

Faster, more conversational, improved coding

Rolled out to all ChatGPT users

Leer el artículo completo

Moonshot AIreasoningCerrado

Kimi K2.5: El Nuevo Líder en Razonamiento de Moonshot AI

Lanzado el 6 de noviembre de 2025

Upgraded Kimi model with thinking and reasoning capabilities

Leer el artículo completo

Amazonlanguage modelCerrado

Amazon Nova Premier: El Nuevo Estándar en Modelos Multimodales de AWS

Lanzado el 31 de octubre de 2025

Most capable Amazon model

1M context window

Multimodal capabilities

Teacher for distillation on Bedrock

Leer el artículo completo

Yandexlanguage modelCerrado

Alice AI 1.0: El Nuevo Líder Ruso en Grandes Modelos de Lenguaje

Lanzado el 28 de octubre de 2025

First major Russian-developed large language model on the global stage

From Yandex

Leer el artículo completo

MiniMaxopen source230B MoE

MiniMax M2: El Nuevo Gigante Open Source de 230B

Lanzado el 23 de octubre de 2025

Upgraded MiniMax model with improved reasoning and generation

Open weights

Leer el artículo completo

Zhipu AIlanguage modelCódigo Abierto

GLM-4.6 de Zhipu AI: Soporte Nativo para Chips Chinos y Razonamiento Avanzado

Lanzado el 9 de octubre de 2025

First GLM model with native support for China domestic chips

Cambricon and Moore Threads support

FP8 and Int4 quantization

Leer el artículo completo

IBMopen source

IBM Granite 4.0: La Revolución Híbrida Mamba-Transformer Abierta

Lanzado el 2 de octubre de 2025

IBM open enterprise model

Hybrid Mamba-2 Transformer architecture

Apache 2.0 license

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude Haiku 4.5: El Nuevo Estándar en Velocidad y Eficiencia de Anthropic

Lanzado el 1 de octubre de 2025

Anthropic's fastest model with near-frontier intelligence

200K token context window, 64K max output

21K+ tokens per second for prompts under 32K tokens

Supports reasoning budget and effort control

Most cost-effective in the Claude family: $1/M input

Leer el artículo completo

DeepSeek AIopen source671B MoE

DeepSeek V3.2: El Nuevo Estándar Abierto que Desafía a GPT-5

Lanzado el 29 de septiembre de 2025

Further iteration on V3 series

Enhanced capabilities across all benchmarks

Open weights

Leer el artículo completo

Anthropiccoding modelCerrado

Claude Sonnet 4.5: El Nuevo Estándar en Desarrollo de Software

Lanzado el 29 de septiembre de 2025

Anthropic's best model for coding tasks

1M token context window (beta feature)

64K max output tokens

Strong agentic behavior and computer-use skills

Optimized for efficient coding and parallel processing

Leer el artículo completo

Alibaba Cloudopen source80B MoE (3B active)

Qwen3-Next: El Nuevo Gigante Open Source de Alibaba Cloud

Lanzado el 10 de septiembre de 2025

Ultra-efficient MoE from Alibaba

80B total, only 3B active parameters

Strong reasoning with minimal compute

Apache 2.0 license

Leer el artículo completo

Moonshot AIopen source1T MoE (32B active)Hito

Kimi K2: El Gigante Open Source de 1T Parámetros de Moonshot AI

Lanzado el 4 de septiembre de 2025

Massive 1T MoE model with open weights

Highly competitive with frontier models

Major Chinese AI milestone

32B activated parameters

Cost-effective: ~$0.15/M input, $2.50/M output

Strong coding performance across 32+ languages

Leer el artículo completo

SpaceXAIlanguage modelCerrado

Grok 4 Fast de xAI: Eficiencia y Velocidad para el Desarrollo en 2025

Lanzado el 1 de septiembre de 2025

98% cost reduction compared to Grok 4 Standard

40% increase in token efficiency

Real-time search integration via X

$0.20/M input, $1.50/M output

Leer el artículo completo

Mistral AIreasoning~45BCerrado

Mistral AI Despliega Magistral Medium 1.2: El Nuevo Estándar en Razonamiento Multimodal

Lanzado el 1 de septiembre de 2025

Adds vision to Magistral Medium

Multimodal frontier reasoning

Closed API only

Leer el artículo completo

Mistral AIreasoning24BCódigo Abierto

Magistral Small 1.2: El Nuevo Líder en Razonamiento Multimodal

Lanzado el 1 de septiembre de 2025

Adds vision to Magistral Small

Multimodal reasoning model

Apache 2.0 license

Leer el artículo completo

NousResearchopen source405B

Hermes 4 405B: La Nueva Era del Razonamiento Abierto

Lanzado el 28 de agosto de 2025

Latest in the Hermes series

Advanced function calling and structured output

Built on Llama 3.1

Leer el artículo completo

DeepSeek AIopen source671B MoE

DeepSeek V3.1: El Nuevo Estándar en Modelos Open Source de 671B

Lanzado el 21 de agosto de 2025

Major upgrade to V3 with improved reasoning and coding

Open weights

Leer el artículo completo

Mistral AImultimodalCerradoHito

Mistral Medium 3.1: El Nuevo Estándar Multimodal en 2025

Lanzado el 12 de agosto de 2025

Frontier-class multimodal model

Competitive with GPT-4o and Claude 3.5

Strong vision and reasoning capabilities

Leer el artículo completo

Zhipu AImultimodal106BCódigo Abierto

GLM-4.5V: El Nuevo Gigante Multimodal de Zhipu AI

Lanzado el 11 de agosto de 2025

Vision-language model from Z.ai

106B parameters with strong multimodal understanding

Leer el artículo completo

OpenAIlanguage modelCerradoHito

OpenAI GPT-5: El Salto Histórico a la IA Razonable (2025)

Lanzado el 7 de agosto de 2025

Next-generation flagship with major intelligence leap

400K token context window

Built-in reasoning with 4 effort levels

Multimodal: text, image, and video-based reasoning

Available in Standard, Mini, and Nano variants

Leer el artículo completo

OpenAIopen source120BHito

GPT-OSS: El Modelo Abierto de OpenAI que Cambia el Juego

Lanzado el 5 de agosto de 2025

OpenAI's first open-weight models since GPT-2

20B and 120B variants

Historic open-source move from OpenAI

Leer el artículo completo

AnthropicreasoningCerrado

Claude Opus 4.1: El Nuevo Líder en Razonamiento para Desarrolladores

Lanzado el 5 de agosto de 2025

Upgrade to Claude 4 with improved coding and instruction following

200K token context window

Extended thinking support

Vision and tool calling capabilities

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude 4.5 Sonnet: El Nuevo Estándar en Codificación y Razonamiento

Lanzado el 29 de julio de 2025

Newest Anthropic model with improved creative writing

Enhanced nuance and multi-step reasoning

Leer el artículo completo

Zhipu AIlanguage model106B MoECódigo Abierto

GLM-4.5 Air de Zhipu: Eficiencia y Rendimiento en 2025

Lanzado el 28 de julio de 2025

Lightweight variant of GLM-4.5

106B MoE, efficient inference on 8x H20 GPUs

Leer el artículo completo

Zhipu AIopen source355B MoE

GLM-4.5: El Nuevo Estándar Open-Source de Zhipu AI (2025)

Lanzado el 28 de julio de 2025

Z.ai flagship open MoE model

355B total parameters

Strong reasoning, coding, and agentic capabilities

Claimed cheaper to run than DeepSeek

Leer el artículo completo

SpaceXAIlanguage modelCerradoHito

xAI Grok 4: El Nuevo Líder en IA Generativa

Lanzado el 11 de julio de 2025

xAI's most powerful model at the time

Major reasoning leap

Trained on expanded Colossus cluster

Leer el artículo completo

Google DeepMindopen source4B

Gemma 3n: La Revolución del Edge AI de Google DeepMind

Lanzado el 26 de junio de 2025

Efficient on-device model designed for mobile

Runs on phones and edge devices

Leer el artículo completo

OpenAIreasoningCerrado

GPT-o3 Pro: El Nuevo Estándar en Modelos de Razonamiento de OpenAI

Lanzado el 10 de junio de 2025

Most powerful OpenAI reasoning model

Extended thinking for frontier problems

Leer el artículo completo

Mistral AIlanguage model24BCódigo Abierto

Mistral Small 3.2: Mejoras de Razonamiento y Código

Lanzado el 10 de junio de 2025

Update to Mistral Small 3.1

Improved instruction following and reasoning

Apache 2.0 license

Leer el artículo completo

Xiaohongshu (RedNote)open source142B MoE (14B active)

Dots.llm1: El Nuevo Estándar Open Source de Xiaohongshu

Lanzado el 6 de junio de 2025

Open-source MoE from RedNote (China Instagram)

142B total, 14B active

Performance on par with frontier models at time of release

Leer el artículo completo

Mistral AIreasoning24BCódigo Abierto

Magistral Small: El Nuevo Estándar en Razonamiento de Mistral AI

Lanzado el 5 de junio de 2025

Mistral reasoning model with extended thinking

Strong STEM performance

Apache 2.0 license

Leer el artículo completo

Google DeepMindmultimodalCerrado

Gemini 2.5 Pro (06-05): El Nuevo Estándar en Razonamiento y Código

Lanzado el 5 de junio de 2025

Latest 2.5 Pro with enhanced coding, reasoning, and agentic capabilities

Leer el artículo completo

MiniMaxlanguage modelCódigo Abierto

MiniMax-M1: El Nuevo Gigante de Código Abierto con Atención Relámpago

Lanzado el 1 de junio de 2025

Chinese AI lab flagship with strong long-context

Lightning attention architecture

Leer el artículo completo

Anthropiclanguage modelCerrado

Anthropic Lanza Claude Sonnet 4: El Nuevo Líder en Código y Agentes

Lanzado el 22 de mayo de 2025

High-performance model balancing speed and intelligence

200K context window, 64K max output

Best model for complex agents and coding

Native tool calling and computer use

Available on free tier of Claude.ai

Leer el artículo completo

AnthropicreasoningCerradoHito

Claude Opus 4: El Nuevo Líder en Razonamiento y Agentes Autónomos

Lanzado el 22 de mayo de 2025

Most powerful Anthropic model at launch

Parallel tool use, long autonomous tasks

200K token context window

Extended thinking support

Vision capabilities for image understanding

Leer el artículo completo

Mistral AIcoding model24BCódigo Abierto

Mistral AI lanza Devstral: El modelo de código de 24B bajo Apache 2.0

Lanzado el 21 de mayo de 2025

Mistral dedicated coding model

Optimized for software engineering and agentic coding tasks

Apache 2.0 license

Leer el artículo completo

TIIopen source0.5B–34B

Falcon H1: Revolución Abierta con Arquitectura Híbrida de TII

Lanzado el 20 de mayo de 2025

Hybrid SSM+attention architecture

Six model sizes from 0.5B to 34B

Punches above weight class on benchmarks

Apache 2.0 license

Leer el artículo completo

Google DeepMindlanguage modelCerrado

Gemini 2.5 Flash: El Nuevo Estándar de Velocidad y Razonamiento

Lanzado el 20 de mayo de 2025

Cost-efficient reasoning with controllable thinking depth

#1 Chatbot Arena for speed

Leer el artículo completo

Mistral AIlanguage modelCódigo Abierto

Mistral Medium 3: El Nuevo Estándar Abierto en IA (2025)

Lanzado el 14 de mayo de 2025

Front-tier model, competitive with GPT-4o

Strong multilingual capabilities

Apache 2.0 license

Leer el artículo completo

Alibaba Cloudopen source235B MoE (22B active)

Qwen 3: El Nuevo Modelo Open-Source de 235B Parámetros de Alibaba Cloud

Lanzado el 29 de abril de 2025

Excellent multilingual performance (Chinese, English, and more)

0.6B to 235B variants with hybrid thinking

119 languages supported

22B active parameters in MoE architecture

Strong coding performance

Apache 2.0 license

Leer el artículo completo

Zhipu AImultimodal32BCódigo Abierto

Zhipu GLM-4.1V: El Nuevo Gigante Multimodal de 32B

Lanzado el 25 de abril de 2025

Open 32B and 9B multimodal with reasoning

Competitive on vision tasks

Leer el artículo completo

OpenAIreasoningCerrado

OpenAI o4-mini: El Nuevo Estándar en Razonamiento Eficiente para Desarrolladores

Lanzado el 16 de abril de 2025

Efficient reasoning model

Best cost-performance for coding and STEM

Leer el artículo completo

OpenAIreasoningCerrado

OpenAI o3: El Nuevo Estándar de Razonamiento para Ingeniería

Lanzado el 16 de abril de 2025

Full o3 reasoning model — successor to o1

Deep chain-of-thought capabilities

Leer el artículo completo

OpenAIlanguage modelCerrado

OpenAI GPT-4.1 Series: El Nuevo Estándar para Ingeniería de Software

Lanzado el 14 de abril de 2025

Optimized for coding and instruction following

1M token context window

Available in Standard, Mini, and Nano variants

Nano: $0.10/M input, $0.40/M output

Leer el artículo completo

Meta AIopen source400B+ MoEHito

Llama 4 de Meta: El Nuevo Estándar Abierto para IA Multimodal

Lanzado el 5 de abril de 2025

Open-weight natively multimodal models

Scout: 109B, runs on single H100 GPU, 10M token context

Maverick: 400B, requires H100 DGX system

Early fusion for native text, image, and video understanding

Leer el artículo completo

Google DeepMindmultimodalCerradoHito

Gemini 2.5 Pro: El Nuevo Estándar en IA Multimodal de Google DeepMind

Lanzado el 25 de marzo de 2025

#1 on LMArena at launch

Built-in reasoning capabilities

1M token context window

Native code execution and Google Search grounding

Best overall model at launch

Leer el artículo completo

NVIDIAreasoning253B MoECódigo Abierto

NVIDIA Nemotron Ultra: El Nuevo Estándar en Razonamiento Abierto

Lanzado el 18 de marzo de 2025

Open reasoning model based on Llama

253B MoE architecture

Strong enterprise tasks

Leer el artículo completo

Mistral AIopen source24B

Mistral Small 3.1: Visión Multimodal y 128K Contexto en Open Source

Lanzado el 17 de marzo de 2025

Adds vision capabilities to Small 3.0

Multimodal, 128K context

Apache 2.0 license

Leer el artículo completo

Coherelanguage model111BCódigo Abierto

Cohere Command A: El Nuevo Estándar Open Source para Empresas

Lanzado el 13 de marzo de 2025

Cohere's 111B flagship model

Enterprise RAG and agentic tasks

Multilingual capabilities

Runs on 2 GPUs

Leer el artículo completo

Google DeepMindmultimodal27BCódigo Abierto

Google DeepMind lanza Gemma 3: El estándar multimodal abierto

Lanzado el 12 de marzo de 2025

1B/4B/12B/27B variants

Multimodal (text+vision)

Single GPU capable, 128K context

Leer el artículo completo

Shanghai AI Labopen source8B

InternLM 3: El Nuevo Líder en Modelos Open Source de 8B

Lanzado el 5 de marzo de 2025

8B bilingual (English + Chinese) model with deep thinking mode

Surpasses Llama 3.1 8B and Qwen2.5 7B on reasoning/knowledge tasks

128K context, trained on 4T tokens with 75%+ cost savings

Apache 2.0 license

Leer el artículo completo

Alibaba Cloudreasoning32BCódigo Abierto

QwQ-32B: El Nuevo Estándar en Razonamiento de Código y Lógica

Lanzado el 5 de marzo de 2025

Dedicated reasoning model from Qwen team

Strong mathematical and logical reasoning

Apache 2.0 license

Leer el artículo completo

OpenAIlanguage modelCerrado

OpenAI GPT-4.5: Redefiniendo la IA con Mayor EQ y Precisión

Lanzado el 27 de febrero de 2025

Largest OpenAI model at the time

Focus on EQ, creativity, reduced hallucinations

Leer el artículo completo

Anthropiccoding modelCerrado

Claude 3.7 Sonnet: El Nuevo Estándar en Ingeniería de Código

Lanzado el 24 de febrero de 2025

Hybrid reasoning — toggle instant/extended thinking

Best coding model at launch

200K context window, 64K max output

Leer el artículo completo

Microsoftopen source3.8B

Microsoft Phi-4-Mini: Eficiencia y Potencia en 3.8B

Lanzado el 18 de febrero de 2025

3.8B dense model outperforming 2x-size models (Phi-3.5-mini, Llama 3.2 3B)

128K context, 22 languages, function calling and tool use

Trained on 5T tokens (synthetic + filtered public data + code)

MIT license — smallest Phi model with strong reasoning

Leer el artículo completo

SpaceXAIlanguage modelCerrado

xAI Grok 3: Nuevo Líder en Razonamiento AI

Lanzado el 17 de febrero de 2025

Trained on Colossus supercluster (100K GPUs)

Strong reasoning capabilities

Leer el artículo completo

DeepSeek AIreasoning671B MoECódigo AbiertoHito

DeepSeek R1: El Modelo de Razonamiento que Desafiaba a OpenAI

Lanzado el 20 de enero de 2025

Open-source reasoning model rivaling o1

Pure reinforcement learning approach

Caused global market shockwaves

671B MoE architecture

Leer el artículo completo

Mistral AIlanguage model24BCódigo Abierto

Mistral Small 3.0: El Nuevo Estándar Abierto para IA en 2025

Lanzado el 15 de enero de 2025

Refreshed Small with state-of-the-art performance

Apache 2.0 license

Leer el artículo completo

Allen AIopen source7B / 13B

OLMo 2 de Allen AI: Transparencia Total y Rendimiento Superior

Lanzado el 6 de enero de 2025

Truly open: weights + training data + training code + evaluation all released

7B and 13B sizes — 7B competitive with Llama 3.1 8B, 13B with Gemma 2 9B

Trained on 4T–5T tokens, 9-point MMLU increase over OLMo 1

Apache 2.0 license

Leer el artículo completo

2024

DeepSeek AIopen source671B MoEHito

DeepSeek V3: El Hito Open-Source de 671B Parámetros

Lanzado el 26 de diciembre de 2024

671B MoE trained for $5.5M — matches GPT-4o/Claude 3.5 Sonnet

Revolutionized cost efficiency

Open-source on GitHub and HuggingFace

Strong coding and mathematical reasoning

Leer el artículo completo

TIIopen source10B

Falcon 3 de TII: El Nuevo Estándar Open Source para Razonamiento y Multimodalidad

Lanzado el 17 de diciembre de 2024

1B/3B/7B/10B sizes

Enhanced multilingual and multimodal

Apache 2.0 license

Leer el artículo completo

Microsoftopen source14B

Microsoft Phi-4: El Modelo de 14B que Desafía a los Gigantes en Razonamiento

Lanzado el 12 de diciembre de 2024

14B excelling at STEM reasoning

Outperforms much larger models on math

Leer el artículo completo

Google DeepMindmultimodalCerrado

Gemini 2.0 Flash: El Nuevo Estándar en IA Multimodal y Agéntica

Lanzado el 11 de diciembre de 2024

Google's model for the agentic era with native image and audio generation

Outperforms Gemini 1.5 Pro at twice the speed

Native tool use including Google Search and code execution

Foundation for Project Astra and Project Mariner

Leer el artículo completo

Meta AIopen source70B

Meta Llama 3.3: Eficiencia Extrema con 70B parámetros

Lanzado el 6 de diciembre de 2024

70B matching Llama 3.1 405B performance

Massive efficiency gain

Leer el artículo completo

OpenAIreasoningCerrado

OpenAI o1-pro: El Nuevo Estándar en Modelos de Razonamiento

Lanzado el 5 de diciembre de 2024

Enhanced reasoning with more compute for complex tasks

Available in ChatGPT Pro tier

Leer el artículo completo

Amazonlanguage modelCerrado

Amazon Nova: El Nuevo Estándar en Modelos de Lenguaje para AWS

Lanzado el 3 de diciembre de 2024

Foundation model family: Micro/Lite/Pro/Premier

Multimodal, optimized for AWS Bedrock

Leer el artículo completo

Alibaba Cloudcoding model0.5B–32BCódigo Abierto

Qwen2.5-Coder: El Nuevo Estándar en Modelos de Código Abiertos

Lanzado el 22 de noviembre de 2024

Code-specialized model in 6 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B

32B variant matches GPT-4o coding ability — state-of-the-art open code LLM

Trained on 5.5T tokens (source code + text-code grounding + synthetic)

300+ programming languages, 128K context with YaRN extension

Apache 2.0 license

Leer el artículo completo

Mistral AImultimodal124BCódigo Abierto

Pixtral Large: El Nuevo Gigante Multimodal de Mistral AI

Lanzado el 17 de noviembre de 2024

Mistral's large multimodal model

128K context, native image understanding at scale

Open weights

Leer el artículo completo

Tencentopen source389B MoE (52B active)

Tencent Lanza Hunyuan-Large: El Nuevo Líder Open Source

Lanzado el 5 de noviembre de 2024

Largest open-source Transformer-based MoE model at release

389B total parameters with 52B active per token

256K context window

Outperforms Llama 3.1 405B on benchmarks

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude Haiku 3.5: Velocidad y Costo para Devs

Lanzado el 22 de octubre de 2024

Fast and cost-effective model

200K token context window, 8K max output

Multilingual and vision capabilities

$0.80/M input, $4/M output

Ideal for high-volume tasks like chatbots and moderation

Leer el artículo completo

01.AIlanguage modelCerrado

Yi-Lightning: El Nuevo Modelo Propietario de 01.AI Desafia a los Gigantes

Lanzado el 16 de octubre de 2024

Ranked #6 on LMSYS Chatbot Arena at launch — #1 in China

Surpassed GPT-4o-0513 and Claude 3.5 Sonnet in overall ranking

Top-3 in Chinese, Math, Coding, and Hard Prompts categories

Founded by Kai-Fu Lee, proprietary model

Leer el artículo completo

Meta AImultimodal90BCódigo Abierto

Meta Llama 3.2: El Nuevo Estándar Multimodal para Desarrolladores

Lanzado el 25 de septiembre de 2024

First Llama models with vision capabilities — 11B and 90B multimodal variants

Lightweight 1B and 3B edge models for on-device deployment

128K context window, competitive with Claude 3 Haiku and GPT-4o-mini

Drop-in replacements for Llama 3.1 text models

Leer el artículo completo

Alibaba Cloudopen source72B

Qwen2.5: El Nuevo Líder Abierto de Alibaba para Desarrollo de IA

Lanzado el 19 de septiembre de 2024

0.5B to 72B range

SOTA open model for coding and math

18T training tokens

Apache 2.0 license

Leer el artículo completo

Mistral AIopen source22B

Mistral Small 2409: El Nuevo Estándar Open Source de 22B

Lanzado el 18 de septiembre de 2024

Updated Mistral Small with improved instruction following

22B parameters, Apache 2.0 license

Leer el artículo completo

Mistral AImultimodal12BCódigo Abierto

Pixtral 12B: Revolución Multimodal con Visión Nativa

Lanzado el 17 de septiembre de 2024

Built on NeMo architecture with native vision support

128K context, Apache 2.0 license

Leer el artículo completo

OpenAIreasoningCerradoHito

OpenAI o1-preview: El Nuevo Estándar de Razonamiento para IA

Lanzado el 12 de septiembre de 2024

First 'reasoning' model with chain-of-thought at inference

PhD-level science and math performance

Leer el artículo completo

DeepSeek AIopen source236B MoE (21B active)

DeepSeek V2.5: El Nuevo Gigante Open Source que Combina Coder y Chat

Lanzado el 5 de septiembre de 2024

Merged DeepSeek-V2-Chat and DeepSeek-Coder-V2 into a single model

236B MoE with 21B active parameters, 128K context

Strong coding and general capabilities in one model

MIT license, available on HuggingFace

Leer el artículo completo

AI21 Labsopen source398B MoE (94B active)

Jamba 1.5: La Revolución del Híbrido Mamba-Transformer de AI21 Labs

Lanzado el 22 de agosto de 2024

Mamba-Transformer hybrid MoE

94B active, 256K context

Fastest long-context model at release

Leer el artículo completo

Microsoftopen source4B MoE

Microsoft Phi-3.5: El Nuevo Estándar en Modelos de 4B MoE para Edge

Lanzado el 20 de agosto de 2024

4B MoE and 3.8B variants optimized for edge devices

Phone-capable AI with 128K context window

Improved multilingual support over Phi-3

Strong reasoning for its size class

Leer el artículo completo

SpaceXAIlanguage modelCerrado

Grok-2 de xAI: Análisis Técnico y Comparativa

Lanzado el 13 de agosto de 2024

Competitive with GPT-4o and Claude 3.5 Sonnet

Available on X platform

Leer el artículo completo

Naverlanguage model104BCerrado

HyperCLOVA X: El Nuevo LLM de Naver Optimizado para Asia

Lanzado el 7 de agosto de 2024

Korean web giant Naver's flagship LLM optimized for Korean language and culture

Two sizes: HCX-L (largest) and HCX-S (lighter), built on LLaMA 2 architecture

100K context window with Korean-optimized tokenizer

Strong cross-lingual reasoning in Asian languages — Korean, Japanese, Chinese

Leer el artículo completo

Black Forest Labsimage generation12BCódigo Abierto

FLUX.1: El Nuevo Estándar de Generación de Imágenes Abiertas

Lanzado el 1 de agosto de 2024

State-of-the-art text-to-image model from ex-Stability AI founders

12B rectified flow transformer architecture

FLUX.1 [schnell] open under Apache 2.0, [dev] non-commercial

Surpassed closed-source alternatives in image quality

Leer el artículo completo

Mistral AIlanguage model123BCódigo Abierto

Mistral Large 2: El Nuevo Gigante Abierto de Mistral AI

Lanzado el 24 de julio de 2024

128K context, competitive with GPT-4o and Llama 3.1 405B

12 languages supported

Open weights

Leer el artículo completo

Meta AIopen source405BHito

Llama 3.1: El Modelo Abierto de 405B Parámetros que Desafía a GPT-4

Lanzado el 23 de julio de 2024

Largest open model — 405B parameters

Matches GPT-4 on many benchmarks

128K context window

Leer el artículo completo

Mistral AI & NVIDIAopen source12B

Mistral NeMo 12B: Análisis Técnico y Guía de Implementación

Lanzado el 18 de julio de 2024

Co-built with NVIDIA, runs on a single GPU

12B parameters with 128K context window

Drop-in replacement for Mistral 7B with SOTA performance in its class

Apache 2.0 license, strong multilingual support

Leer el artículo completo

Shanghai AI Labopen source20B

InternLM 2.5: El Nuevo Estándar en Razonamiento Open-Source

Lanzado el 3 de julio de 2024

Strong reasoning from China's national lab

Competitive on math and coding

Leer el artículo completo

Google DeepMindopen source27B

Gemma 2 de Google DeepMind: El Nuevo Estándar de IA Abierta

Lanzado el 27 de junio de 2024

9B and 27B sizes

Outperforms models 2x its size

Knowledge distillation from Gemini

Leer el artículo completo

Anthropiclanguage modelCerradoHito

Claude 3.5 Sonnet: El Nuevo Estándar en IA para Desarrolladores

Lanzado el 20 de junio de 2024

Surpassed GPT-4o and Gemini 1.5 Pro at launch

2x faster than Claude 3 Opus at lower cost

Leer el artículo completo

DeepSeek AIcoding model236B MoECódigo Abierto

DeepSeek Coder V2: El Nuevo Estándar Abierto en Ingeniería de Código

Lanzado el 17 de junio de 2024

First open MoE code model matching GPT-4 Turbo on coding

338 programming languages supported

Leer el artículo completo

NVIDIAopen source340B

NVIDIA Nemotron-4 340B: El Nuevo Estándar de IA Abierta para Empresas

Lanzado el 14 de junio de 2024

NVIDIA's open model for synthetic data generation

Permissive enterprise license

Leer el artículo completo

Alibaba Cloudopen source72B

Qwen2: El Nuevo Gigante Open Source de 72B de Alibaba Cloud

Lanzado el 7 de junio de 2024

Major upgrade, 0.5B to 72B range

Competitive with Llama 3 70B

Apache 2.0 license

Leer el artículo completo

Zhipu AIopen source9B

GLM-4: El modelo open-source de 9B que desafía a Llama 3

Lanzado el 5 de junio de 2024

128K context, 26 languages

Competitive with Llama 3 8B

Open-source GLM-4 series

Leer el artículo completo

Mistral AIcoding model22BCódigo Abierto

Codestral: El Nuevo Modelo de Código de Mistral AI (22B)

Lanzado el 29 de mayo de 2024

Specialized code model, 80+ languages

32K context, fill-in-the-middle support

Leer el artículo completo

ByteDancelanguage modelCódigo Abierto

Doubao de ByteDance: El Nuevo Líder Open Source en IA

Lanzado el 15 de mayo de 2024

ByteDance's flagship LLM, most popular AI product in China

Available via Doubao app and Volcano Engine API

Supports 50+ application scenarios including voice, vision, and coding

Open-source Seed 1.5 variants released under permissive license

Leer el artículo completo

OpenAImultimodalCerradoHito

GPT-4o: El Futuro del Procesamiento Multimodal en IA

Lanzado el 13 de mayo de 2024

'Omni' model with native audio/vision/text

2x faster, 50% cheaper than GPT-4 Turbo

Real-time voice conversation capabilities

Leer el artículo completo

DeepSeek AIopen source236B MoE (21B active)

DeepSeek V2: El Nuevo Estándar en Modelos Open Source de Alta Eficiencia

Lanzado el 7 de mayo de 2024

236B MoE with only 21B active parameters

Multi-head Latent Attention for efficiency

Open weights

Leer el artículo completo

Snowflakeopen source480B MoE (17B active)

Snowflake Arctic: El Modelo Open-Source Empresarial Definitivo

Lanzado el 24 de abril de 2024

480B MoE with 17B active parameters

Enterprise-focused, strong on SQL and coding

Apache 2.0 license

Leer el artículo completo

Microsoftopen source14B

Phi-3 de Microsoft: El Modelo Open Source que Rompe los Límites del Móvil

Lanzado el 23 de abril de 2024

Mini/Small/Medium variants

Phi-3 Mini (3.8B) rivals Mixtral 8x7B

Phone-capable AI

Leer el artículo completo

Meta AIopen source70BHito

Llama 3 de Meta: El Nuevo Estándar en IA Abierta

Lanzado el 18 de abril de 2024

Trained on 15T tokens, 8B and 70B sizes

New open-source SOTA with massive community adoption

Leer el artículo completo

Mistral AIopen source176B MoE

Mixtral 8x22B: El modelo MoE de 176B que revoluciona el rendimiento multilingüe y código

Lanzado el 17 de abril de 2024

Large MoE with strong multilingual and code performance

Open weights

Leer el artículo completo

Coherelanguage model104BCódigo Abierto

Command R+: El modelo de lenguaje de 104B parámetros de Cohere optimizado para RAG empresarial

Lanzado el 4 de abril de 2024

Optimized for RAG and enterprise

128K context, 10 languages

Grounded generation capabilities

Leer el artículo completo

AI21 Labsopen source52B

Jamba 52B: El Revolucionario Modelo Híbrido Mamba-Transformer de Código Abierto de AI21 Labs

Lanzado el 28 de marzo de 2024

First production Mamba-Transformer hybrid

256K context, novel SSM architecture

Leer el artículo completo

Databricksopen source132B MoE (36B active)

DBRX de Databricks: El modelo open source de 132B que supera a Llama 2 y Mixtral

Lanzado el 27 de marzo de 2024

Open MoE with 36B active parameters

Outperformed Llama 2 70B and Mixtral

Apache 2.0 license

Leer el artículo completo

SpaceXAIopen source314B MoE

Grok-1: El Primer Modelo de Código Abierto de xAI con 314B MoE

Lanzado el 17 de marzo de 2024

xAI's first open-source model

314B MoE under Apache 2.0

Largest open MoE at time of release

Leer el artículo completo

Anthropiclanguage modelCerradoHito

Claude 3 de Anthropic: El Lanzamiento Histórico que Desafía a GPT-4

Lanzado el 4 de marzo de 2024

Haiku/Sonnet/Opus family

Opus matched GPT-4 on most benchmarks

200K context window, vision capabilities

Leer el artículo completo

AnthropicreasoningCerradoHito

Claude Opus 3: El modelo de razonamiento revolucionario de Anthropic

Lanzado el 4 de marzo de 2024

First Claude Opus model with advanced reasoning

200K context window

Pioneered extended thinking capabilities

Vision and tool use support

Leer el artículo completo

Mistral AIlanguage modelCerrado

Mistral Large: El modelo comercial insignia de Mistral AI con razonamiento de élite

Lanzado el 26 de febrero de 2024

Mistral's first flagship commercial model

32K context, top-tier reasoning

Leer el artículo completo

Google DeepMindopen source7B

Google DeepMind Lanza Gemma: El Modelo de Código Abierto que Revoluciona la IA Local

Lanzado el 21 de febrero de 2024

Google's open-source model from Gemini research

2B and 7B sizes, strong for its class

Leer el artículo completo

Google DeepMindmultimodalCerradoHito

Gemini 1.5 Pro: El revolucionario modelo multimodal con ventana de contexto de 1 millón de tokens

Lanzado el 15 de febrero de 2024

1 million token context window — 10x previous record

MoE architecture, processes entire codebases

Leer el artículo completo

Google DeepMindmultimodalCerrado

Gemini 1.0 Ultra: El modelo multimodal más potente de Google supera a GPT-4 en la mayoría de benchmarks

Lanzado el 8 de febrero de 2024

Most capable Gemini 1.0 model

Beat GPT-4 on 30/32 benchmarks

Powers Gemini Advanced

Leer el artículo completo

Stabilityopen source1.6B / 12B

StableLM 2: El modelo de lenguaje abierto de Stability AI que desafía a los gigantes con 1.6B y 12B de parámetros

Lanzado el 6 de febrero de 2024

Open language model in two sizes: 1.6B and 12B

Trained on 2T tokens (Falcon RefinedWeb, RedPajama, The Pile, CulturaX)

Competitive with Mistral-7B despite smaller footprint

Stability AI Community License

Leer el artículo completo

BigCode / HuggingFacecoding model3B / 7B / 15BCódigo Abierto

StarCoder 2: La Revolución de Código Abierto con Tres Tamaños y 600+ Lenguajes

Lanzado el 6 de febrero de 2024

Open code LLM in 3 sizes: 3B, 7B, 15B — trained on 4T+ tokens from The Stack v2

600+ programming languages, fill-in-the-middle capability

16K context with sliding window attention

Trained on permissively licensed code only

Leer el artículo completo

2023

Upstageopen source10.7B

SOLAR 10.7B: El modelo de código abierto que revoluciona el rendimiento de IA en Corea

Lanzado el 13 de diciembre de 2023

Korean startup Upstage's open model using depth up-scaling

Topped HuggingFace Open LLM Leaderboard at release

Apache 2.0 license

Leer el artículo completo

Mistral AIopen source46.7B MoE (12.9B active)Hito

Mixtral 8x7B de Mistral AI: El Modelo Abierto que Revoluciona la Eficiencia de IA

Lanzado el 11 de diciembre de 2023

Open-source MoE matching GPT-3.5 quality with only 12.9B active params

Game-changer for open-source efficiency

Apache 2.0 license

Leer el artículo completo

Google DeepMindmultimodalCerradoHito

Gemini 1.0 de Google DeepMind: El revolucionario modelo multimodal que redefine la IA

Lanzado el 6 de diciembre de 2023

Google's multimodal model family (Nano/Pro/Ultra)

Natively multimodal from training

Leer el artículo completo

NousResearchopen source34B

Nous Hermes 2: El modelo de código abierto que revoluciona la IA local

Lanzado el 13 de noviembre de 2023

Community fine-tuned model on Mistral/Yi

Strong at instruction following

Popular for local AI

Leer el artículo completo

01.AIopen source34B

Yi 34B de 01.AI: El Modelo Bilingüe Abierto que Desafía a Llama 2 70B

Lanzado el 2 de noviembre de 2023

Founded by Kai-Fu Lee

Strong bilingual (English/Chinese) model

Competitive with Llama 2 70B

Leer el artículo completo

Zhipu AIopen source6B

ChatGLM3-6B: El modelo de código abierto que revoluciona la IA conversacional

Lanzado el 27 de octubre de 2023

Third gen GLM with function calling, code interpreter, and agent capabilities

Leer el artículo completo

HuggingFaceopen source7B

Zephyr 7B: El modelo de código abierto que supera a modelos más grandes con DPO

Lanzado el 25 de octubre de 2023

Mistral 7B fine-tuned with DPO

Showed distilled alignment can match RLHF quality

Leer el artículo completo

Mistral AIopen source7BHito

Mistral 7B: El Modelo de Código Abierto que Revolucionó la IA en 2023

Lanzado el 27 de septiembre de 2023

Outperformed Llama 2 70B on all benchmarks despite being smaller

Sliding window attention

Apache 2.0 license

Leer el artículo completo

Alibaba Cloudopen source72B

Qwen 72B: El modelo de código abierto de Alibaba con 72 mil millones de parámetros que desafía a los líderes del mercado

Lanzado el 25 de septiembre de 2023

Alibaba's multilingual model series

Strong on Chinese and English tasks

Open weights

Leer el artículo completo

WizardLM Teamcoding model34BCódigo Abierto

WizardCoder 34B: El modelo de código de código abierto que supera a ChatGPT en benchmarks

Lanzado el 26 de agosto de 2023

Evol-Instruct tuned Code Llama

Top open-source coding model of its era

Strong on HumanEval

Leer el artículo completo

Meta AIcoding model34BCódigo Abierto

Code Llama 34B: El modelo de código abierto de Meta que revoluciona la programación asistida por IA

Lanzado el 24 de agosto de 2023

Specialized Llama 2 for code generation

Supports Python, C++, Java, and more

100K context window

Leer el artículo completo

Meta AIopen source70BHito

Llama 2: El modelo de código abierto que revolucionó la IA comercial

Lanzado el 18 de julio de 2023

First truly open-weight large model for commercial use

7B/13B/70B sizes with RLHF-tuned chat variants

Founded the modern open LLM ecosystem

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude 2 de Anthropic: El modelo de lenguaje que revoluciona el contexto y la seguridad

Lanzado el 11 de julio de 2023

200K context window

Constitutional AI approach

Strong coding and analysis capabilities

Leer el artículo completo

Zhipu AIopen source6B

ChatGLM2: El modelo de código abierto de 6B parámetros que revoluciona el procesamiento de lenguaje natural

Lanzado el 25 de junio de 2023

Second generation GLM, 32K context

42% faster inference

Stronger math and coding

Leer el artículo completo

TIIopen source180B

Falcon 180B: El modelo de código abierto de 180 mil millones de parámetros que revoluciona el ranking de LLMs

Lanzado el 25 de mayo de 2023

Trained on 3.5T tokens of RefinedWeb

Topped the Open LLM Leaderboard

Apache 2.0 license

Leer el artículo completo

Googlelanguage model340BCerrado

PaLM 2: El modelo de lenguaje de próxima generación de Google que impulsa Bard y Gemini

Lanzado el 10 de mayo de 2023

Google's next-gen model powering Bard/Gemini

Improved multilingual, reasoning, and coding

Leer el artículo completo

MosaicMLopen source7B

MPT-7B: El modelo de código abierto comercialmente viable que revoluciona la IA

Lanzado el 5 de mayo de 2023

Commercially usable open-source model

Trained on 1T tokens

Apache 2.0 license

Leer el artículo completo

BigCode / HuggingFacecoding model15.5BCódigo Abierto

StarCoder: El modelo de código abierto de 15.5B que revoluciona la generación de código

Lanzado el 4 de mayo de 2023

Open-source code LLM trained on The Stack (1T tokens, 80+ languages)

8K context window

Leer el artículo completo

Stabilityopen source7B

StableLM: La Revolución de los Modelos de Lenguaje Abiertos de Stability AI

Lanzado el 19 de abril de 2023

Stability AI's open-source LLM family

3B and 7B sizes, trained on 1.5T tokens

CC-BY-SA license

Leer el artículo completo

LMSYSopen source13B

Vicuna de LMSYS: El modelo de código abierto que logra el 90% del rendimiento de ChatGPT

Lanzado el 30 de marzo de 2023

Fine-tuned LLaMA on ShareGPT conversations

Achieved ~90% of ChatGPT quality

Launched the Chatbot Arena

Leer el artículo completo

Anthropiclanguage modelCerrado

Claude 1 de Anthropic: El Lanzamiento que Revolucionó la IA Segura

Lanzado el 14 de marzo de 2023

Anthropic's first public model

Constitutional AI for safety

100K context window

Leer el artículo completo

OpenAImultimodal~1.8T (MoE)CerradoHito

GPT-4 de OpenAI: El revolucionario modelo multimodal que cambió la IA

Lanzado el 14 de marzo de 2023

Multimodal (text + vision), passed the bar exam (90th percentile)

Massive leap in reasoning over GPT-3.5

~1.8T parameters (MoE estimated)

Leer el artículo completo

Stanfordopen source7B

Alpaca 7B de Stanford: El Modelo de Código Abierto que Revolucionó el Fine-Tuning de Instrucciones

Lanzado el 13 de marzo de 2023

Fine-tuned LLaMA on 52K instructions generated by GPT-3.5

Showed cheap instruction tuning works

Leer el artículo completo

Meta AIopen source65BHito

LLaMA 1 de Meta AI: El Revolucionario Modelo Abierto que Cambió Todo

Lanzado el 24 de febrero de 2023

Leaked weights ignited the open-source LLM revolution

Showed small models can match GPT-3

65B parameters

Leer el artículo completo

2022

OpenAIlanguage model175BCerradoHito

ChatGPT de OpenAI: El modelo que definió la era de la IA conversacional

Lanzado el 30 de noviembre de 2022

GPT-3.5 with RLHF in a chat interface

Reached 100M users in 2 months

Defined the AI era

Leer el artículo completo

Googlelanguage model11BCódigo Abierto

Flan-T5: El modelo de lenguaje instruccional de Google que revoluciona la transferencia cero

Lanzado el 20 de octubre de 2022

Instruction-tuned T5

Demonstrated instruction tuning dramatically improves task generalization

Leer el artículo completo

BigScienceopen source176BHito

BLOOM: El modelo de lenguaje multilingüe de 176 mil millones de parámetros que revolucionó el mundo del AI abierto

Lanzado el 6 de julio de 2022

First 100B+ open-source multilingual model

Built by 1000+ researchers across 70+ countries

46 languages supported

Leer el artículo completo

Meta AIopen source175B

OPT 175B: El Modelo de Código Abierto de Meta que Retó a GPT-3

Lanzado el 3 de mayo de 2022

Meta's open-source GPT-3 equivalent

Full model weights released for research

175B parameters

Leer el artículo completo

EleutherAIopen source20B

GPT-NeoX 20B: El modelo de código abierto que revolucionó la IA generativa en 2022

Lanzado el 14 de abril de 2022

EleutherAI's 20B open model

First glimpse that local LLMs could scale to GPT-3 territory

Predecessor to today open-source ecosystem

Leer el artículo completo

Googlelanguage model540BCerrado

PaLM 540B: El modelo de lenguaje de Google que revolucionó el razonamiento y la codificación

Lanzado el 4 de abril de 2022

540B parameter model

Breakthrough capabilities in reasoning, code, and multilingual tasks

Leer el artículo completo

Google DeepMindlanguage model70BCerradoHito

Chinchilla de Google DeepMind: El modelo que revolucionó las leyes de escalado de LLM

Lanzado el 29 de marzo de 2022

Proved smaller models trained on more data outperform larger undertrained ones

Redefined scaling laws for LLMs

Leer el artículo completo

OpenAIlanguage model175BCerradoHito

InstructGPT: El modelo que revolucionó la alineación de IA con instrucciones humanas

Lanzado el 27 de enero de 2022

Introduced RLHF for alignment

Pioneered training models to follow human instructions safely

Leer el artículo completo

2021

Google DeepMindlanguage model280BCerrado

Gopher de Google DeepMind: El modelo de lenguaje de 280 mil millones de parámetros que revolucionó la IA

Lanzado el 8 de diciembre de 2021

280B parameter model

Extensive analysis of scaling laws across 152 tasks

Leer el artículo completo

OpenAIcoding model12BCerradoHito

OpenAI Codex: El modelo de código que revolucionó la programación asistida por IA

Lanzado el 10 de agosto de 2021

GPT-3 fine-tuned on code

Powered GitHub Copilot

Proved LLMs could write functional programs

Leer el artículo completo

EleutherAIopen source6B

GPT-J: El modelo de código abierto que revolucionó la IA accesible en hardware doméstico

Lanzado el 9 de junio de 2021

First open model runnable on consumer hardware

6B params, GPT-2 architecture

Widely deployed in early local AI applications

Leer el artículo completo

Googlelanguage model1571BCódigo Abierto

Switch Transformer de Google: El modelo MoE de 1.6 trillones de parámetros que revolucionó el escalado eficiente

Lanzado el 11 de enero de 2021

1.6 trillion parameter MoE model

Demonstrated efficient scaling through sparse expert routing

Leer el artículo completo

2020

Googlelanguage model600B MoECerrado

GShard: El modelo de lenguaje revolucionario de Google con 600 mil millones de parámetros

Lanzado el 30 de junio de 2020

First Mixture of Experts model at massive scale

600B parameters for machine translation

Leer el artículo completo

OpenAIlanguage model175BCerradoHito

GPT-3 de OpenAI: El Modelo que Revolucionó la IA Generativa

Lanzado el 28 de mayo de 2020

175B parameters — demonstrated few-shot learning without fine-tuning

Sparked the modern LLM revolution

Leer el artículo completo

2019

Googlelanguage model11BCódigo AbiertoHito

T5: El revolucionario modelo de Google que transformó la NLP con su enfoque Text-to-Text

Lanzado el 23 de octubre de 2019

Text-to-Text Transfer Transformer Explained T5: Google's Revolutionary #T5

Unified framework treating all NLP tasks as text generation

Leer el artículo completo

Meta AIlanguage model355BCódigo Abierto

RoBERTa: El modelo de lenguaje revolucionario que demostró que BERT estaba subentrenado

Lanzado el 26 de julio de 2019

Robustly Optimized BERT

Showed BERT was significantly undertrained

Achieved new SOTA with better training

Leer el artículo completo

Google / CMUlanguage model340BCódigo Abierto

XLNet: El modelo de lenguaje revolucionario que superó a BERT en 20 tareas

Lanzado el 19 de junio de 2019

Generalized autoregressive pretraining

Outperformed BERT on 20 NLP tasks

Leer el artículo completo

OpenAIlanguage model1.5BCódigo AbiertoHito

GPT-2: El modelo que revolucionó la IA y fue considerado 'demasiado peligroso' para su lanzamiento

Lanzado el 14 de febrero de 2019

Initially withheld due to misuse concerns — "Too dangerous to release"

Showed emergent text generation quality at scale

Leer el artículo completo

2018

Googlelanguage model340BCódigo AbiertoHito

BERT de Google: El modelo que revolucionó el procesamiento del lenguaje natural en 2018

Lanzado el 11 de octubre de 2018

Bidirectional Encoder Representations from Transformers

Revolutionized NLP benchmarks

Became the foundation for search engines

Leer el artículo completo

OpenAIlanguage model117BCódigo Abierto

GPT-1: El Pionero que Revolucionó el Procesamiento del Lenguaje Natural

Lanzado el 11 de junio de 2018

First GPT model — decoder-only transformer

Demonstrated generative pre-training for language understanding

Leer el artículo completo

Allen AIlanguage model94MCódigo Abierto

ELMo: El modelo revolucionario de embeddings contextualizados de Allen AI

Lanzado el 15 de febrero de 2018

Embeddings from Language Models

Contextualized word representations using bidirectional LSTMs

Leer el artículo completo

2017

Googlelanguage modelCódigo AbiertoHito

Transformer de Google: El modelo que revolucionó la IA y sentó las bases de todos los LLM modernos

Lanzado el 12 de junio de 2017

'Attention Is All You Need' paper introduces the Transformer architecture

The foundation of all modern LLMs

Leer el artículo completo