Reasoning Models
61 articles
ARC-AGI 1
AI Benchmarks, Artificial Intelligence
Adaptive thinking
AI Inference, Anthropic
Agent planning
AI Agents, Artificial Intelligence
AlphaGeometry
AI Models, AI for Science, Google DeepMind
AlphaProof
AI Models, AI for Science, Google DeepMind
BIG-Bench Extra Hard
AI Benchmarks, Google DeepMind
Command A Reasoning
AI Companies, Large Language Models
Commonsense reasoning
Artificial Intelligence
DeepSeek V3.1
AI Models, Chinese AI, Large Language Models
DeepSeek-Prover
Chinese AI, Large Language Models
DeepSeek-R1
Chinese AI, Large Language Models
DeepSeek-R1-Distill
AI Models, Chinese AI, Large Language Models
DeepSeekMath
Chinese AI, Large Language Models
Extended thinking
AI Tools & Products, Anthropic
GPQA
AI Benchmarks
GPT-5 Pro
Large Language Models, OpenAI
GRPO
AI Inference, Chinese AI, Reinforcement Learning
GSM8K
AI Benchmarks, Large Language Models, Machine Learning
Gemini 2.0 Flash Thinking
Google DeepMind, Large Language Models
Gemini 2.5 Deep Think
Google DeepMind, Large Language Models
Goedel-Prover
Large Language Models, Open Source AI
Grok 4
AI Companies, AI Models, Large Language Models
Inference-time scaling
AI Inference, AI Research, Artificial Intelligence
Interleaved thinking
AI Agents, Anthropic
Kimi K2 Thinking
Chinese AI, Large Language Models, Open Source AI
Llama Nemotron
Large Language Models, NVIDIA
MAI-Thinking-1
Microsoft
MATH
AI Benchmarks, Model Evaluation
MATH (benchmark)
AI Benchmarks, Large Language Models, Machine Learning
Magistral
Large Language Models, Open Source AI
Marco-o1
Chinese AI, Open Source AI
MathArena
AI Benchmarks, Artificial Intelligence, Natural Language Processing
MiniMax M1
Chinese AI, Large Language Models
MiniMax M2.7
Chinese AI, Large Language Models
MuSR
AI Benchmarks, Model Evaluation
Muse Spark
Meta AI, Multimodal AI
Natural language inference (NLI)
AI Benchmarks, Natural Language Processing
OLMo 3
AI Models, Large Language Models, Open Source AI
OpenAI o-series
Artificial Intelligence, Large Language Models, OpenAI
OpenAI o1
AI Models, Large Language Models, OpenAI
OpenAI o1-mini
Large Language Models, OpenAI
OpenAI o1-pro
Large Language Models, OpenAI
OpenAI o3
AI Models, Large Language Models, OpenAI
OpenAI o3-mini
Large Language Models, OpenAI
OpenAI o3-pro
Large Language Models, OpenAI
Phi-4 Reasoning
AI Models, Large Language Models, Open Source AI
Phi-4-mini-flash-reasoning
AI Models, Large Language Models, Open Source AI
ProcessBench
AI Benchmarks, Model Evaluation
QvQ
Chinese AI, Multimodal AI
QwQ
Chinese AI, Large Language Models, Open Source AI
RLVR
AI Inference, Reinforcement Learning, Training & Optimization
ReAct (prompting)
AI Agents, Prompt Engineering
Reflexion
AI Agents, Machine Learning
Self-consistency
Large Language Models, Prompt Engineering
SimpleBench
AI Benchmarks, Artificial Intelligence, Natural Language Processing
Skywork-R1V
Chinese AI, Multimodal AI
Strawberry (OpenAI codename)
AI History, OpenAI
Test-time compute
Artificial Intelligence, Large Language Models, Machine Learning
Tree of Thoughts
Artificial Intelligence, Prompt Engineering
ZAYA1-8B
AI Models, Large Language Models, Mixture of Experts