DeepSeek R1 Model
2025-01-27 (updated: 2025-01-27 ) | #deepseek #mixture of experts #r1 model #reasoning model
Last week Chinese AI labs DeepSeek released their latest reasoning model R1, their models are on par with the most advanced models from OpenAI, Anthropic and Meta. This post is about the details of DeepSeek R1 model and the architecture behind it.