Research Blog

DeepSeek-R1 (Nature Version) – Notes

8 minute read

Published:

Key takeaways from DeepSeek R1’s Nature version and how staged SFT + RL shaped reasoning, coding, and writing performance.