DeepSeek-R1 (Nature Version) – Notes
Published:
Key takeaways from the Nature version of DeepSeek-R1 and how staged SFT + RL shaped its reasoning, coding, and writing performance.