11 Comments

This is very helpful. Thank you for writing it!

Expand full comment

Thank you, for the awesome content. I have a question, what tools do you use for visualization?

Expand full comment

Thank you for the kind words! I'm using Figma to create the visualizations but all visuals (same with my other visual guides) could have been done with something like Powerpoint, KeyNote, etc.

Expand full comment

The article is awesome, thanks for the great content and the simplicity of the explanation and the flow of the content. I haven't finished it yet but can't wait to finish it and read it again and again

Expand full comment

Thank you! Great content! One typo: the exploration and exploitation annotation in the illustration of the Monte Carlo tree search seems flipped.

Expand full comment

Thanks for the feedback! I updated the visual.

Expand full comment

Great article, did you use deepseek for any part?

Expand full comment

OMG, I love it!

Expand full comment

Great article!

Can you explain why samples for R1 are focused on coding tasks (e.g. "does it compile?")?

Technical report says "large-scale RL has not been applied extensively in software engineering tasks" and leave this task for the future versions of R1 model.

Expand full comment

This is an amazing post, thank you!

Expand full comment

Very helpful for me, a beginner. Thanks a lot!

Expand full comment