Hacker Next
Implementing DeepSeek R1's GRPO algorithm from scratch
github.com
192 points by xcodevn 7 days ago | 3 comments