HN Reader
New
Top
Best
Ask
Show
Job
Reinforcement Learning from Human Feedback
14
1
2 hours ago
by onurkanbkrc
Web version with links, etc:
https://rlhfbook.com/
1 hour ago
by klelatti