Naifan Li
Tags Categories Archives About
Naifan Li
Cancel
TagsCategoriesArchivesAbout

 Reinforcement Learning

2025

DeepSeek R1 06-10

2022

InstructGPT: Training language models to follow instructions with human feedback 03-04
2018 - 2026 Naifan Li