Lifan Yuan

统计数据

1
文章
0
粉丝
0
获赞
3
阅读

热门文章

1

TechFlow 深潮 发布的文章:近期教育领域的变化引发了广泛讨论,我认为教育改革应该更加注重学生的个性化发展和创新能...

145 32
avatar
Lifan Yuan
9个月前
How to unlock advanced reasoning via scalable RL? 🚀Introducing PRIME (Process Reinforcement through Implicit Rewards) and Eurus-2, trained from Base model to surpass Qwen2.5-Math-Instruct using only 1/10 of the data. We're still scaling up - w/ 3x more training data to go! 🧵
#PRIME #Eurus-2 #ReinforcementLearning #Qwen2.5-Math-Instruct #AdvancedReasoning
© 2025 news.news. All rights reserved. 0.01858 秒. v1.0.46
我的评论