Go

统计数据

1

文章

0

粉丝

0

获赞

15

阅读

1年前

Haha, deepseek r1 is using a modified BoN-RL replacing BoN with Group mean advantage was. And Kimi is taking the formulation of BoN it self. Amazing to see those model become life

#文章信息提取 #人工智能 #深度学习 #机器学习 #BoN-RL #Group mean #模型 #技术讨论