#ambiguity

Herrington Darkholme
2025-01-26 11:54:03

A rule-based reward model also means their training targets are limited to domains with ground truth. It will be interesting to see how they can extend this to questions with ambiguous, but comparable, answers.

#RuleBasedAI #RewardModel #MachineLearning
NO CONTEXT HUMANS
2025-01-16 12:39:54

I’m not saying you should, but I’m also not saying you shouldn’t

#advice #decision-making #ambiguity