Fine tuning GPT-4o to reason like r1

(wandb.ai)

1 points | by byyoung3 5 hours ago

0 comments