Large-scale language models with long CoT reasoning, such as DeepSeek-R1, have shown good results on Olympiad-level mathematics. However, models trained through Supervised Fine-Tuning or Reinforcement ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results