
VibeThinker-3B hits 94.3 on AIME 2026 with 3B params, and the benchmark fight resurfaces
Sina Weibo’s tiny VibeThinker-3B posts first-tier reasoning scores, then faces instant skepticism about whether benchmarks are gameable.
By Lama Al-Rashid·· 5 min
