Furthermore, they show a counter-intuitive scaling limit: their reasoning effort increases with problem complexity up to a point, then declines despite having an adequate token budget. By comparing LRMs with their standard LLM counterparts under equivalent inference compute, we identify three performance regimes: (1) low-complexity tasks https://www.youtube.com/watch?v=snr3is5MTiU