How much have AI applications learned, and how can one know their capabilities if they are being evaluated with an exam that is far too easy? In 2024, with the publication of the previous benchmark to ...