To find out, systems are subjected to a range of tests—often called evaluations ... AI announced a set of exceptionally challenging math questions developed in collaboration with leading ...