This article takes a gentle dive into the ultimate AI systems evaluation benchmark, outlining why it was created, curating diverse opinions from groups of experts in the field about it, and wrapping up with a summary of the most widely accepted verdict.