November 5, 2024: Testing AI Ingenuity with Pictionary and Minecraft - AI enthusiasts are using games like Pictionary and Minecraft to test models problem-solving skills, shifting away from traditional benchmarks that rely heavily on rote memorization. Models in these games must demonstrate resourcefulness and understanding of concepts beyond training data. While games provide a visual and intuitive benchmarking method, experts warn they are not reliable reflections of real-world reasoning. Still, proponents argue games could be early steps towards spatial understanding and multimodal AI benchmarks, offering unique insights into model capabilities and behaviors.