Researchers tested artificial intelligenceβs ability to solve abstract visual puzzles similar to human IQ tests, revealing gaps in AIβs reasoning skills. Open-source AI models struggled, while closed-source models like GPT-4V performed better, but far from perfectly.