Brain the size of a planet and still cannot make the tea
Humanity’s Last Exam is a fiendishly hard benchmark for human-like intelligence. But a good score on this benchmark does not correlate with competence at simple real-world tasks such as making tea. Assistive tech for independent living in a visual world must handle such mundane tasks robustly.