> Failure to pass signifies a near-total inability to adapt or problem-solve in ... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

		energy123 8 days ago \| parent \| context \| favorite \| on: Killed by LLM > Failure to pass signifies a near-total inability to adapt or problem-solve in unfamiliar situations. Not necessarily. Get a human to solve ARC-AGI if the problems are shown as a string. They'll perform badly. But that doesn't mean that humans can't reason. It means that human reasoning doesn't have access to the non-reasoning building blocks it needs (things like concepts, words, or in this case: spatially local and useful visual representations). Humans have good resolution-invariant visual perception. For example, take an ARC-AGI problem, and for each square, duplicate it a few times, increasing its resolution from XX to 2X2X. To a human, the problem will be almost exactly equally difficulty. Not for LLMs that have to deal with 4x as much context. Maybe for an LLM if it can somehow reason over the output of a CNN, and if it was trained to do that like how humans are built to do that.

refulgentis 8 days ago | [–]

Excellent point, I'm not sure people are aware, but these are straight-up lifted from standard IQ tests, so they're definitely not all trivially humanly solvable.

I needed an official one for medical reasons a few years back

echelon 8 days ago | [–]

ARC-AGI feels like it would fall to a higher dimensional convolution rather than reasoning.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact