Hacker News
zarzavat | 69 days ago | on: GLM-5: Targeting complex systems engineering and l...
This test is so far beyond AGI. Try to spit out the SVG for a pelican riding a bicycle. You are only allowed to use a simple text editor. No deleting or moving the text cursor. You have 1 minute.
RC_ITR | 68 days ago
Sorry, is your definition of AGI "doing things worse than humans can do, but way faster"? Because that's been true of computers for a long time.
pixl97 | 68 days ago
I mean, for this particular benchmark, yes.
You'd have to put it in an agentic loop to perform corrections otherwise.