
Also, the raccoon it circled isn't in the original.


I love how perfectly this captures the difficulties of using generative AI for detection tasks.


Oh god yes, I've been trying to make an LLM-assisted Magic: The Gathering card scanner... it's been a hell of a time trying to get it to just OCR card names well...


Why would you use an LLM for OCR?


Because if it's multimodal, oops all transformers, and they're pretty much best in class for OCR now, afaik?


Yep, it's pretty damn good compared to classic OCR and even more lightweight ones as well that I can run locally. The cards just vary too much over time.


Because apparently that's what programming is and can only be these days...


Indeed. I suppose one way to ensure you can find Waldo in any image is to add it yourself.



