Hacker News
Bullshit Benchmark Explorer (petergpt.github.io)
9 points by smusamashah 3 months ago | 3 comments


Such a great project that could hopefully automate a lot of vibes testing! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.



this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.



