Hacker News
Bullshit Benchmark Explorer (petergpt.github.io)
9 points by smusamashah 3 months ago | 3 comments
fragebogen 3 months ago
Such a great project that could automate a lot of vibes testing hopefully! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.
smusamashah 3 months ago
https://github.com/petergpt/bullshit-benchmark
drsalt 3 months ago
this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.