Hacker News
jononor
9 months ago
on:
“Imprecise” language models are smaller, speedier,...
Several years of ML research on CNNs indicate, at the very least, that one can do very well with 8-bit integers. Such quantization is basically standard now for any deployment outside of GPUs (where 8-bit isn't any faster anyway due to the hardware).
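To make "8-bit integers" concrete, here is a rough sketch of symmetric per-tensor quantization in plain Python. This is only one of several common schemes, not necessarily what any particular framework does; the function names and the sample weights are made up for illustration.

```python
def quantize_int8(values):
    """Map floats to integers in [-127, 127] using a single shared scale.

    Assumption: symmetric, per-tensor quantization (hypothetical helper,
    not taken from any specific library).
    """
    max_abs = max(abs(v) for v in values)
    # Avoid division by zero for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step, then clamp to the int8 range.
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate floats from the integer codes."""
    return [x * scale for x in q]


# Illustrative weights, not real model data.
weights = [0.42, -1.27, 0.003, 0.9, -0.55]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# The round-trip error per value is bounded by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

The point of the sketch is that each value costs one byte plus a single shared scale, and the worst-case rounding error is half a step. Real deployments typically add per-channel scales and calibration on top of this.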