Great topic: Model minification and algorithmic complexity
The only other super cheap model they compare with is FastText, and FastText beat them quite substantially.
Are time-domain signals better compressed with A/V codecs (compression-decompression)? And does this gzip finding suggest that other compression algorithms are likely to outperform LLMs at certain tasks, at least in terms of computational complexity?
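For anyone who wants to poke at the "other compression algorithms" question, here is a minimal sketch. It assumes the gzip finding refers to nearest-neighbour text classification using normalized compression distance (NCD); the function names `ncd` and `nearest_label` are my own, not from any paper or library. The compressor is a parameter, so `bz2.compress` or `lzma.compress` can be swapped in for `gzip.compress`.

```python
import bz2
import gzip
import lzma


def ncd(a: str, b: str, compress=gzip.compress) -> float:
    """Normalized compression distance between two strings.

    `compress` is any bytes -> bytes function, e.g. gzip.compress,
    bz2.compress, or lzma.compress.
    """
    ca = len(compress(a.encode("utf-8")))
    cb = len(compress(b.encode("utf-8")))
    cab = len(compress((a + " " + b).encode("utf-8")))
    return (cab - min(ca, cb)) / max(ca, cb)


def nearest_label(query: str, labeled, compress=gzip.compress) -> str:
    """1-nearest-neighbour: label of the training text closest to `query`.

    `labeled` is an iterable of (text, label) pairs (a hypothetical
    toy format, chosen only for this illustration).
    """
    closest = min(labeled, key=lambda pair: ncd(query, pair[0], compress))
    return closest[1]


if __name__ == "__main__":
    train = [
        ("the cat sat on the mat " * 20, "pets"),
        ("quarterly revenue rose on strong cloud demand " * 20, "finance"),
    ]
    query = "the cat sat on the mat " * 10
    for name, fn in [("gzip", gzip.compress), ("bz2", bz2.compress), ("lzma", lzma.compress)]:
        print(name, nearest_label(query, train, compress=fn))
```

This is brute force: every query is compressed against every training text, so the cost grows with the size of the training set. That matters when comparing it with something like FastText on computational complexity.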