Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You can use min hash to avoid the full O(N^2) distance matrix, but with just 600000 items you might just brute force compute the full matrix for simiplicity. What's your time budget?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: