
Nice work. Any other technical comments? Why did you use those embeddings, did you binarize them, and did you use any special prompts?



At the beginning of the project, MixedBread's embedding model was small and leading the MTEB leaderboard [1], hence I went with it.

Yes, I did binarize them for a faster search experience. However, I think the search quality degrades significantly after the first 10 results, which are the same as the fp32 search results but in a shuffled order. I am planning to add a reranking strategy to push the better results higher.
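
For anyone curious what this looks like in practice, here is a rough sketch, not the project's actual code. It assumes NumPy, sign-based binarization (one bit per dimension), Hamming distance for the fast first pass, and cosine similarity on the original fp32 vectors as one possible reranking step. The function names are made up for illustration.

```python
import numpy as np

# Lookup table: number of set bits for every possible byte value.
POPCOUNT = np.array([bin(i).count("1") for i in range(256)], dtype=np.uint8)


def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Keep only the sign of each dimension and pack 8 dimensions per byte."""
    return np.packbits(embeddings > 0, axis=-1)


def hamming_search(query_bits: np.ndarray, doc_bits: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k documents closest in Hamming distance."""
    xor = np.bitwise_xor(doc_bits, query_bits)   # differing bits, per byte
    distances = POPCOUNT[xor].sum(axis=1)        # count differing bits per doc
    return np.argsort(distances, kind="stable")[:k]


def rescore(query: np.ndarray, doc_embeddings: np.ndarray, candidates: np.ndarray) -> np.ndarray:
    """Reorder candidate indices by cosine similarity on the fp32 vectors."""
    docs = doc_embeddings[candidates]
    norms = np.linalg.norm(docs, axis=1) * np.linalg.norm(query)
    scores = (docs @ query) / norms
    return candidates[np.argsort(-scores)]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    docs = rng.standard_normal((1000, 64)).astype(np.float32)  # toy data
    query = docs[42] + 0.1 * rng.standard_normal(64).astype(np.float32)

    doc_bits = binarize(docs)
    # Over-fetch with the cheap binary search, then rerank a short list.
    candidates = hamming_search(binarize(query), doc_bits, k=50)
    print(rescore(query, docs, candidates)[:10])
```

The idea is to over-fetch with the cheap binary pass and let the fp32 scores decide the final order, which targets exactly the "right results, wrong order" problem. Whether it helps past the first 10 results depends on how many candidates you over-fetch.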

At the moment, this is plain search with no special prompts.

[1]: https://huggingface.co/spaces/mteb/leaderboard



