Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
pvg
on Aug 14, 2010
|
parent
|
context
|
favorite
| on:
Ask HN: Sorting massive text files?
Hence the split. It's still likely to beat the crap out of sorting and it might not be needed, depending on the data. I do wonder what the dataset is that makes for such tiny, short lines.
aristus
on Aug 14, 2010
[–]
Given the numbers, the need to dedupe, the poster's strong motivation and relative inexperience...
I'd guess it's a gigantic spam list.
pvg
on Aug 14, 2010
|
parent
[–]
Hah, self-duh. Total failure of technical cynicism on my part.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: