Quote Originally Posted by d_stranz View Post
Assuming you can fit 50 million lines in memory, that is.
Exactly, hence the idea of processing each ID separately.

But if you have the memory to do all in one go then I would go for a map of maps or a hash of maps.

Cheers,
_