If I can manage to store all the data from HN comments and submissions in 99 GB (31993925 "items", in a very naive way), we should be able to have a DB with most common translations for most web apps way below that, closer to 1GB, if some clever people do it :)