Without this trove of data, you can't do something as simple as length(str) or uppercase(str) — even in a CLI if you want to line text up.
So yes, this database has a big chunk that represents rarely useful data like you mention. But majority of it is still generally useful.