I know that some specific parts of what's in my training data is false, even though it was in there often. I am not just the average-by-volume of everything I've read.
It's a good question, but there are things I figured out by myself, that weren't in my training data, some, even, where my training data said the exact opposite.