- of a commercial nature;
- plagiarism;
- substantially large (e.g. whole work);
you're not on good legal footing.
Fair use and plagiarism are related, but they are two separate things. Especially when talking about the legalities of things, as we are here, it's vital to be clear and accurate about what specific legal issues are under discussion.
Facebook isn't claiming fair use to write academic papers about the books they're taking parts from. They're claiming fair use to feed them into LLM training. If that is a usage that is deemed to fall under fair use, then it won't require specific citation, even if it requires attribution in the more general sense (ie, crediting all the works you fed into your word-chipper), any more than you're required to cite specific passages when you're making a wholesale parody of a copyrighted work (also fair use) or writing a fanfic based on it (also fair use).
Anyway, the organizations behind popular LLMS are perpetrating massive copyright infringement and plagiarism. A fair use defense is rationally not possible.
Firstly, to claim fair use while you are making commercial use of the material is going to be difficult right off the bat. Fair use claims are bolstered by non-commercial use.
Small size of excerpts favors fair use claims. Can't do that here.
Transformative use? Not really; LLMs spit out the information verbatim if prompted in the right ways. Exploitative uses will probably not be viewed as transformative by the court.