I claim I can juggle. I pick up three tennis balls and juggle them. You hand me three basketballs. I try and fail. My original claim, that I can juggle, still stands.
The more correct claim is you can juggle [some small number of items with particular properties].
“Can you ride a bike?”
“Yeah.”
“Prove it. Here I have the world’s smallest bicycle.” <- this person is not worth your time and attention.
> I can juggle
is here shorthand for
> I can juggle at all; I can juggle at least some things
and the basketball case is only a counterexample to the much stronger claim
> I can juggle anything
But the argument about AIs reasoning has little to do with such examples, because juggling is about the ability to complete the task alone. When it comes to reasoning there are questions about authenticity that don't have analogs I'm determine whether a person can juggle.
What would “not alone” mean? Do you think someone is passing it the answers? Of course it was trained, but that’s like cheating on a test by reading the material so you can keep a cheat sheet in your brain.