The paper confirms that even the biggest and most advanced models can still be misled by irrelevant added information that humans easily filter out.
You can say a calculator "thinks" or is "AI," however you want to frame it. But reasoning is a very specific and defined concept, and per this paper, computers cannot reason.