I did some bad use of words there "it's linear process" + "it's not linear process" :)
Let me clarify:
It's serialised, iterative, step repeating process where each step depends on output of previous one - aka linear process.
Where each step is non-linear transformation (gradient descent).
It's not distributable (over internet) task because it'd require transferring gigabytes of data (whole model weights) on each step.
To put it in other words - distributed task has massive input size and requires quick computation and tasks arrive very frequently - which means it can't be distributed over internet.