Let me know what you'd want to see added!
One thing I cant glean ; What GPu/kit are preferred for which type of output?
Like chat vs imaging...
Do locally run models/agents have access to the internet?
Whats the best internet connected crawler version on can use?
2. No, they don't have access to the internet unless you build something that gives them access
3. I'm not sure what you're asking
If you want to go even cheaper vast.ai is a popular option. It's a P2P marketplace for individuals to rent out their GPUs. You can generally get a ~20-30% discount vs RunPod prices by using Vast, but network speeds and perf are much more variable and there's always the possibility that the host will just shut you off without warning. I also wouldn't recommend using it if you're training with proprietary data since they can't guarantee the host isn't logging it, but most of the OSS fine-tuning community publishes their datasets anyway.