Then you would need to set up a server that would do all this and serve as a 'mirror' to your podcasts without the ads.
I also have a setup like this, I transcribe with Whisper and send it to OpenAI 4o-mini to detect ads then clip those segments with pydub, but my prompt must be lacking because the success rate on detecting ads is maybe 60%