My Gemini Flash 2.0 prompt:
"Below is the transcript of a podcast preceded by a line number. Reply with the line numbers that are likely to be from advertisements, promotions, commercials, sponsorships, or ending credits."
I think it's better than 60%, but I should definitely set up some evals.
I split the text by sentence, but was considering having the LLM try and put into paragraph (that might conceptually chunk commercial sentences together), but what I've got has been good enough for me.
I wanted to switch to Flash 2.5, but it looks like they increased the price a lot.
I think I could do a fair bit of ad identification just with text heuristics: "This podcast is sponsored/supported by...", etc.