I built this Clojure lib for robust high scale LLM calls wherein the consumer is usually a http request waiting on an SSE stream.
https://github.com/jhancock/aimeeThe article states: "Most applications are built on an architecture like the one above, where there are a number of stateless horizontally scaleable server replicas that can handle client requests."
Using the library I built, I have yet to worry about this as Clojure core.async, http libs and Java VM are so rock solid, I don't have a fragile set of stateless servers. Sure, at some point there are rare edge cases but it's nice to get very far along without worrying about them.