Our streamer (prudynt-t) communicates with those blocks to handle settings, overlays, etc., receives the pre-processed image/video/audio data, packages it up for RTSP, MJPEG, etc., and handles client connections, motion sending, day/night vision, and so on.
This is the status quo for IP cameras regardless of vendor!
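To make that division of labour concrete, here is a rough sketch in Python. It is not prudynt-t code and does not use any real vendor API: `EncodedFrame`, `hardware_block`, and `Streamer` are hypothetical names invented for illustration, and the "packaging" step only tags the payload instead of doing real RTSP or MJPEG framing. The point is just that the blocks hand over ready-made data and the streamer wraps it and fans it out to whoever is connected.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, Iterator, List


@dataclass
class EncodedFrame:
    """One pre-processed unit handed over by a block (hypothetical type)."""
    kind: str       # "video" or "audio"
    payload: bytes


def hardware_block(kind: str, count: int) -> Iterator[EncodedFrame]:
    """Stand-in for a block that delivers already-processed data."""
    for i in range(count):
        yield EncodedFrame(kind=kind, payload=f"{kind}-{i}".encode())


@dataclass
class Streamer:
    """Toy streamer: wraps frames for one protocol and fans them out to clients."""
    protocol: str
    clients: List[Callable[[bytes], None]] = field(default_factory=list)

    def connect(self, sink: Callable[[bytes], None]) -> None:
        # A "client" here is just a callable that accepts a packet.
        self.clients.append(sink)

    def package(self, frame: EncodedFrame) -> bytes:
        # Placeholder for real protocol framing: prefix the payload with a tag.
        return f"{self.protocol}:{frame.kind}:".encode() + frame.payload

    def pump(self, source: Iterable[EncodedFrame]) -> int:
        """Send every frame to every client; return the number of deliveries."""
        sent = 0
        for frame in source:
            packet = self.package(frame)
            for sink in self.clients:
                sink(packet)
                sent += 1
        return sent


if __name__ == "__main__":
    received: List[bytes] = []
    streamer = Streamer(protocol="mjpeg")
    streamer.connect(received.append)
    streamer.pump(hardware_block("video", 2))
    print(received)
```

Note that the streamer never touches raw sensor data in this sketch; it only consumes what the blocks produce, which mirrors the arrangement described above.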