i keep it simple with an uptime monitoring service that monitors all the elements of my stack and run tests every minute:
- regular http monitoring for websites
- run test queries on my sql & mongo databases
- check that rabbitmq queues are not overflowing
- check that docker container are up
If something goes wrong, email & telegram alerts.
fwiw i'm using https://uptimefunk.com