As far as I know, you can get object detection and tracking by gluing it to a YOLO model with a few lines of Python, like this [0]. I've seen a bunch of people doing this.
I really wish there were a more unixy tool for this available in package managers.
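To make "unixy" concrete, here is a rough sketch of the kind of filter I have in mind: image paths in, one tab-separated line per detected object out. It is not what the linked script does. It assumes the Ultralytics package and its pretrained `yolov8n.pt` weights, the names `format_detection` and `detect` are made up for illustration, and it only does detection, not tracking.

```python
import sys


def format_detection(path, label, conf, box):
    """Return one TSV record: path, label, confidence, x1, y1, x2, y2."""
    x1, y1, x2, y2 = (int(round(v)) for v in box)
    return f"{path}\t{label}\t{conf:.2f}\t{x1}\t{y1}\t{x2}\t{y2}"


def detect(paths, weights="yolov8n.pt"):
    """Yield one TSV record per object detected in each image path.

    Assumes the Ultralytics package; it is imported lazily so the
    formatting helper above stays usable without it installed.
    """
    from ultralytics import YOLO

    model = YOLO(weights)
    for line in paths:
        path = line.strip()
        if not path:
            continue  # skip blank input lines
        result = model(path, verbose=False)[0]  # one Results object per image
        for box in result.boxes:
            label = result.names[int(box.cls[0])]  # class id -> class name
            yield format_detection(
                path, label, float(box.conf[0]), box.xyxy[0].tolist()
            )
```

Wiring it to stdin is then just `for record in detect(sys.stdin): print(record, flush=True)`, after which something like `find . -name '*.jpg' | python detect.py | awk -F'\t' '$2 == "person"'` works the way you would expect from a normal command-line tool.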
- [0] https://github.com/xj25vm/MotionSpot/blob/main/motionspot.py