1Replace OCR with Vision Language Models (opens in new tab)(github.com)GitHub292EarlyOom1y ago125Save
2Show HN: Visually parse an entire YouTube video frame by frame (opens in new tab)(github.com)GitHub5EarlyOom1y ago0Save
4A Node.js SDK for calling Vision Language Models (opens in new tab)(github.com)GitHub6EarlyOom1y ago0Save
5Run structured extraction on documents/images locally with Ollama and Pydantic (opens in new tab)(github.com)GitHub170EarlyOom1y ago29Save
6Show HN: Vlm Run, Extract JSON from images, videos and documents in a simple API (opens in new tab)(vlm.run)2EarlyOom1y ago0Save
7Fine-grained Visual Transcription for YouTube videos (opens in new tab)(vlm-docs.nos.run)9EarlyOom2y ago3Save
9Show HN: NOS – A fast, and ergonomic PyTorch inference server (opens in new tab)(github.com)GitHub3EarlyOom2y ago0Save