EarlyOom | Better HN

EarlyOom

205 karma | Joined March 22, 2022 | 29 submissions

Recent submissions

1

Replace OCR with Vision Language Models

(github.com)

292 points | EarlyOom | 1y ago | 125 comments

2

Show HN: Visually parse an entire YouTube video frame by frame

(github.com)

5 points | EarlyOom | 1y ago | 0 comments

3

Ask HN: What are folks using to train/fine-tune Vision Language Models

1 point | EarlyOom | 1y ago | 0 comments

4

A Node.js SDK for calling Vision Language Models

(github.com)

6 points | EarlyOom | 1y ago | 0 comments

5

Run structured extraction on documents/images locally with Ollama and Pydantic

(github.com)

170 points | EarlyOom | 1y ago | 29 comments

6

Show HN: Vlm Run, Extract JSON from images, videos and documents in a simple API

(vlm.run)

2 points | EarlyOom | 1y ago | 0 comments

7

Fine-grained Visual Transcription for YouTube videos

(vlm-docs.nos.run)

9 points | EarlyOom | 2y ago | 3 comments

8

"Ok Computer, why are you slow?"

(scottloftin.substack.com)

2 points | EarlyOom | 2y ago | 0 comments

9

Show HN: NOS – A fast, and ergonomic PyTorch inference server

(github.com)

3 points | EarlyOom | 2y ago | 0 comments