Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Ask HN: How do we reduce latency for AI applications?
4 points
MrAR
1y ago
1 comments
Save
Share
I am connecting 3-4 AI models serially, such that output of one model is fed as input of another model. I am getting a lot of latency, even after using GPUs, how to reduce it?
1 comments
1 comments · 1 top-level
top
newest
oldest
compressedgas
1y ago
Use smaller models.
j
/
k
navigate · click thread line to collapse