Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Practical Llama 3 inference in Java
(opens in new tab)
(github.com)
4 points
mukel
1y ago
1 comments
Share
Practical Llama 3 inference in Java | Better HN
1 comments
default
newest
oldest
mukel
OP
1y ago
Llama3.java: featuring .GGUF file format support, Q8_0 and Q4_0 quantizations, fast matrix/vector multiplication routines using Java's Vector API; served by a simple CLI with a --chat mode to interact with the Llama 3 models.
j
/
k
navigate · click thread line to collapse