stealthcat on Hacker News

Ask HN: Traning LLM directly on file bytes

Multi-modal LLM like PaLM, GPT4, MiniGPTv2 relies on data encoder (image, speech models) to map data to token embedding space.

Is there any attempt to directly train on file bytes? Make the only vocab of LLM as base-2, base-8 or hexadecimal, then do next token prediction on this.

I know some attempts have been done like MEGABYTE and Charformer but some may have is not directly learning from bytes with all the header info

2stealthcat2y ago0

Ask HN: Lighter alternatives to LLVM

Is it the only way to produce fast cross-platform binaries? LLVM is huge and compile times can be slow.

I found several compilers related:

0. GCC

1. Zig started as LLVM frontend, finally with self-hosting, can do cross-platform codegen without LLVM.

13stealthcat3y ago10

Ask HN: Lighter Alternatives to LLVM

Is LLVM the only way to generate fast cross-platform nowadaysk?

Last time Zig managed to be self-hosting and codegen for various platforms is possible without LLVM.

1stealthcat3y ago0

Ask HN: How is Python's OOP is superior vs. Lisp CLOS?

Python classes may be clunky but it is said that Python class system is more flexible than other languages' OOP, even Lisp CLOS.

How does Python class system actually compare to Lisp CLOS?

I've seen arguments for Python class system is the blocker for important code optimizations, AOT or JIT. Are there elaborate explanations on why we get near-machine code speed for compiled SBCL Lisp but we cannot even save the image for PyPy runs? Seems like a problem worse than GIL.

30stealthcat4y ago38

Ask HN: Traning LLM directly on file bytes

Multi-modal LLM like PaLM, GPT4, MiniGPTv2 relies on data encoder (image, speech models) to map data to token embedding space.

Is there any attempt to directly train on file bytes? Make the only vocab of LLM as base-2, base-8 or hexadecimal, then do next token prediction on this.

I know some attempts have been done like MEGABYTE and Charformer but some may have is not directly learning from bytes with all the header info

2stealthcat2y ago0

Ask HN: Lighter alternatives to LLVM

Is it the only way to produce fast cross-platform binaries? LLVM is huge and compile times can be slow.

I found several compilers related:

0. GCC

1. Zig started as LLVM frontend, finally with self-hosting, can do cross-platform codegen without LLVM.

13stealthcat3y ago10

Ask HN: Lighter Alternatives to LLVM

Is LLVM the only way to generate fast cross-platform nowadaysk?

Last time Zig managed to be self-hosting and codegen for various platforms is possible without LLVM.

1stealthcat3y ago0

Ask HN: How is Python's OOP is superior vs. Lisp CLOS?

Python classes may be clunky but it is said that Python class system is more flexible than other languages' OOP, even Lisp CLOS.

How does Python class system actually compare to Lisp CLOS?

30stealthcat4y ago38

stealthcat

Recent submissions

The Judgment on Fiat Currency [pdf] (opens in new tab)

Show HN: Slope -- a small ML library with IREE and StableHLO MLIR backend (opens in new tab)

Ask HN: Traning LLM directly on file bytes

Unary Computer for Matrix Multiplication [pdf] (opens in new tab)

Ask HN: Lighter alternatives to LLVM

Ask HN: Lighter Alternatives to LLVM

Ask HN: How is Python's OOP is superior vs. Lisp CLOS?

Compiling AI with AI (opens in new tab)

Recent submissions

The Judgment on Fiat Currency [pdf] (opens in new tab)

Show HN: Slope -- a small ML library with IREE and StableHLO MLIR backend (opens in new tab)

Ask HN: Traning LLM directly on file bytes

Unary Computer for Matrix Multiplication [pdf] (opens in new tab)

Ask HN: Lighter alternatives to LLVM

Ask HN: Lighter Alternatives to LLVM

Ask HN: How is Python's OOP is superior vs. Lisp CLOS?

Compiling AI with AI (opens in new tab)