gpjt on Hacker News

1

Thoughts on Role Confusion (opens in new tab)

(gilesthomas.com)

3gpjt1d ago0

2

Flax debugging: making a hash of things (opens in new tab)

(gilesthomas.com)

2gpjt9d ago0

3

10Gb/s Ethernet: switching to a Broadcom SFP+ module (opens in new tab)

(gilesthomas.com)

195gpjt9d ago170

4

Jax: Commitment Issues (opens in new tab)

(gilesthomas.com)

4gpjt10d ago0

5

Jax Back Ends and Devices (opens in new tab)

(gilesthomas.com)

2gpjt20d ago0

6

Using Safetensors with Flax (opens in new tab)

(gilesthomas.com)

2gpjt21d ago0

7

First Looking into Jax (opens in new tab)

(gilesthomas.com)

3gpjt26d ago0

8

10Gb/s Ethernet: using mini-heatsinks with a 10GBASE-T SFP+ module (opens in new tab)

(gilesthomas.com)

3gpjt1mo ago0

9

10Gb/s Ethernet: what I did to get it working in my home (opens in new tab)

(gilesthomas.com)

232gpjt1mo ago177

10

10Gb Ethernet: what I had to (re)learn (opens in new tab)

(gilesthomas.com)

1gpjt1mo ago1

11

LLM from scratch, part 33 – what I learned from the appendices (opens in new tab)

(gilesthomas.com)

5gpjt2mo ago0

12

LLM from scratch (32l) – Interventions: updated instruction fine-tuning results (opens in new tab)

(gilesthomas.com)

1gpjt2mo ago0

13

How an LLM becomes more coherent as we train it (opens in new tab)

(gilesthomas.com)

3gpjt2mo ago0

14

LLM from scratch, part 32k – Interventions: gradient accumulation (opens in new tab)

(gilesthomas.com)

2gpjt2mo ago0

15

Provision: LLM-powered server setup from Markdown (opens in new tab)

(provision.sh)

2gpjt2mo ago0

gpjt

Recent submissions

Thoughts on Role Confusion (opens in new tab)

Flax debugging: making a hash of things (opens in new tab)

10Gb/s Ethernet: switching to a Broadcom SFP+ module (opens in new tab)

Jax: Commitment Issues (opens in new tab)

Jax Back Ends and Devices (opens in new tab)

Using Safetensors with Flax (opens in new tab)

First Looking into Jax (opens in new tab)

10Gb/s Ethernet: using mini-heatsinks with a 10GBASE-T SFP+ module (opens in new tab)

10Gb/s Ethernet: what I did to get it working in my home (opens in new tab)

10Gb Ethernet: what I had to (re)learn (opens in new tab)

LLM from scratch, part 33 – what I learned from the appendices (opens in new tab)

LLM from scratch (32l) – Interventions: updated instruction fine-tuning results (opens in new tab)

How an LLM becomes more coherent as we train it (opens in new tab)

LLM from scratch, part 32k – Interventions: gradient accumulation (opens in new tab)

Provision: LLM-powered server setup from Markdown (opens in new tab)

Recent submissions

Thoughts on Role Confusion (opens in new tab)

Flax debugging: making a hash of things (opens in new tab)

10Gb/s Ethernet: switching to a Broadcom SFP+ module (opens in new tab)

Jax: Commitment Issues (opens in new tab)

Jax Back Ends and Devices (opens in new tab)

Using Safetensors with Flax (opens in new tab)

First Looking into Jax (opens in new tab)

10Gb/s Ethernet: using mini-heatsinks with a 10GBASE-T SFP+ module (opens in new tab)

10Gb/s Ethernet: what I did to get it working in my home (opens in new tab)

10Gb Ethernet: what I had to (re)learn (opens in new tab)

LLM from scratch, part 33 – what I learned from the appendices (opens in new tab)

LLM from scratch (32l) – Interventions: updated instruction fine-tuning results (opens in new tab)

How an LLM becomes more coherent as we train it (opens in new tab)

LLM from scratch, part 32k – Interventions: gradient accumulation (opens in new tab)

Provision: LLM-powered server setup from Markdown (opens in new tab)