kashifr on Hacker News

1

Carbon: Autoregressive Genomic Foundation Model

(huggingface.co)

7 points | kashifr | 1mo ago | 1 comment

2

The ultimate guide to RL environments: building and scaling them in the LLM era

(huggingface.co)

7 points | kashifr | 1mo ago | 0 comments

3

Distilling 100B+ Models 40x Faster with TRL

(huggingface.co)

13 points | kashifr | 2mo ago | 0 comments

4

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

(huggingface.co)

2 points | kashifr | 3mo ago | 0 comments

5

Transformers V5 is out!

(github.com)

10 points | kashifr | 5mo ago | 0 comments

6

The Smol Training Playbook: The Secrets to Building World-Class LLMs

(huggingface.co)

265 points | kashifr | 7mo ago | 19 comments

7

Unlocking On-Policy Distillation for Any Model Family

(huggingface.co)

6 points | kashifr | 7mo ago | 1 comment

8

Transformers 4.55 New OpenAI GPT OSS

(github.com)

2 points | kashifr | 10mo ago | 1 comment

9

Smollm3: Smol, multilingual, long-context reasoner LLM

(huggingface.co)

388 points | kashifr | 11mo ago | 79 comments

10

Epic vs. Apple

(twitter.com)

7 points | kashifr | 1y ago | 0 comments

11

AIMO (AI Math Olympiad) progress prize winning solution

(huggingface.co)

9 points | kashifr | 1y ago | 0 comments

12

MaPO: A reference-free alignment technique for diffusion models

(mapo-t2i.github.io)

2 points | kashifr | 2y ago | 1 comment

13

OpenHermesPreferences: Dataset of ~1M AI preferences from teknium/OpenHermes-2.5

(huggingface.co)

7 points | kashifr | 2y ago | 1 comment

14

HuggingFace Training Cluster as a Service

(huggingface.co)

101 points | kashifr | 2y ago | 45 comments

15

HuggingFace 235M series D at a $4.5B valuation

(twitter.com)

3 points | kashifr | 2y ago | 0 comments
