Skip to content

Top Best Ask Show New Jobs

starzmustdie | Better HN

starzmustdie

7 karmaJoined March 18, 20228 submissions

Recent submissions

1

Show HN: #1 On This Day (opens in new tab)

(onthisday-theta.vercel.app)

18starzmustdie2mo ago1

2

A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE) (opens in new tab)

(github.com)GitHub

1starzmustdie5mo ago0

3

Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning (opens in new tab)

(github.com)GitHub

1starzmustdie1y ago0

4

Show HN: Word Game Bench – evaluating language models on word puzzles (opens in new tab)

(wordgamebench.github.io)

1starzmustdie1y ago0

5

Show HN: Answers to Chip Huyen's ML Interview Questions (opens in new tab)

(github.com)GitHub

3starzmustdie2y ago0