moondistance on Hacker News

1

VibeThinker-3B achieves 80.2 on LCBv6

(twitter.com)

8 points by moondistance 8d ago | 3 comments

2

Qwen 3.5: small models with impressive performance

(twitter.com)

6 points by moondistance 3mo ago | 0 comments

3

Language Models Are Injective and Hence Invertible

(arxiv.org)

1 point by moondistance 7mo ago | 2 comments

4

Autonomous Trash Can

(twitter.com)

20 points by moondistance 11mo ago | 1 comment

5

Grok 4 Benchmarks

(twitter.com)

3 points by moondistance 11mo ago | 0 comments

6

Jensen Huang – Nvidia GTC 2025 Keynote

(nvidia.com)

75 points by moondistance 1y ago | 74 comments

7

Exaone Deep 32B – beats DeepSeek 671B

(lgresearch.ai)

3 points by moondistance 1y ago | 1 comment

8

DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200

(blogs.nvidia.com)

13 points by moondistance 1y ago | 1 comment

9

ByteDance Doubao-1.5-pro matches GPT 4o benchmarks at 50x cheaper

(twitter.com)

4 points by moondistance 1y ago | 0 comments

10

US-China Commission top recommendation: Manhattan project for race to AGI [pdf]

(uscc.gov)

4 points by moondistance 1y ago | 0 comments

11

AI scans RNA 'dark matter' and uncovers 70k new viruses

(nature.com)

1 point by moondistance 1y ago | 0 comments

12

Llama 405B 506 tokens/second on an H200

(developer.nvidia.com)

21 points by moondistance 1y ago | 5 comments

13

SenseNova 5.5 claims SOTA LLM benchmark results

(twitter.com)

2 points by moondistance 1y ago | 0 comments

14

New AI Training Technique Is Drastically Faster, Says Google

(decrypt.co)

84 points by moondistance 1y ago | 38 comments

15

Nvidia open source LLM Nemotron 4 340B at top of the charts [pdf]

(d1qx31qr3h6wln.cloudfront.net)

17 points by moondistance 2y ago | 1 comment
