forks on Hacker News

1

Vercel: Eve, an open-source agent framework

(vercel.com)

4 | forks | 1d ago | 1

2

Claude: Elevated errors across many models

(status.claude.com)

1 | forks | 2d ago | 1

3

Claude: Elevated Error Rates for Opus 4.8, Opus 4.7, Opus 4.6, and Sonnet 4.6

(status.claude.com)

34 | forks | 3d ago | 38

4

Claude: Elevated errors across many models [resolved]

(status.claude.com)

189 | forks | 8d ago | 160

5

An Unbiased OSS Benchmark for Code Review Agents

(codereview.withmartian.com)

3 | forks | 19d ago | 0

6

The human cost of 10x: How AI is physically breaking senior engineers

(techtrenches.dev)

82 | forks | 2mo ago | 71

7

Zero Day Clock: The gap between disclosure and exploitation is collapsing to 0

(zerodayclock.com)

3 | forks | 3mo ago | 0

8

AI Agents Gone Rogue

(osohq.com)

1 | forks | 3mo ago | 0

9

AWS Duvet: a bidirectional link between implementation and specification

(awslabs.github.io)

12 | forks | 5mo ago | 1

10

Hax: Verifying Security-Critical Rust Software Using Multiple Provers

(eprint.iacr.org)

2 | forks | 5mo ago | 0

11

Bake Oven Knob

(en.wikipedia.org)

2 | forks | 5mo ago | 0

12

Learning from Sudoku Solvers (2007)

(ravimohan.blogspot.com)

22 | forks | 5mo ago | 8

13

Architecting Security for Agentic Capabilities in Chrome

(security.googleblog.com)

1 | forks | 5mo ago | 0

14

Continuously hardening ChatGPT Atlas against prompt injection attacks

(openai.com)

3 | forks | 5mo ago | 0

15

We removed 80% of our agent's tools

(vercel.com)

3 | forks | 6mo ago | 0
