Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
We Benchmarked Claude Code, Codex, Semgrep, CodeQL, Trent on 28 CWE-Bench CVEs
(opens in new tab)
(trent.ai)
6 points
geopsist
28d ago
2 comments
Save
Share
2 comments
2 comments · 2 top-level
top
newest
oldest
kbrajesh176
27d ago
Looks interesting. LLM base solutions fails when metric is strict. For security solution guess is not enough, we need reliable and robust solution to pin vulnerability and its evidence to fully judge and mitigate with appropriate fix.
enothereska
28d ago
I'm co-founder at trent.ai, happy to answer any questions around this.
j
/
k
navigate · click thread line to collapse