Better HN
Show HN: AST-guard, a gradient-immune structural guard against RL reward hacking
(github.com)
2 points
thinking-nick
11h ago
0 comments
No comments yet.