1Toward Guarantees for Clinical Reasoning in Vision Language Models (opens in new tab)(arxiv.org)5barthelomew25d ago3
2SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (opens in new tab)(arxiv.org)1barthelomew1mo ago0
3Predict your distributed LLM training time before you burn GPU hours (opens in new tab)(github.com)2barthelomew2mo ago1
4Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (opens in new tab)(github.com)1barthelomew3mo ago0
5Race optimization algorithms with good initializations (beat them with bonuses) (opens in new tab)(debargha.com)8barthelomew4mo ago8
6ProofOfThought: LLM-based reasoning using Z3 theorem proving (opens in new tab)(github.com)326barthelomew5mo ago175
7A Deep Research Agent for Curating Vision Datasets (opens in new tab)(arxiv.org)12barthelomew5mo ago0
8Provably guarantee correctness of (some of) your LLM outputs (opens in new tab)(aws.amazon.com)3barthelomew7mo ago0
9K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (opens in new tab)(arxiv.org)3barthelomew7mo ago1
11Show HN: Drop-In Out-of-Distribution Data Detector (opens in new tab)(github.com)4barthelomew1y ago0
12Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (opens in new tab)(arxiv.org)4barthelomew1y ago1