1. Labeling Copilot: An agent for automated data curation in computer vision (github.com) | 5 points by barthelomew, 27d ago | 0 comments
2. Toward Guarantees for Clinical Reasoning in Vision Language Models (arxiv.org) | 5 points by barthelomew, 2mo ago | 3 comments
3. SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (arxiv.org) | 1 point by barthelomew, 3mo ago | 0 comments
4. Predict your distributed LLM training time before you burn GPU hours (github.com) | 2 points by barthelomew, 3mo ago | 1 comment
5. Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (github.com) | 1 point by barthelomew, 5mo ago | 0 comments
6. Race optimization algorithms with good initializations (beat them with bonuses) (debargha.com) | 8 points by barthelomew, 6mo ago | 8 comments
7. ProofOfThought: LLM-based reasoning using Z3 theorem proving (github.com) | 326 points by barthelomew, 7mo ago | 175 comments
8. A Deep Research Agent for Curating Vision Datasets (arxiv.org) | 12 points by barthelomew, 7mo ago | 0 comments
9. Provably guarantee correctness of (some of) your LLM outputs (aws.amazon.com) | 3 points by barthelomew, 9mo ago | 0 comments
10. K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (arxiv.org) | 3 points by barthelomew, 9mo ago | 1 comment
12. Show HN: Drop-In Out-of-Distribution Data Detector (github.com) | 4 points by barthelomew, 1y ago | 0 comments
13. Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (arxiv.org) | 4 points by barthelomew, 1y ago | 1 comment