1Labeling Copilot: An agent for automated data curation in computer vision (opens in new tab)(github.com)GitHub5barthelomew2mo ago0Save
2Toward Guarantees for Clinical Reasoning in Vision Language Models (opens in new tab)(arxiv.org)arXiv5barthelomew3mo ago3Save
3SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (opens in new tab)(arxiv.org)arXiv1barthelomew4mo ago0Save
4Predict your distributed LLM training time before you burn GPU hours (opens in new tab)(github.com)GitHub2barthelomew5mo ago1Save
5Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (opens in new tab)(github.com)GitHub1barthelomew7mo ago0Save
6Race optimization algorithms with good initializations (beat them with bonuses) (opens in new tab)(debargha.com)8barthelomew7mo ago8Save
7ProofOfThought: LLM-based reasoning using Z3 theorem proving (opens in new tab)(github.com)GitHub326barthelomew8mo ago175Save
8A Deep Research Agent for Curating Vision Datasets (opens in new tab)(arxiv.org)arXiv12barthelomew9mo ago0Save
9Provably guarantee correctness of (some of) your LLM outputs (opens in new tab)(aws.amazon.com)3barthelomew10mo ago0Save
10K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (opens in new tab)(arxiv.org)arXiv3barthelomew11mo ago1Save
12Show HN: Drop-In Out-of-Distribution Data Detector (opens in new tab)(github.com)GitHub4barthelomew1y ago0Save
13Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (opens in new tab)(arxiv.org)arXiv4barthelomew1y ago1Save