barthelomew on Hacker News

1

Labeling Copilot: An agent for automated data curation in computer vision (opens in new tab)

(github.com)GitHub

5barthelomew2mo ago0

2

Toward Guarantees for Clinical Reasoning in Vision Language Models (opens in new tab)

(arxiv.org)arXiv

5barthelomew3mo ago3

3

SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (opens in new tab)

(arxiv.org)arXiv

1barthelomew4mo ago0

4

Predict your distributed LLM training time before you burn GPU hours (opens in new tab)

(github.com)GitHub

2barthelomew5mo ago1

5

Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (opens in new tab)

(github.com)GitHub

1barthelomew7mo ago0

6

Race optimization algorithms with good initializations (beat them with bonuses) (opens in new tab)

(debargha.com)

8barthelomew7mo ago8

7

ProofOfThought: LLM-based reasoning using Z3 theorem proving (opens in new tab)

(github.com)GitHub

326barthelomew8mo ago175

8

A Deep Research Agent for Curating Vision Datasets (opens in new tab)

(arxiv.org)arXiv

12barthelomew9mo ago0

9

Provably guarantee correctness of (some of) your LLM outputs (opens in new tab)

(aws.amazon.com)

3barthelomew10mo ago0

10

K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (opens in new tab)

(arxiv.org)arXiv

3barthelomew11mo ago1

11

Grammars of Formal Uncertainty (opens in new tab)

(arxiv.org)arXiv

34barthelomew1y ago5

12

Show HN: Drop-In Out-of-Distribution Data Detector (opens in new tab)

(github.com)GitHub

4barthelomew1y ago0

13

Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (opens in new tab)

(arxiv.org)arXiv

4barthelomew1y ago1

14

Pfizer vaccine adverse event reports [pdf] (opens in new tab)

(phmpt.org)PDF

8barthelomew4y ago0

barthelomew

Recent submissions

Labeling Copilot: An agent for automated data curation in computer vision (opens in new tab)

Toward Guarantees for Clinical Reasoning in Vision Language Models (opens in new tab)

SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (opens in new tab)

Predict your distributed LLM training time before you burn GPU hours (opens in new tab)

Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (opens in new tab)

Race optimization algorithms with good initializations (beat them with bonuses) (opens in new tab)

ProofOfThought: LLM-based reasoning using Z3 theorem proving (opens in new tab)

A Deep Research Agent for Curating Vision Datasets (opens in new tab)

Provably guarantee correctness of (some of) your LLM outputs (opens in new tab)

K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (opens in new tab)

Grammars of Formal Uncertainty (opens in new tab)

Show HN: Drop-In Out-of-Distribution Data Detector (opens in new tab)

Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (opens in new tab)

Pfizer vaccine adverse event reports [pdf] (opens in new tab)

Recent submissions

Labeling Copilot: An agent for automated data curation in computer vision (opens in new tab)

Toward Guarantees for Clinical Reasoning in Vision Language Models (opens in new tab)

SoTA LLM Guardrails by Trusting the Typical [ICLR 2026] (opens in new tab)

Predict your distributed LLM training time before you burn GPU hours (opens in new tab)

Uncertainty Quantification for Auto Formalization [NeurIPS 2025] (opens in new tab)

Race optimization algorithms with good initializations (beat them with bonuses) (opens in new tab)

ProofOfThought: LLM-based reasoning using Z3 theorem proving (opens in new tab)

A Deep Research Agent for Curating Vision Datasets (opens in new tab)

Provably guarantee correctness of (some of) your LLM outputs (opens in new tab)

K^4: Online Log Anomaly Detection via Unsupervised Typicality Learning (opens in new tab)

Grammars of Formal Uncertainty (opens in new tab)

Show HN: Drop-In Out-of-Distribution Data Detector (opens in new tab)

Proof of Thought: Neurosymbolic Program Synthesis for Interpretable Reasoning (opens in new tab)

Pfizer vaccine adverse event reports [pdf] (opens in new tab)