veryluckyxyz on Hacker News

1

Scaling Laws for Agent Harnesses via Effective Feedback Compute (opens in new tab)

(arxiv.org)arXiv

1veryluckyxyz29d ago0

2

Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph (opens in new tab)

(huggingface.co)

2veryluckyxyz7mo ago0

3

Hidden drivers of HRM's performance on ARC-AGI (opens in new tab)

(arcprize.org)

31veryluckyxyz8mo ago2

4

Set Block Decoding Is a Language Model Inference Accelerator (opens in new tab)

(arxiv.org)arXiv

4veryluckyxyz9mo ago0

5

Deep Think with Confidence (opens in new tab)

(jiaweizzhao.github.io)

1veryluckyxyz10mo ago0

6

A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler (opens in new tab)

(arxiv.org)arXiv

2veryluckyxyz1y ago0

7

Easily Understand Rdma Technology (opens in new tab)

(naddod.com)

1veryluckyxyz1y ago1

8

Model Merging in Pre-Training of Large Language Models (opens in new tab)

(arxiv.org)arXiv

2veryluckyxyz1y ago0

9

Understanding Perception and Reasoning Through Model Merging (opens in new tab)

(arxiv.org)arXiv

2veryluckyxyz1y ago0

10

Building and better understanding vision-language models (2024) (opens in new tab)

(huggingface.co)

2veryluckyxyz1y ago0

11

HF smolagents computer-agent demo (opens in new tab)

(huggingface.co)

1veryluckyxyz1y ago0

12

Do Reasoning Models Show Better Verbalized Calibration? (opens in new tab)

(arxiv.org)arXiv

2veryluckyxyz1y ago0

13

Robustly identifying concepts introduced during chat fine-tuning with crosscoder (opens in new tab)

(arxiv.org)arXiv

6veryluckyxyz1y ago0

14

Retrieval with Learned Similarities (opens in new tab)

(arxiv.org)arXiv

3veryluckyxyz1y ago0

15

The Curse of Depth in Large Language Models (opens in new tab)

(arxiv.org)arXiv

1veryluckyxyz1y ago0

veryluckyxyz

Recent submissions

Scaling Laws for Agent Harnesses via Effective Feedback Compute (opens in new tab)

Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph (opens in new tab)

Hidden drivers of HRM's performance on ARC-AGI (opens in new tab)

Set Block Decoding Is a Language Model Inference Accelerator (opens in new tab)

Deep Think with Confidence (opens in new tab)

A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler (opens in new tab)

Easily Understand Rdma Technology (opens in new tab)

Model Merging in Pre-Training of Large Language Models (opens in new tab)

Understanding Perception and Reasoning Through Model Merging (opens in new tab)

Building and better understanding vision-language models (2024) (opens in new tab)

HF smolagents computer-agent demo (opens in new tab)

Do Reasoning Models Show Better Verbalized Calibration? (opens in new tab)

Robustly identifying concepts introduced during chat fine-tuning with crosscoder (opens in new tab)

Retrieval with Learned Similarities (opens in new tab)

The Curse of Depth in Large Language Models (opens in new tab)

Recent submissions

Scaling Laws for Agent Harnesses via Effective Feedback Compute (opens in new tab)

Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph (opens in new tab)

Hidden drivers of HRM's performance on ARC-AGI (opens in new tab)

Set Block Decoding Is a Language Model Inference Accelerator (opens in new tab)

Deep Think with Confidence (opens in new tab)

A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler (opens in new tab)

Easily Understand Rdma Technology (opens in new tab)

Model Merging in Pre-Training of Large Language Models (opens in new tab)

Understanding Perception and Reasoning Through Model Merging (opens in new tab)

Building and better understanding vision-language models (2024) (opens in new tab)

HF smolagents computer-agent demo (opens in new tab)

Do Reasoning Models Show Better Verbalized Calibration? (opens in new tab)

Robustly identifying concepts introduced during chat fine-tuning with crosscoder (opens in new tab)

Retrieval with Learned Similarities (opens in new tab)

The Curse of Depth in Large Language Models (opens in new tab)