chw9e on Hacker News

Show HN: qckfx – Stop manually checking if your AI agent broke your iOS app

AI coding agents can build iOS UI, but they can't verify it. They can update a screen, but the UI might drift or break, and nobody catches it until a human checks.

qckfx gives your agent a baseline. Record a simulator session once. Every tap, scroll, and network response gets captured. On replay, each screen is compared against the original.

With our MCP, your agent triggers the tests and gets back visual diffs of exactly what changed. Updating the baselines is one click.

Under the hood:

- Full network replay (HTTP & WebSocket)

- Initial disk & keychain state captured during recording and restored on every run

- Precise scroll positioning (built from scratch; XCUITest only exposes this on macOS and iPad)

- No AI in the loop at runtime, fast execution

No SDK or code changes needed. Nothing to commit to git. Just download the app and go.

Everything runs locally. Your data stays on your machine.

https://qckfx.com

2chw9e4mo ago0

Show HN: qckfx – Record your iOS simulator sessions, replay them as tests

I built this because I hate writing tests but I also hate shipping bugs.

qckfx records what you're already doing. You tap through the simulator to check your work, and it replays those sessions to catch visual regressions and crashes. No SDK, no code changes, no test files.

How it works: finish a feature, click around in the simulator, hit Cmd+Shift+S. That's your test. Cmd+Shift+T to run it before you push.

It's free, runs locally, and installs via Homebrew: brew install --cask qckfx/tap/qckfx. I'm a solo dev (10 years iOS, ex-Meta) and this is early. Feedback welcome.

https://qckfx.com

2chw9e4mo ago0

Ask HN: How do you handle oncall at 15-30 engineers?

I'm doing research on how oncall breaks down as engineering teams scale, specifically at Series A/B stage companies (roughly 15-30 engineers).

The hypothesis I'm exploring: oncall workflows were designed for small teams where everyone knows the codebase. They break at scale because of specialization and context loss before volume becomes a problem.

Some patterns I've heard so far:

- Oncall engineers spending 10-20% of time just routing bugs to the right owner

- Bug reports from CS/sales teams missing critical context (no logs, vague repro steps)

- "Couple hours" average per bug, mostly on investigation not fixing

- Session replay tools too expensive to run at meaningful coverage

I'm trying to figure out:

- Is this universal, or just specific to certain tech stacks/org structures?

- What's the actual breaking point - team size, user scale, something else?

- Has anyone solved this well? What worked?

If you're currently running oncall at a growing company, I'd love to hear:

- What percentage of oncall time is triage/routing vs. actual fixing?

- What broke first as you scaled?

- What have you tried?

2chw9e6mo ago0

Show HN: qckfx – Stop manually checking if your AI agent broke your iOS app

AI coding agents can build iOS UI, but they can't verify it. They can update a screen, but the UI might drift or break, and nobody catches it until a human checks.

qckfx gives your agent a baseline. Record a simulator session once. Every tap, scroll, and network response gets captured. On replay, each screen is compared against the original.

With our MCP, your agent triggers the tests and gets back visual diffs of exactly what changed. Updating the baselines is one click.

Under the hood:

- Full network replay (HTTP & WebSocket)

- Initial disk & keychain state captured during recording and restored on every run

- Precise scroll positioning (built from scratch; XCUITest only exposes this on macOS and iPad)

- No AI in the loop at runtime, fast execution

No SDK or code changes needed. Nothing to commit to git. Just download the app and go.

Everything runs locally. Your data stays on your machine.

https://qckfx.com

2chw9e4mo ago0

Show HN: qckfx – Record your iOS simulator sessions, replay them as tests

I built this because I hate writing tests but I also hate shipping bugs.

How it works: finish a feature, click around in the simulator, hit Cmd+Shift+S. That's your test. Cmd+Shift+T to run it before you push.

It's free, runs locally, and installs via Homebrew: brew install --cask qckfx/tap/qckfx. I'm a solo dev (10 years iOS, ex-Meta) and this is early. Feedback welcome.

https://qckfx.com

2chw9e4mo ago0

Ask HN: How do you handle oncall at 15-30 engineers?

I'm doing research on how oncall breaks down as engineering teams scale, specifically at Series A/B stage companies (roughly 15-30 engineers).

Some patterns I've heard so far:

- Oncall engineers spending 10-20% of time just routing bugs to the right owner

- Bug reports from CS/sales teams missing critical context (no logs, vague repro steps)

- "Couple hours" average per bug, mostly on investigation not fixing

- Session replay tools too expensive to run at meaningful coverage

I'm trying to figure out:

- Is this universal, or just specific to certain tech stacks/org structures?

- What's the actual breaking point - team size, user scale, something else?

- Has anyone solved this well? What worked?

If you're currently running oncall at a growing company, I'd love to hear:

- What percentage of oncall time is triage/routing vs. actual fixing?

- What broke first as you scaled?

- What have you tried?

2chw9e6mo ago0

chw9e

Recent submissions

Background Coding Agents: Predictable Results Through Strong Feedback Loops (opens in new tab)

Agent-written tests can't verify agent-written code (opens in new tab)

Diffusion Models for ARC-AGI: A Retrospective (opens in new tab)

AI Agents are shipping code faster than we can test (opens in new tab)

Building iOS UI with Coding Agents Is Slow (opens in new tab)

Show HN: qckfx – Stop manually checking if your AI agent broke your iOS app

A Guide to Hiring the Illegible (opens in new tab)

Giving Your Agent Eyes Is Not Enough (opens in new tab)

Show HN: qckfx – Record your iOS simulator sessions, replay them as tests

We Use API Agents to Build Integrations Fast (opens in new tab)

What I Wish I Could Tell Every Engineer Joining a New Team (opens in new tab)

Dropbox: Knowledge Graphs, Prompt Optimizers, and MCPs (opens in new tab)

Bug Fixing Is an ETL Problem (opens in new tab)

Ask HN: How do you handle oncall at 15-30 engineers?

Finding and fixing software bugs automatically with SapFix and Sapienz (opens in new tab)

Recent submissions

Background Coding Agents: Predictable Results Through Strong Feedback Loops (opens in new tab)

Agent-written tests can't verify agent-written code (opens in new tab)

Diffusion Models for ARC-AGI: A Retrospective (opens in new tab)

AI Agents are shipping code faster than we can test (opens in new tab)

Building iOS UI with Coding Agents Is Slow (opens in new tab)

Show HN: qckfx – Stop manually checking if your AI agent broke your iOS app

A Guide to Hiring the Illegible (opens in new tab)

Giving Your Agent Eyes Is Not Enough (opens in new tab)

Show HN: qckfx – Record your iOS simulator sessions, replay them as tests

We Use API Agents to Build Integrations Fast (opens in new tab)

What I Wish I Could Tell Every Engineer Joining a New Team (opens in new tab)

Dropbox: Knowledge Graphs, Prompt Optimizers, and MCPs (opens in new tab)

Bug Fixing Is an ETL Problem (opens in new tab)

Ask HN: How do you handle oncall at 15-30 engineers?

Finding and fixing software bugs automatically with SapFix and Sapienz (opens in new tab)