vampiregrey on Hacker News

Software engineer who scaled a startup from 10 → 500, seeking early-stage roles

I’m a software engineer who joined a startup early (~10 people) and worked through its growth to ~500 employees. I’m currently working in a large enterprise environment, which has given me experience with scale, process, and long-lived systems - but I prefer early-stage work where ownership and speed matter more.

My work has included: – Custom AI agents, backend and infra-heavy systems – scraping and automation pipelines – AI-based outreach and enrichment pipelines – shipping and operating production systems end to end

I’m looking for an early-stage startup (seed–Series A), ideally working closely with founders and owning systems from zero to one.

Portfolio: https://zerobitflip.com Email: sm@zerobitflip.com

Ask HN: Has Claude Code quality dropped recently for anyone else?

I’m a daily user of Claude Code (Pro subscription) and it’s been my primary tool for the last few months.

Over the past week or so, I’ve noticed:

More shallow reasoning

Ignoring parts of context

Comparing Gemini Pro 3, Opus 4.6, GLM-5 and Kimi 2.5 in a mid-sized Go project

Last week I ran a small experiment while building a mid-sized Go backend (APIs + some concurrency-heavy logic + a bit of refactoring).

I tested:

- Gemini Pro 3 - Opus 4.6 - GLM-5 - Kimi 2.5

My rough criteria:

- Code correctness (first-pass compile success) - Quality of architectural suggestions - Refactor clarity - Handling of existing code context - Cost per useful output

Surprisingly (at least to me), Kimi 2.5 gave the best cost/performance ratio for this particular workload. It wasn’t always the most “verbose” or polished, but it required the fewest correction loops per dollar spent.

Opus 4.6 felt strong on reasoning-heavy changes, but cost scaled quickly. Gemini Pro 3 was decent but inconsistent in multi-file refactors. GLM-5 was interesting but sometimes hallucinated internal project structures.

This is obviously anecdotal and project-specific.

Curious:

What models are people here using for real-world codebases?

Has anyone benchmarked cost vs correction loops?

Are people optimizing for raw quality or iteration speed per dollar?

Would love to hear other dev experiences, especially from people working in Go or other statically typed backends.

2vampiregrey2mo ago0

Software engineer who scaled a startup from 10 → 500, seeking early-stage roles

I’m looking for an early-stage startup (seed–Series A), ideally working closely with founders and owning systems from zero to one.

Portfolio: https://zerobitflip.com Email: sm@zerobitflip.com

Ask HN: Has Claude Code quality dropped recently for anyone else?

I’m a daily user of Claude Code (Pro subscription) and it’s been my primary tool for the last few months.

Over the past week or so, I’ve noticed:

More shallow reasoning

Ignoring parts of context

vampiregrey

Recent submissions

Show HN: AI powered coding interview practice (opens in new tab)

Software engineer who scaled a startup from 10 → 500, seeking early-stage roles

OpenAI launched symphony, turn project work into isolated, autonomous runs (opens in new tab)

Built data pipelines across 200M+ companies seeking early roles (opens in new tab)

Solidclaw Local credential broker and model proxy for OpenClaw (opens in new tab)

Ask HN: Has Claude Code quality dropped recently for anyone else?

Comparing Gemini Pro 3, Opus 4.6, GLM-5 and Kimi 2.5 in a mid-sized Go project

Recent submissions

Show HN: AI powered coding interview practice (opens in new tab)

Software engineer who scaled a startup from 10 → 500, seeking early-stage roles

OpenAI launched symphony, turn project work into isolated, autonomous runs (opens in new tab)

Built data pipelines across 200M+ companies seeking early roles (opens in new tab)

Solidclaw Local credential broker and model proxy for OpenClaw (opens in new tab)

Ask HN: Has Claude Code quality dropped recently for anyone else?

Comparing Gemini Pro 3, Opus 4.6, GLM-5 and Kimi 2.5 in a mid-sized Go project