After outages, Amazon to make senior engineers sign off on AI-assisted changes (opens in new tab)

(arstechnica.com)

659 pointsndr4216d ago486 comments

https://www.ft.com/content/7cab4ec7-4712-4137-b602-119a44f77... (https://archive.ph/wXvF3)

https://twitter.com/lukolejnik/status/2031257644724342957 (https://xcancel.com/lukolejnik/status/2031257644724342957)

486 comments

This “mandatory meeting” is just the usual weekly company-wide meeting where recent operational issues are discussed. There was a big operational issue last week, so of course this week will have more attendance and discussion.

This meeting happens literally every week, and has for years. Feels like the media is making a mountain out of a mole hill here.

davidclark16d ago

The article claims:

>He asked staff to attend the meeting, which is normally optional.

Is that false? It also discusses a new policy:

>Junior and mid-level engineers will now require more senior engineers to sign off any AI-assisted changes, Treadwell added.

Is that inaccurate? It is good context that this is a regularly scheduled meeting. But, regularly scheduled meetings can have newsworthy things happen at them.

djb_hackernews15d ago

When an SVP asks you to do something in a mass email, it's very much optional. Dave Treadwell is an SVP, his org is likely in the 10's of thousands, there is no way to even have a mandatory meeting for that many people.

My SVP asks me to do things all the time, indirectly. I do probably 5% of them.

5 more replies

skeeter202016d ago

That's not really what the headline attempts to communicate though. It specifically emphasizes "Mandatory" and "AI breaking things". Nobody was going to click on "Regularly scheduled Amazon staff meeting will include discussion on operational improvement"

1 more reply

the_arun15d ago

Days are not far, where my agents are going to attend meetings & share my opinions, collect summary for me. If everyone do same - agents run meetings & share summary with parent (humans). Each of us have LLMs/Agents with our contextual data. It is another level of multi tasking.

1 more reply

s3p15d ago

>>He asked staff to attend the meeting, which is normally optional. >Is that false?

Judging from the comment above, no, the meeting happens every week, and this week they were asked to attend.

cobolcomesback16d ago

It’s not false. But it’s also weaselly worded.

Note that the article doesn’t say that he told staff they have to attend the meeting. It says he “asked” staff to attend the meeting. Which again, it’s really really normal for there to be an encouragement of “hey, since we just had an operational event, it would be good to prioritize attending this meeting where we discuss how to avoid operational events”.

As for the second quote: senior engineers have always been required to sign off on changes from junior engineers. There’s nothing new there. And there is nothing specific to AI that was announced.

This entire meeting and message is basically just saying “hey we’ve been getting a little sloppy at following our operational best practices, this is a reminder to be less sloppy”. It’s a massive nothingburger.

3 more replies

CoolGuySteve16d ago

It didn't seem to make the news but at least in NYC the entire Amazon storefront was broken all afternoon on Friday.

Items weren't displaying prices and it was impossible to add anything to your cart. It lasted from about 2pm to 5pm.

It's especially strange because if a computer glitch brought down a large retail competitor like Walmart I probably would have seen something even though their sales volume is lower.

kotaKat16d ago

A little birdie told me someone pushed duplicate data into one of Amazon’s core noSQL systems that runs most of e-commerce. The front end of the site broke in weird ways but it certainly wasn’t taking orders.

m304715d ago

Sometimes you squeeze clay and it comes out the oddest places. There were other stressors last week.https://www.pcmag.com/news/amazon-cloud-services-disrupted-i...

malfist16d ago

Over the weekend I was trying to return a pair of shoes and get a different size and I kept getting 500s trying to go to the store page for the shoes.

1 more reply

belval16d ago

I am not in that specific meeting but it made me chuckle that a weekly ops meeting will somehow get media attention. It's been an Amazon thing forever. Wait until the public learns about CoEs!

cmiles7415d ago

A weekly ops meeting where they talk about ensuring PRs with AI contributions get extra scrutiny? I think that's significant news.

3 more replies

8note16d ago

id.expect COEs to be coming up with AI code action items though, not to have more thorough human checks

1 more reply

otterley16d ago

> Feels like the media is making a mountain out of a mole hill here.

That's been their job ever since cable news was invented.

ses198416d ago

It’s been a bit longer than that.

https://en.wikipedia.org/wiki/Yellow_journalism

It probably goes back as long as they have been shouting news in the town square in Rome or before that even.

2 more replies

groundzeros201515d ago

It’s always sobering to see a news story about something you have insider perspective on.

embedding-shape16d ago

> This meeting happens literally every week, and has for years. Feels like the media is making a mountain out of a mole hill here.

Are you completely missing the point of the submission? It's not about "Amazon has a mandatory weekly meeting" but about the contents of that specific meeting, about AI-assisted tooling leading to "trends of incidents", having a "large blast radius" and "best practices and safeguards are not yet fully established".

No one cares how often the meeting in general is held, or if it's mandatory or not.

skeeter202016d ago

>> Are you completely missing the point of the submission

no, and that's what people are noting: the headline deliberately tries to blow this up into a big deal. When did you last see the HN post about Amazon's mandatory meeting to discuss a human-caused outage, or a post mortem? It's not because they don't happen...

2 more replies

furyofantares15d ago

This reply chain is confusing but I'm guessing got merged from another thread that had a different title?

Must have as the comments are hours older than OP.

cmiles816d ago

The core message of the article is that Amazon has been having issues with AI slop causing operational reliability concerns, and that seems to be 100% accurate.

coredog6415d ago

/with AI slop//

rahbert15d ago

This is correct. We ran them on Wednesday’s in Alexa. Jessy actually used to come and sit in ours once a quarter or so when he was running AWS.

age1mlclg615d ago

What has really happened is that those employees were made into "reverse centaurs":

https://www.theguardian.com/us-news/ng-interactive/2026/jan/...

Clent16d ago

Who is the media you're accusing here? This is a twitter post. As far as I can tell they do not work a media company.

What is worth being pointed out is how quickly people blame "The Media" for how people use, consume and spread information on social networks.

otterley16d ago

The source is not a Twitter post, it's a Financial Times article (that the poster failed to cite).

niwtsol16d ago

I believe it is by group - AWS started the weekly operations meeting, effectively every service's oncall from the last week had to attend. Then it grew massive, so they made it optional. Alexa had a similar meeting that tried to replicate what AWS did. A lot of time spent reviewing load tests getting ready for holiday season, prime day, and the superbowl (super bowl ads used to cause crazy TPS spikes for Alexa). And a lot of finger pointing if there was an outage from one team. While it probably did help raise the operational bar, so much time wasted by engineers on busywork/paperwork documenting an error or fix vs improving the actual service.

happytoexplain16d ago

>Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off

Review by a senior is one of the biggest "silver bullet" illusions managers suffer from. For a person (senior or otherwise) to examine code or configuration with the granularity required to verify that it even approximates the result of their own level of experience, even only in terms of security/stability/correctness, requires an amount of time approaching the time spent if they had just done it themselves.

I.e. senior review is valuable, but it does not make bad code good.

This is one major facet of probably the single biggest problem of the last couple decades in system management: The misunderstanding by management that making something idiot proof means you can now hire idiots (not intended as an insult, just using the terminology of the phrase "idiot proof").

ardeaver16d ago

When I was really early in my career, a mentor told me that code review is not about catching bugs but spreading context (i.e. increasing bus factor.) Catching bugs is a side effect, but unless you have a lot of people review each pull request, it's basically just gambling.

The more expensive and less sexy option is to actually make testing easier (both programmatically and manually), write more tests and more levels of tests, and spend time reducing code complexity. The problem, I think, is people don't get promoted for preventing issues.

VorpalWay15d ago

This depends on the industry. I work on industrial machine control software, and we spend a huge amount of time on tests. We have to for some parts (human safety crtitical), but other parts would just be expensive if they failed (loss of income for customers, and possibly damaged equipment).

The key to making this scalable is to make as few parts as possible critical, and make the potential bad outcomes as benign as possible. (This lets you go to a lower rating in whatever safety standard applies to your industry.) You still need tests for the less critical parts though, while downtime is better than injury, if you want to sell future machines to your customers you need to have a good track record. At least if you don't want to compete on cost.

1 more reply

bluGill16d ago

> people don't get promoted for preventing issues.

they do - but only after a company has been burned hard. They also can be promoted for their area being enough better that everyone notices.

still the best way to a promotion is write a major bug that you can come in at the last moment and be the hero for fixing.

3 more replies

asdfman12315d ago

One of the major things code review does is prevent that one guy on your team who is sloppy or incompetent from messing up the codebase without singling him out.

If you told someone "I don't trust you, run all code by me first" it wouldn't go well. If you tell them "everyone's code gets reviewed" they're ok with it.

1 more reply

kqr15d ago

Code review is great for spreading context, but they also are very good at finding bugs. If you want to find bugs, review is one of the best ways to do it. https://entropicthoughts.com/code-reviews-do-find-bugs

bloppe15d ago

I think of code review more about ensuring understandability. When you spend hours gathering context, designing, iterating, debugging, and finally polishing a commit, your ability to judge the readability of your own change has been tainted by your intimate familiarity with it. Getting a fresh pair of eyes to read it and leave comments like "why did you do it this way" or "please refactor to use XYZ for maintainability", you end up with something more that will be easier to navigate and maintain by the junior interns who will end up fixing your latent bugs 5 years later.

1 more reply

8note16d ago

> The problem, I think, is people don't get promoted for preventing issues.

cleaning up structural issues across a couple orgs is a senior => principal promo ive seen a couple of times

wiseowise15d ago

> When I was really early in my career, a mentor told me that code review is not about catching bugs but spreading context (i.e. increasing bus factor.) Catching bugs is a side effect

This bs is what I say my juniors when I want them to fuck off with their reviews and focus on my actual work.

Sounds very insightful though.

marginalia_nu16d ago

Expert reviews are just about the only thing that makes AI generated code viable, though doing them after the fact is a bit sketchy, to be efficient you kinda need to keep an eye on what the model is doing as its working.

Unchecked, AI models output code that is as buggy as it is inefficient. In smaller green field contexts, it's not so bad, but in a large code base, it's performs much worse as it will not have access to the bigger picture.

In my experience, you should be spending something like 5-15X the time the model takes to implement a feature on reviewing and making it fix its errors and inefficiencies. If you do that (with an expert's eye), the changes will usually have a high quality and will be correct and good.

If you do not do that due dilligence, the model will produce a staggering amount of low quality code, at a rate that is probably something like 100x what a human could output in a similar timespan. Unchecked, it's like having a small army of the most eager junior devs you can find going completely fucking ape in the codebase.

locusofself16d ago

If you spend 5-15x the time reviewing what the LLM is doing, are you saving any time by using it?

6 more replies

rectang16d ago

> Expert reviews are just about the only thing that makes AI generated code viable

I disagree, in the sense that an engineer who knows how to work with LLMs can produce code which only needs light review.

* Work in small increments

* Explicitly instruct the LLM to make minimal changes

* Think through possible failure modes

* Build in error-checking and validation for those failure modes

* Write tests which exercise all paths

This is a means to produce "viable" code using an LLM without close review. However, to your point, engineers able to execute this plan are likely to be pretty experienced, so it may not be economically viable.

1 more reply

UncleMeat15d ago

Sadly, the way people become expert in a codebase is through coding. The process of coding is the process of learning. If we offload the coding to AI tools we will never be as expert in the codebase, its complexity, its sharp corners, or its unusual requirements. While you can apply general best practices for a code review you can never do as much as if you really got your hands dirty first.

"Seniors will do expert review" will slowly collapse.

jonnycoder16d ago

I tend to agree. I spent a lot of time revising skills for my brownfield repo, writing better prompts to create a plan with clear requirements, writing a skill/command to decompose a plan, having a clear testing skill to write tests and validate, and finally having a code reviewer step using a different model (in my case it's codex since claude did the development). My last PR was as close to perfect as I have got so far.

Skidaddle16d ago

Just lead with “You are an expert software engineer…”, easy!

raw_anon_111116d ago

In my experience, inefficient code is rarely the issue outside of data engineering type ETL jobs. It’s mostly architectural. Inefficient code isn’t the reason your login is taking 30 seconds. Yes I know at Amazon/AWS scale (former employee) every efficiency matters. But even at Salesforce scale, ringing out every bit of efficiency doesn’t matter.

No one cares about handcrafted artisanal code as long as it meets both functional and non functional requirements. The minute geeks get over themselves thinking they are some type of artists, the happier they will be.

I’ve had a job that requires coding for 30 years and before ther I was hobbyist and I’ve worked for from everything from 60 person startups to BigTech.

For my last two projects (consulting) and my current project, while I led the project, got the requirements, designed the architecture from an empty AWS account (yes using IAC) and delivered it. I didn’t look at a line of code. I verified the functional and non functional requirements, wrote the hand off documentation etc.

The customer is happy, my company is happy, and I bet you not a single person will ever look at a line of code I wrote. If they do get a developer to take it over, the developer will be grateful for my detailed AGENTS.md file.

3 more replies

js816d ago

> requires an amount of time approaching the time spent if they had just done it themselves

It's actually often harder to fix something sloppy than to write it from scratch. To fix it, you need to hold in your head both the original, the new solution, and calculate the difference, which can be very confusing. The original solution can also anchor your thinking to some approach to the problem, which you wouldn't have if you solve it from scratch.

bluGill16d ago

Sloppy code that has been around for a while works. It likely has support for edge cases you forgot about. Often the sloppyness is because of those edge cases.

1 more reply

ummonk15d ago

In fairness though, it does give you good practice for the essential skill of maintaining / improving an old codebase.

unshavedyak15d ago

> For a person (senior or otherwise) to examine code or configuration with the granularity required to verify that it even approximates the result of their own level of experience, even only in terms of security/stability/correctness, requires an amount of time approaching the time spent if they had just done it themselves.

Hell, often it feels slower/worse. Foreign code is easily confusing at first, which slows you down - and bad code quickly gets bewildering and sends you down paths of clarifications that waste time.

SchemaLoad15d ago

So many times I get AI generated PRs from juniors where I don't feel comfortable with the code, I wouldn't do it like this myself, but I can't strictly find faults that I can reject the PR with. Usually it's just a massive amount of code being generated which is extremely difficult to review, much harder than it was for the submitter to generate and send it for review.

Then often it blows up in production. Makes me almost want to blanket reject PRs for being too difficult to understand. Hand written code almost has an aversion to complexity, you'd search around for existing examples, libraries, reusable components, or just a simpler idea before building something crazy complex. While with AI you can spit out your first idea quickly no matter how complex or flawed the original concept was.

1 more reply

steveBK12316d ago

Right, code reviews should already have been happening with human written junior code.

If AI is a productivity boost and juniors are going to generate 10x the PRs, do you need 10x the seniors (expensive) or 1/10th the juniors (cost save).

A reminder that in many situations, pure code velocity was never the limiting factor.

Re: idiot prooofing I think this is a natural evolution as companies get larger they try to limit their downside & manage for the median rather than having a growth mindset in hiring/firing/performance.

AgentOrange123416d ago

Seniors are going to need to hold Juniors to a high bar for understanding and explaining what they are committing. Otherwise it will become totally soul destroying to have a bunch of juniors submitting piles of nonsense and claiming they are blocked on you all the time.

sethops116d ago

This was challenging enough pre AI. Now that everybody has an AI slop button, the life of an effective code reviewer just got so much more miserable.

esafak15d ago

Make them first go through an AI reviewer that is informed by the code base's standards.

1 more reply

jetrink16d ago

It could create the right sort of incentives though. If I'm a junior and I suddenly have to take my work to a senior every time I use AI, I'm going to be much more selective about how I use it and much more careful when I do use it. AI is dangerous because it is so frictionless and this is a way to add friction.

Maybe I don't have the correct mental model for how the typical junior engineer thinks though. I never wanted to bug senior people and make demands on their time if I could help it.

devonbleak16d ago

What you're actually going to see is seniors inundated by slop and burning out and quitting because what used to be enjoyable solving of problems has become wading through slop that took 10 minutes to generate and submit but 30+ minutes to understand and write up a critique for it.

4 more replies

onion2k16d ago

I.e. senior review is valuable, but it does not make bad code good.

I suspect that isn't the goal.

Review by more senior people shifts accountability from the Junior to a Senior, and reframes the problem from "Oh dear, the junior broke everything because they didn't know any better" to "Ah, that Senior is underperforming because they approved code that broke everything."

bs728016d ago

This is also why I think we will enter a world without Jr's. The time it takes for a Senior to review the Jr's AI code is more expensive than if the Sr produced their own AI code from scratch. Factor in the lack of meetings from a Sr only team, and the productivity gains will appear to be massive.

Whether or not these productivity gains are realized is another question, but spreadsheet based decision makers are going to try.

czscout16d ago

In this scenario, how might one become a senior without first being a junior? Seniors just pop into existence?

3 more replies

hintymad15d ago

> Review by a senior is one of the biggest "silver bullet" illusions managers suffer from

Especially in a big co like Amazon, most senior engineers are box drawers, meeting goers, gatekeepers, vision setters, org lubricants, VP's trustees, glorified product managers, and etc. They don't necessarily know more context than the more junior engineers, and they most likely will review slowly while uncovering fewer issues.

raw_anon_111116d ago

Why only AI generated code? I wouldn’t let a junior or mid level developer’s code go into production without at least verifying the known hotspots - concurrency, security, database schema, and various other non functional requirements that only bite you in production.

I’m probably not going to review a random website built by someone except for usability, requirements and security.

happytoexplain16d ago

I didn't restrict my opinion to genAI code. I'm expressing a general thought that was relevant before AI. AI is just salient in relation to it.

I also said senior review is valuable, but I'm not 100% sure if you're implying I didn't.

OrangeDelonge15d ago

I’ve seen hundreds of PR’s produced by a junior and reviewed by a mid lvl go into prod. I don’t see any problem with that

belval16d ago

The unwritten thing is that if you need seniors to review every single change from junior and mid-level engineers, and those engineers are mostly using Kiro to write their CRs, then what stops the senior from just writing the CRs with Kiro themselves?

qnleigh16d ago

I seriously doubt that they think senior reviewers will meticulously hunt down and fix all the AI bugs. Even if they could, they surely don't have the time. But it offers other benefits here:

1. They can assess whether the use of AI is appropriate without looking in detail. E.g. if the AI changed 1000 lines of code to fix a minor bug, or changed code that is essential for security.

2. To discourage AI use, because of the added friction.

kaffekaka14d ago

Point 1 is important. Seniors (or any developer really) with experience in the code base in question can judge pretty quickly if a CR seems reasonable.

zamalek15d ago

> Review by a senior is one of the biggest "silver bullet" illusions managers suffer from.

My manager has been urging us to truly vibe code, just yesterday saying that "language is irrelevant because we've reached the point where it works - so you don't need to see it." This article is a godsend; I'll take this flawed silver bullet any day of the week.

mrothroc15d ago

Senior review can definitely help, regardless if the code comes from a junior or an LLM. We've done this since the dawn of time. However, it doesn't scale and since LLM volume far exceeds what juniors can do, you end up overwhelming the seniors, who are normally overbooked anyway.

The other problem is that the type of errors LLMs make are different than juniors. There are huge sections of genuinely good code. So the senior gets "review fatigue" because so much looks good they just start rubber stamping.

I use an automated pipeline to generate code (including terraform, risking infrastructure nukes), and I am the senior reviewer. But I have gates that do a whole range of checks, both deterministic and stochastic, before it ever gets to me. Easy things are pushed back to the LLM for it to autofix. I only see things where my eyes can actually make a difference.

Amazon's instinct is right (add a gate), but the implementation is wrong (make it human). Automated checks first, humans for what's left.

yifanl16d ago

Senior reviews are useful, but as I understand it, Amazon has a fairly high turnover rate, so I wonder just how many seniors with deep knowledge of the codebase they could possibly have.

tartoran16d ago

From engineers are interchangeable to high turnover are decisions that the company took. The payback time always comes at some point.

grvdrm16d ago

What a statement at the end. You are absolutely right.

I hear “x tool doesn’t really work well” and then I immediately ask: “does someone know how to use it well?” The answer “yes” is infrequent. Even a yes is often a maybe.

The problem is pervasive in my world (insurance). Number-producing features need to work in a UX and product sense but also produce the right numbers, and within range of expectations. Just checking the UX does what it’s supposed to do is one job, and checking the numbers an entirely separate task.

I don’t many folks that do both well.

hnthrow028734516d ago

>requires an amount of time approaching the time spent if they had just done it themselves.

I would actually say having at least 2 people on any given work item should probably be the norm at Amazon's size if you also want to churn through people as Amazon does and also want quality.

Doing code reviews are not as highly valued in terms of incentives to the employees and it blocks them working on things they would get more compensation for.

lokar16d ago

The goal of Sr code review is not to make the code better, it's to make the author better.

skeeter202016d ago

Agree but even broader: authors. I always viewed reviews as targeting Brook's less famous findings about the optimal team size being one, and asking how can we get better at building systems too big for the individual. I think code review is about shared, consistent understanding with catching bugs a nice side effect (or justification for the bean counters).

1 more reply

sumeno15d ago

That's not going to work when the author is an LLM

strogonoff15d ago

I would argue that the amount of time needed for a proper review exceeds the amount of time needed to just do it yourself.

When reviewing, you need to go through every step of implementing it yourself (understand the problem, solve the problem, etc.), but you additionally need to 1) understand someone else’s solution and 2) diff your solution against theirs to provide meaningful feedback.

Review could take roughly equivalent time, but only if I am allowed to reject/approve in a binary way (“my solution would not be the same, therefore denied”) which is not considered appropriate in most places.

This is why I am not a fan of being the reviewer.

radiator16d ago

Deming's point 3 (of 14): Cease dependence on inspection to achieve quality. Eliminate the need for massive inspection by building quality into the product in the first place.

mrbonner16d ago

What stops the senior from using AI to review the AI generated code the junior published?

tartoran16d ago

That’s something that the junior can do. What companies want to do is put responsibility on someone who has more knowledge and skin in the game

femiagbabiaka16d ago

the outcome of the review isn't just that the code gets shipped, it's knowledge transfer from the senior engineer to the junior engineers that then creates more senior engineers

lionkor15d ago

> [...] requires an amount of time approaching the time spent if they had just done it themselves.

Yes, but with the caveat that the junior learns and eventually can become the senior.

rafaelmn15d ago

Eventually they'll get rid of juniors and mid level devs because realistically it's easier to review when you're the one doing the prompting.

remarkEon16d ago

Other than “don’t hire idiots”, what is the solution to this problem? I agree with you, and this particular systems management issue is not constrained to software.

happytoexplain15d ago

I don't know.

We need smart people at every layer. If leadership isn't in that category, it spreads to all layers.

I don't know how we defeat capitalism to incentivize smart leadership. It's fundamentally opposed to market forces.

rco878615d ago

Going to systemically turn off your senior staff over time also. Most Senior Engineers aren't that interested in doing even more code review.

mmcconnell161815d ago

Also, have massive layoffs every few months just to keep people on edge. AWS wants people to leave with RTO and badging policies, comp range shifts lower unless you have year over year ratings, and an obsessive push to force AI into every process. Top talent is leaving and will continue to leave AWS.

1 more reply

yalogin15d ago

Don’t forget that this auto generated code will have subtle bugs and feels complete at the outset

munk-a15d ago

Reviewing code changes (generally) takes more time than writing code changes for a pretty significant chunk of engineers. If we're optimizing slop code writing at the expense of more senior's time being funneled into detailed reviews then we're _doing it wrong_.

montymaty9d ago

dfasdsaasdasdasd

napolux16d ago

LGTM

RamblingCTO16d ago

Who said PR reviews need to solve all the things and result in proof against idiots?

So you're saying that peer reviews are a waste of time and only idiots would use/propose them?

happytoexplain16d ago

None of that, sorry if I wasn't clear.

To partially clarify: "Idiot proof" is a broad concept that here refers specifically to abstraction layers, more or less (e.g. a UI framework is a little "idiot proof"; a WYSIWYG builder is more "idiot proof"). With AI, it's complicated, but bad leadership is over-interpreting the "idiot proof" aspects of it. It's a phrase, not an insult to users of these tools.

33MHz-i48615d ago

In case it isn’t completely obvious from this, it is indeed hellish to work there. Most of AWS has a 2 reviewer requirement. If AI is writing most of the code (and it is because most Amazon code is copypasta boilerplate) you need 3 developers to sign off to ship anything. But of course due to headcount attrition, managers have ~1.5 developers to a project. Meanwhile the L8 manager is doing nothing except stack ranking each level of engineers according to number of commits merged & customer facing features shipped, and firing 15% of the bottom at the end of each year. There is no notion of subject matter expertise or technical depth, theyre happy to replace whoever with fresh-grads (theyre all just cogs anyway right!). Between that and voluntary departures, teams having 80-100% turnover every 5 years is basically par.

Also while this is happening most developers are getting constantly hammered by operational issues and critical security tasks because 1) the legacy toolchain imports 6 different language package ecosystems and 2)no one ever pays down tech debt in legacy code until its a high severity ticket count in a KPI dashboard visible to the senior management.

mikert8915d ago

The thing is, this management philosophy worked when AWS knew what they needed to build and just needed to execute with top notch operations.

But now with AI, they are getting disrupted. Most AWS services might become obsolete, why does an ai need these janky higher levels abstractions AWS piles on.

So now they need innovation, but the company isn’t set up for it. They are forcing short deadlines for product launches that don’t matter

33MHz-i48615d ago

its not even AI. most of the cloud offerings are commodities now.

the marginal technological direction is determined by middle managers whoes primary motivation is “what new customer facing feature can I launch at this years re:invent and build a little empire” (of course this is a shrinking offering as tech debt and complexity pile up)

junior engineers are burned and churned on execution, seniors are project managers, principals just do high level reviews & high level fire fighting (note not actually leading the tech)

director and above just their spend time on “what to kill” or “who to fire” as priorities change every 6 months

prakhar89716d ago

From the amazon I know, people only care about a. not getting fired and b. promotions. For devs, the matrix looks like this:

1. Shipping: deliver tickets or be pipped.

2. Having Less comments on their PRs: for some drastically dumb reason, having a PR thoroughly reviewed is a sign of bad quality. L7 and above use this metric to Pip folks.

3. Docs: write docs, get them reviewed to show you're high level.

Without AI, an employee is worse off in all of the above compared to folks who will cheat to get ahead.

I can't see how "requesting" folks for forego their own self-preservation will work. especially when you've spent years pitting people against each other.

malfist16d ago

Not only is having too many comments on your PRs bad for you, but so is not leaving comments on other people's PRs. Both are metrics used

dude25071115d ago

I'd leave lots of comments out of spite whenever I would feel my PRs had been treated unfairly. If I am going down, you all are coming with me.

2 more replies

embedding-shape15d ago

> 2. Having Less comments on their PRs: for some drastically dumb reason, having a PR thoroughly reviewed

I'm very far away from liking Amazon's engineering culture and general work culture, but having PRs with countless of discussions and feedback on it does signal that you've done a lot of work without collaborating with others before doing the work. Generally in teams that work well together and build great software, the PRs tend to have very little on them, as most of the issues were resolved while designing together with others.

joeframbach15d ago

I've been involved in so many CRs where I've given feedback over 10 revs, then the submitter cancels the CR and files a new one, for the metrics.

tom_15d ago

If the review tooling is any good, getting the code somewhere it can see it is a convenient way for people to give and receive feedback. As the saying goes, the system is what it does!

(And/but yes/no, I have never worked at NAGFAM...)

ex-aws-dude15d ago

Eh I feel like there are some features where you just have to get in the weeds to even design it and the code review itself is part of the process of designing/figuring out the edge cases.

1 more reply

dboreham15d ago

4. Don't work in the corporate equivalent of The Hunger Games.

99990000099915d ago

At least in the past the idea is you do the dance , vest and leave.

I missed my FAANG chance during the good years. No retirement for me!

rk0615d ago

if someone responded with a lot of PR comments, I would set up a meeting directly and avoid unnecessary discussion on PR.

sdevonoes16d ago

Reviewing AI generated code at PR time is a bottleneck. It cancels most of the benefits senior leadership thinks AI offers (delivery speed).

There’s also this implicit imbalance engineers typically don’t like: it takes me 10 min to submit a complete feature thanks to Claude… but for the human reviewing my PR in a manual way it will take them 10-20 times that.

Edit: at the end real engineers know that what takes effort is a) to know what to build and why, b) to verify that what was built is correct. Currently AI doesn’t help much with any of these 2 points.

The inbetweens are needed but they are a byproduct. Senior leadership doesn’t know this, though.

hard2416d ago

Indeed. My view as a CEO is, if you are still reviewing the code yourself then what use is it that you can produce a bunch of text at a faster rate?

I'd prefer people wrote good quality code and checked it as they went along... whilst allowing room for other stuff they didn't think of to come to the front. The production process of using LLMs is entirely different, in its current state I don't see the net benefit.

E.g. if you have a very crystalised vision of what you want, why would I want an engineer to use an LLM to write it, when the LLM can't do both raw production and review? Could this change? Sure. But there's no benefit for me personally to shift toward working that way now - I'd rather it came into existence first before I expose myself to incremental risk that affects business operations. I want a comprehensive solution.

tech_tuna13d ago

You should lay off your engineering team and do it all in Lovable amigo.

FromTheFirstIn14d ago

Where are you CEO?

1 more reply

beardedetim16d ago

This is what I don't understand about this policy. There's no way a senior has enough spare capacity to be the gate keeper on every PR made by AI below them. So now we are just making it so the senior people use more AI to keep up but now they're to blame for letting it happen.

It sounds like a piss poor deal for seniors unless senior engineer now means professional code reviewer.

malfist15d ago

That's amazon in a nutshell though. Create conflicting metrics for performance, push credit up and responsibility down, punish everyone below you for not meeting the double standards

1 more reply

rhubarbtree15d ago

Most AI advocates I know believe this period, reviewing every line of code, will come to an end when models improve. So there will be no bottleneck. We will simply test and ship, with AI doing all the code and review.

bandrami15d ago

Possibly, but it doesn't make sense to restructure things in advance of that actually happening, particularly since there's no roadmap for getting there right now.

1 more reply

qnleigh16d ago

Surely they know all this. They're worried about AI code degrading codebase quality, so they're putting on the brakes.

radiator16d ago

> Senior leadership doesn’t know this, though.

Well, you'd think senior leadership should know how their business and their people work.

2 more replies

cmiles816d ago

The optics here are really bad for Amazon. The continuing mass departures of long tenured folks, second-rate AI products, and a string of bad outages paints a picture that current leadership is overseeing a once respected engineering train flying off the tracks.

News from the inside makes it sound like things are getting pretty bad.

the_biot15d ago

> The continuing mass departures of long tenured folks

You mean senior programmers that have been there for ages don't want to spend their time reviewing AI slop? Who'd a thunk it!

philip120915d ago

I think the deeper need is a "self-review" flow.

People push AI-reviewed code like they wrote it. In the past, "wrote it" implies "reviewed it." With AI, that's no longer true.

I advocate for GitHub and other code review systems to add a "Require self-review" option, where people must attest that they reviewed and approved their own code. This change might seem symbolic, but it clearly sets workflows and expectations.

billbrown15d ago

Yes, underthinking is rampant. Glancing at "AI" output is not reviewing code: you have to grok it (in the Heinlein sense) in order to treat it as your own.

1 more reply

Tyr4215d ago

Heck, doing a self review when you wrote the code catches stuff like forgetting debug prints.

2 more replies

jeremyjh15d ago

We have it in a checklist in PR template. I can’t imagine a fiat class feature that would be much more meaningful. It surprised me to learn there are developers who have to be reminded to review their own code and test it, but does seem to help.

kuekacang15d ago

I've been lucky to discover git relatively late and sublime merge relatively soon. It seems like separating the concern of editing and reviewing code is making me consider each more as separate thing.

It also makes me more comfortable figuring out how a project's pull acceptance are like (maybe due to how fast local ui is compared to web-based git). On the other hand, I can only run some basic git cli commands and can't quickly comprehend raw text-based diff, especially when encountering some linux patches from time to time.

paxys15d ago

If someone was confident enough to push through an AI change without even reading/reviewing it themselves adding more buttons to the UI isn't going to change anything.

1 more reply

cvak14d ago

TBH, I do PRs on repos with no other devs just do do self-review, and I did that before AI.

8note15d ago

the tooling doesnt make it easy currently.

working at amazon, when I wanted to review code myself through the CR tool, Id still end up publishing it to the whole team and have to add some title shenanigans saying it was a self review or WIP and for others to not look at it yet

koinedad15d ago

Self review is #1

1 more reply

ritlo16d ago

The only way to see the kinds of speed-up companies want from these things, right now, is to do way too little review. I think we're going to see a lot of failures in a lot of sectors where companies set goals for reduced hours on various things they do, based on what they expected from LLM speed-ups, and it will have turned out the only way to hit those goals was by spending way too little time reviewing LLM output.

They're torn between "we want to fire 80% of you" and "... but if we don't give up quality/reliability, LLMs only save a little time, not a ton, so we can only fire like 5% of you max".

(It's the same in writing, these things are only a huge speed-up if it's OK for the output to be low-quality, but good output using LLMs only saves a little time versus writing entirely by-hand—so far, anyway, of course these systems are changing by the day, but this specific limitation has remained true for about four years now, without much improvement)

SoftTalker16d ago

So will it turn out that actually writing code was never the time sink in the first place?

That has always been my feeling. Once I really understand what I need to implement, the code is the easy part. Sure it takes some time, but it's not the majority. And for me, actually writing the code will often trigger some additional insight or awareness of edge cases that I hadn't considered.

3 more replies

hard2416d ago

My prediction is a concorde-like incident is going to shatter trust and make people re-think their expectations of the capabilities of LLMs and their abilities of the present.

Essentially something big has to happen that affects the revenue/trust of a large provider of goods, stemming from LLM-use.

They wont go away entirely. But this idea that they can displace engineers at a high-rate will.

2 more replies

petterroea15d ago

I feel bad for the seniors who have to take on this workload. The general pattern I am seeing is that seniors at "AI-first" companies are being held back from doing their work by reviewing junior PRs, who are now able to ship much more code they don't understand the badness of.

Mentoring Juniors is an important part of the job and crucial service to the industry, but juniors equipped with LLMs make the deal a bit more sour. Anecdotally, they don't really remember the feedback as well, because they weren't involved in writing the code. Its burnout-inducing to see your hard work and feedback go in one ear and out another.

I personally know people looking to jump ship because they waste too much time at their current employer on this.

wiseowise15d ago

> Mentoring Juniors is an important part of the job and crucial service to the industry

Not really.

lokar16d ago

If this is true, it misunderstands the primary goals of code review.

Code review should not be (primarily) about catching serious errors. If there are always a lot of errors, you can’t catch most of them with review. If there are few it’s not the best use of time.

The goal is to ensure the team is in sync on design, standards, etc. To train and educate Jr engineers, to spread understanding of the system. To bring more points of view to complex and important decisions.

These goals help you reduce the number of errors going into the review process, this should be the actual goal.

rossdavidh15d ago

As Deming once said in regard to manufacturing inspections: "Inspection does not improve the quality, nor guarantee quality. Inspection is too late. The quality, good or bad, is already in the product."

The fact that software is "soft" makes it seem like this doesn't apply, but it does, not least because of the fact that once you have gone down the wrong path with software design, it is very difficult to pull back and realize you need to go down an entirely different one.

1 more reply

827a15d ago

> Company that lays-off 20% of its staff every year in an attempt to "reduce inefficiency" and "remain agile in the adoption of new technologies and workflows" finds they cannot run a stable service, have more inefficiency than ever, and have also failed to establish leadership in the adoption of any new technologies or workflows. They plan to solve these problems by introducing more inefficiency (making your most expensive employees review the work of others).

We love this for Amazon, they're a very strong company making bold decisions.

znpy15d ago

"Make senior engineer sign off ai-assisted changes" sounds incredibly weird.

First thing that comes to mind is: reminds me of those movie where some dictatorship starts to crumble and the dictator start being tougher and tougher on generals, not realizing the whole endeavor is doomed, not just the current implementation.

Then again, as a former amazon (aws) engineer: this is just not going to work. Depending how you define "senior engineer" (L5? L6? L7?) this is less and less feasible.

L5 engineers are already supposed to work pretty much autonomously, maybe with L6 sign-off when changes are a bit large in scope.

L6 engineers already have their own load of work, and a fairly large amount of engineers "under" them (anywhere from 5 to 8). Properly reviewing changes from all them, and taking responsibility for that, is going to be very taxing on such people.

L7 engineers work across teams and they might have anywhere from 12 to 30 engineers (L4/5/6) "under" them (or more). They are already scarce in number and they already pretty much mostly do reviews (which is proving not sufficient, it seems). Mandating sign-off and mandating assumption of responsibility for breaking changes means these people basically only do reviews and will be stricter and stricter[1] with engineers under them.

L8 engineers, they barely do any engineering at all, from what I remember. They mostly review design documents, in my experience not always expressing sound opinions or having proper understanding of the issues being handled.

In all this, considering the low morale (layoffs), the reduced headcount (layoffs) and the rise in expectations (engineers trying harder to stay afloat[2] due to... layoffs)... It's a dire situation.

I'm going to tell you, this stinks A LOT like rotting day 2 mindset.

----

1. keep in mind you can't, in general, determine the absence of bugs

2. Also cranking out WAY MUCH MORE code due to having gen-ai tools at their fingertips...

paxys15d ago

Someone should teach the decision makers how pipelines work. If AI-created diffs are being churned out at 10x the previous rate but manual reviews are the bottleneck then the overall system is producing at the exact same rate as before. The only thing you have added is cost, uncertainty and engineers being less familiar with the system.

booleandilemma15d ago

The next thing these geniuses will think of will be to have AI review the diffs.

Herring15d ago

Check back in a year or two. Just the other day we had Claude finding significant bugs in Firefox. https://news.ycombinator.com/item?id=47273854

Lalabadie16d ago

I'm not sure the sustainable solution is to treat an excess of lower-quality code output as the fixed thing to work with, and operationalize around that, but sure.

gtowey16d ago

It's the same as the offshoring episode of the early 2000's. There is such a massive financial incentive to somehow make the low quality code work. And they will try to resist the reality that it's a huge net negative for as long as they can.

sethops116d ago

> The response for now? Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off.

So basically, kill the productivity of senior engineers, kill the ability for junior engineers to learn anything, and ensure those senior engineers hate their jobs.

Bold move, we'll see how that goes.

whateveracct16d ago

Juniors could just code things the old fashioned way. It isn't hard. And if they do find it too hard, they aren't cut out for this job.

3 more replies

dragonelite16d ago

Accelerate a person speed toward being burned out..

1 more reply

dude25071115d ago

But Amazon is something you tolerate for a year or two early in the career, before moving somewhere better (which is anywhere else)?

almostdeadguy16d ago

I'm sorry what? Junior engineers can't learn anything without using AI assistants (or is the implication that having seniors review their code makes them incapable of learning?) and senior engineer would hate their jobs reviewing more code from their teammates? What reality do people live in now?

2 more replies

rglover15d ago

The amount of time and money being wasted chasing this dragon is unreal.

dana32115d ago

Its useful but it makes wrong assumptions, not checking the code is essentially gambling.

alexyz1215d ago

there's nothing else left to chase apparently

ndr42OP16d ago

I think the problem of responsibility will come for many more companies sooner than later. It is possible that some of the alleged efficacy gains by using ai are not so big anymore when someone has to be accountable for it.

AlotOfReading16d ago

I'm not surprised by the outages, but I am surprised that they're leaning into human code review as a solution rather than a neverending succession of LLM PR reviewers.

I wonder if it's an early step towards an apprenticeship system.

bilbo0s16d ago

You shouldn't be surprised.

How else would they train the LLM PR reviewers to their standards?

I've never personally been in the position, because my entire career has been in startups, but I've had many friends be in the unenviable position of training their replacements. Here's the thing though, at least they knew they were training their replacements. We could be looking at a potential future where an employee or contractor doesn't realize s/he is actually just hired to generate training data for an LLM to replace them, and then be cut.

monarchwadia16d ago

Interesting. How would it be an early step towards an apprenticeship system?

mentos15d ago

What are we going to do about software for critical infrastructure in the coming decade?

Feels inevitable that code for aviation will slowly rot from the same forces at play but with lethal results.

rhubarbtree15d ago

It will be written much the same as now, but using AI to improve quality through code inspection etc.

Just because nearly all software is going to be written by AI, does not mean critical infrastructure will be.

1 more reply

franktankbank15d ago

Cabin in the woods.

AlexeyBrin16d ago

I wonder how this will work in practice. Say I'm a senior engineer and I produce myself thousands of lines of code per day with the help of LLMs as mandated by the company. I still need to presumably read and test the code that I push to production. When will I have time to read and evaluate similar amounts of code produced by a junior or a mid level engineer ?

MichaelRo15d ago

>> I wonder how this will work in practice. Say I'm a senior engineer and I produce myself thousands of lines of code per day with the help of LLMs as mandated by the company.

LOL, it's the age old "responsibility without authority". The pressure to use AI will increase and basically you'll be fired for not using it. Simultaneously with the pressure to take the blame when AI fucks up and you can't keep up with the bullshit, leading you to get fired. One way or the other, get some training on how to stack shelves at the supermarket because that's how our future looks, one way or the other.

hrmtst9383715d ago

Sign-off requirements like this quickly become performative when LLMs generate code faster than anyone can review it in detail. Relying on human oversight at scale is unrealistic unless the volume of changes drops or the review process itself becomes more automated.

1 more reply

quantified16d ago

This is an important bottleneck. You can have LLM-based reviewers help you. But unless you yourself understood your thousands of lines, it's "somebody else's" code and that somebody else cannot be fired or taken to court.

The presumably human mid-level or junior engineer has their own issues with this, but the point of the LLM is that you don't need that engineer. For productivity purposes, the dev org only needs the seniors to wrangle all the LLMs they can. That doesn't sustain, so a couple of more-junior engineers can do similar work to mature.

zcw10016d ago

I just met a guy from Amazon this past weekend who was bragging, "We've got unlimited access to LLMs and our developers have 10 agents going at a time.". I tried telling him it wasn't all unicorns and rainbows but I didn't get the impression he cared and just kept crapping out skittles.

agoodusername6315d ago

Because he doesn’t.

The impression I get from SWEs I’ve met throughout my life is that most of them don’t actually care about their job. They got in because it paid well and demand was plentiful.

1 more reply

CodingJeebus16d ago

I'm at a small company struggling with this problem. Fundamentally, we have a limited context and AI is capable of generating tremendous amounts of output that exceed our ability to deeply process.

I find myself context-switching all the time and it's pretty exhausting, while also finding that I'm not retaining as much deep application domain knowledge as I used to.

On the surface, it's nice that I can give my LLM a well-written bug ticket and let it loose since it does a good job most of the time. But when it doesn't do a good job or it's making a change in an area of the codebase I'm not familiar with, auditing the change gets tiring really fast.

julienchastang16d ago

> best practices and safeguards are not yet fully established

The way I am working with AI agents (codex) these days is have the AI generate a spec in a series of MD documents where the AI implementation of each document is a bite sized chunk that can be tested and evaluated by the human before moving to the next step and roughly matches a commit in version control. The version control history reflects the logical progression of the code. In this manner, I have a decent knowledge of the code, and one that I am more comfortable with than one-shotting.

danjl15d ago

Yes, more time on up front spec and plan building. Bite sized specifically to fit within the context window of a single implementation session. Each step should have a verification process that includes new tests.

Prior to each step, I prompt the AI to review the step and ask clarifying questions to fill any missing details. Then implement. Then prompt the AI after to review the changes for any fixes before moving on to the next step. Rinse, repeat.

The specs and plans are actually better for sharing context with the rest of the team than a traditional review process.

I find the code generated by this process to be better in general than the code I've generated over my previous 35+ years of coding. More robust, more complete, better tested. I used to "rush" through this process before, with less upfront planning, and more of a focus on getting a working scaffold up and running as fast as possible, with each step along the way implemented a bit quicker and less robustly, with the assumption I'd return to fix up the corner cases later.

captainkrtek15d ago

One challenge with code review as an antidote to poor quality gen-AI code, is that we largely see only the code itself, not the process or inputs.

In the pre-gen-AI days, if an engineer put up a PR, it implied (somewhat) they wrote their code, reviewed it implicitly as they wrote it, and made choices (ie: why is this the best approach).

If Claude is just the new high level programming language, in terms of prompting in natural language, the challenge is that we're not reviewing the natural language, we're reviewing the machine code without knowing what the inputs were. I'm not sure of a solution to this, but something along the lines of knowing the history of the prompting that ultimately led to the PR, the time/tokens involved, etc. may inform the "quality" or "effort" spent in producing the PR. A one-shotted feature vs. a multi-iteration feature may produce the same lines of code and general shape, but one is likely to be higher "quality" in terms of minimal defects.

Along the same lines, when I review some gen-AI produced PR, it feels like I'm reading assembly and having to reverse how we got here. It may be code that runs and is perfectly fine, but I can't tell what the higher level inputs were that produced it, and if they were sufficient.

throwaw1216d ago

If Seniors are going to review every GenAI generated code, how do they keep up with the volume of changes?

So you have 2 systems of engineers: Sr- and Sr+

1. Both should write code to justify their work and impact

2. Sr- code must be reviewed by Sr+

What happens:

a. Sr+ output drops because review takes their time more and more

b. Sr+ just blindly accepts because of the volume is too high, and they should also do their own work

c. Sr+ asks Sr- to slow-down, then Sr- can get bad reviews for the output, because on average Sr+ will produce more code

I think (b) will happen

dude25071116d ago

I knew this would happen.

Take a perfectly productive senior developer and instead make him be responsible for output of a bunch of AI juniors with the expectation of 10x output.

frogperson16d ago

makes me want to vomit. I am not spending more time reviewing code than the "author" spent creating it. Ill just leave the industry if that happens.

1 more reply

fmajid15d ago

If they do not also increase the senior devs’ allotted time for code reviews to make up for the increased volume of changes due to increased productivity of the junior to mid level devs, or hire more seniors, this will just lead to burnout (on top of Amazon’s already high levels) and scapegoating seniors for having waved through a change because they materially can’t review them fast enough.

8note15d ago

Id maybe consider spinning out a different job role for like "review engineer" who's not busy making strategic decisions and near term planning, just making sure the code is actually good

still within the engineering IC role, but on a different track

2 more replies

LogicFailsMe16d ago

For the good of the company's future, all code should be reviewed by L10s going forward before they are accepted. They're the only ones with enough skin in the game to know what really matters after all.

And from their sagely reviews, we shall train a large language model to ultimately replace them because the most fungible thing at Amazon is the leadership.

VorpalWay15d ago

I'm bewildered by Amazon here. I would assume every change require code review by another enigneer already, as is standard practice in the industry I work in (industrial equipment). Is the change just that it has to be a senior engineer specifically, rather than any engineer? Or did Amazon really not have mandatory code review before?

shepherdjerred15d ago

Code review is mandatory at Amazon

sizzzzlerz15d ago

Who fixes code that gets rejected? Do you simply try again and hope or does someone go into this computer-generated code that they didn't write and do the equivalent of battlefield triage?

And what are they going to do when they've fired all the senior engineers because they make too much money, leaving just juniors and AI?

rhubarbtree15d ago

AI will fix it, same way AI wrote it. At the behest of a human.

When they fire everyone, juniors will fix it with AI.

This is in general. I wouldn’t recommend this at critical services like AWS.

1 more reply

Insanity16d ago

It's only going to get worse with the brain drain as a result of the layoffs. Which will increase the use of AI assisted coding and increase the number of outages related to this.

Imagine having to debug code that caused an outage when 80% is written by an LLM and you now have to start actually figuring out the codebase at 2am.. :)

8note15d ago

but thats what it was like when i started at amazon in 2016?

i think the team i was on was a bit of an outlier in terms of owning 40 dumptser fires at once, and the first time reading any one of them was at 2AM because it was down.

having an LLM give early passes on reading the godawful c++ code that you can tell at a glance that its not gonna work as expected, but you cant tell why, or what expected actually is would have been phenomenal, and gotten me back to sleep at 3 on those codebases rather than 5.

1 more reply

tcbrah16d ago

the funniest part is amazon literally started tying AI usage to performance reviews like 6 months ago and now theyre doing damage control. you cant simultaneously pressure every engineer to use more AI AND be shocked when AI-assisted code breaks prod. pick one lol

PessimalDecimal15d ago

Why can't they?

1 more reply

daxfohl15d ago

You could create an agent template for each incident you've ever had, with context pre-cached with the postmortem report, full code change, and any other information about the incident. Then for every new PR you could clone agents from all those templates and ask whether the PR could cause something similar to the pre-loaded incident. If any of them say yes, reject the PR unless there's a manual override. You'd never have a repeat incident.

Obviously it's probably cost-prohibitive to do an all to all analysis for every PR, but I imagine with some intelligent optimizations around likelihood and similarity analysis something along those lines would be possible and practical.

iLoveOncall15d ago

You vastly underestimate the complexity of systems in a company like Amazon.

COEs and Operation Readiness Reviews are already the documents that you mention, but they are largely useless in preventing incidents.

1 more reply

8note15d ago

code review is too late to give some of that feedback, and design/requirements documents dont have nearly the standardization of presentation and feedback tools for that to be useable.

Amazon does have those things, and has fine tuning on models based on those postmortems.

Noisy reviews are also a problem causer. the PR doesnt know what scale a chunk of code is running at, without having access to 20 more packages and other details.

kmg_finfolio16d ago

The accountability problem is real but I think it's slightly different from what's being described. The issue isn't just "who signs off"; it's that the reasoning behind a change becomes invisible when AI generates it. A senior engineer can approve output they don't fully understand, and six months later when something breaks, nobody can reconstruct why that decision was made. Human review works when the reviewer can actually interrogate the logic. At LLM-assisted velocity, that bar gets harder to clear every month.

dragonelite16d ago

Expect a shitload of AI powered code review products the next 18 months.

daheza16d ago

Create the problem and then create the solution.

2 more replies

gdulli16d ago

"Why don't they just make the plane out of the black box?"

1 more reply

hard2416d ago

This is incredibly circular lol...

booleandilemma15d ago

Emphasis on shit.

AlexeyBrin16d ago

You mean like what Anthropic announced yesterday ? Code Review can review your code for $15 - $25 per review.

So now, you can speed up using Claude Code and use Code Review to keep it in check.

mancerayder15d ago

Is the rebuttal posted anywhere? I collapsed the huge first few threads but nothing is there. True or false, Amazon are saying it's not true that it's due to AI, and in fact their change in operational processes to add review is broader:

https://www.aboutamazon.com/news/aws/aws-service-outage-ai-b...

mhogers16d ago

.agentignore/.agentnotallowed file

force agents to not touch mission critical things, fail in CI otherwise

let it work on frontends and things at the frontier of the dependency tree, where it is worth the risk

readthemanual16d ago

a) what happens if there is change that hasn't been encountered yet so it's not in .agentnotallowed? b) is there a guarantee that something described in these files won't be touched? I've seen examples when agents directly violate these rules, profusely apologising after they get caught on it.

1 more reply

newobj15d ago

Speed of code-writing was never the issue at Amazon or AWS. It was always wrong-headed strategic directions, out to lunch PMs, dogshit testing environment, stakeholder soup, high turnover, bureaucracy, a pantheon of legacy systems, insane operational burdens, garbage tooling, and last but not last -- designing for inter-system failure modes, which let's be real, AI has no chance of having context for -- and so on...

Imagine if the #1 problem of your woodworking shop is staff injuries, and the solution that management foists on you is higher RPM lathes.

butILoveLife16d ago

Maybe its my 1 buddy that works at amazon, but they seemed extremely slow to adopt LLMs. Big ships take a long time to turn, but this seemed hostile.

I am seeing this mindset still, with AI Agents. I imagine they will slowly realize they need to use this stuff to be competitive, but being slow to adopt AI seems like it could have been the source of this.

lmc16d ago

LLMs have been garbage for real work until very recently. Doesn't this show they were adopted too soon at amazon?

2 more replies

tracerbulletx15d ago

The way we used to build confidence in what we shipped was by beating our head against it for a week figuring it all out. You really can't have the same confidence with code reviews unless you basically do the same work you'd do to write it by hand for a lot of these things.

fredgrott16d ago

Curious question, how many Amazon Engineers flunk basic CS?

If you know CS you know two things:

1. AI can not judge code either noise or signal, AI cannot tell. 2. CS-wise we use statistic analysis to judge good code from bad.

How much time does it take to take AI output and run the basic statistic tools for most computer languages?

Some juniors need firing outright

8note15d ago

that isnt computer science at all though? if it is, that would be phd research topics, rather than than basics.

maybe as software engineering topics, but thats a different discipline

wiseowise15d ago

“Haha, sure man. CSI, great tv series. Look, just review Jeff’s PR and move that button down, okay? Have a tennis lesson, kbye”

sailfast15d ago

I anticipate they will fix this by adding better AI evaluation tools that work better to test their infra and changes.

In the meantime they will be quite a bit slower I’d imagine.

Also wonder if those seniors will ever get to actually do any engineering themselves now that they’re the bottleneck. :)

d0mine14d ago

16000 layoffs cost just $180 million—it is a win so far

https://x.com/gothburz/status/2031778265958842541

rvz15d ago

Hope this happens at GitHub since there are constant outages on the entire platform.

smy2001116d ago

An outage could cost Amazon ~millions to tens of millions. Most of the time, we want the junior to learn from the outage and fix the process. With AI agent, we can only update the agent.md and hope it will never happen again.

vetrom15d ago

AI seems to be the whipping boy, but to me, it really seems more of a symptom than a cause. At its root, isn't this an issue of a decline in critical thinking?

I do think AI adoption exacerbates said falloff.

fud10115d ago

This is what humans will become, on call to take the blame for AI. It will be less about skill and confidence and more about being on the hook to take the fall for when things go wrong.

AlexandrB16d ago

"We want you to use AI for everything!"

"No, not like that though!"

m3kw915d ago

A year later, they will require AI to sign off engineer changes.

dedoussis16d ago

How do they determine whether a PR is AI-assisted and therefore requires senior review? A junior engineer could still copy-paste AI-generated code and claim it as their own.

emotiveengine16d ago

Right? If they're using some sort of tool, there's always another tool to fool the tool.

PeterStuer15d ago

There's going to be some serious acceleration in senior engineer burnout. Maybe they can use more genAI to support their increased workload.

ChrisArchitect15d ago

Previously: https://news.ycombinator.com/item?id=47319273

nickvec15d ago

So Amazon senior SWEs now have to review every single PR for all intents and purposes? I didn't think Amazon could get worse.

bigbuppo16d ago

Ugh. The Great Oops has never been closer.

joeyguerra15d ago

I’m wonder how many sr engineers are going to quit because they don’t want to read a bunch of code?

monster_truck15d ago

Have they tried simply not writing bugs? I've found that works best for me personally

camillomiller15d ago

Such fun. On top of your already strapped schedule, now you have to bet your career on vibe code that you will now have to spend time reading and debugging. All that instead of a chain of accountability that has people in place instead of stupid bots with fake agency. This is beyond corporate satire.

There was never before a technology capable of convincing leadership of its usefulness despite its constant blunders and despite the low quality of its output. This feels like a corporate mass delusion of unprecedented scale.

varenc15d ago

digression: the long twitter urls make this entire page wider and the text smaller on iOS for me. Feels like a minor bug. Maybe a `overflow-wrap: anywhere` CSS rules needs to be added to URLs.

th2o34i343289716d ago

First Microsoft and now Amazon (eg. their RufusAI is useless compared to the old comment search!)

Has Seattle now become the code-slop capital ? Or is SFO still on top ?

secondcoming15d ago

Has Amazon's advertising TAM product been affected by AI?

zouhair15d ago

Some years of AI Technical Debt will be something to behold.

dlev_pika16d ago

A few days ago, after some very weird failed purchase attempts I made (payment couldn’t be validated or Smth) I received an even weirder mail from Amazon saying they had detected suspicious activity, all my devices got logged out and I was forced to change my password. I did it, after verifying it was a legit email (even if it looked sketchy af, pure text, unstyled, but sender verified and confirmed with in-app behavior), and next I know all my orders and browsing history had disappeared - +15 yrs of history, done.

Over the next few days my account history came back, except purchases made Q1 2026. Those are still missing. There are a few substantial purchases I made that are nowhere to be found anymore.

I attributed this Iranian missiles hitting some of their infrastructure in EU, as it had been reported.

Now I am not sure if it was blast radius from missiles or AI mishaps. Lmao - couldn’t happen to a worse company…

moomoo1115d ago

If you don't use ~crypto~ AI you will go broke!

jwpapi15d ago

How much of damage is 6 hours offline for Amazon?

skeledrew16d ago

> the affected tool served customers in mainland China

Thought this blurb most interesting. What's the between-lines subtext here? Are they deliberately serving something they know to be faulty to the Chinese? Or is it the case that the Chinese use it with little to no issue/complaint? Or...?

oxqbldpxo16d ago

Not fun to work at amazon.com it seems.

jacknews15d ago

This looks like a blame allocation exercise to me.

The seniors will now be directly responsible for all the AI slop that goes in. But how can they possibly properly review reams of code to a sufficient degree they can personally vouch for it?

jmspring15d ago

"After outages due to outsourcing the economically convenient developers with no skin in what your building or care, company X requires all senior engineers to review all code from outsourcing company".

teeray15d ago

> Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off

So what incentive is there for juniors to look at the code at all? Seniors are now just another CI stage for their slop to pass.

mattschaller16d ago

Anyone work with Kiro before? As I understood, it was held as an INTERNAL USE ONLY tool for much longer than expected.

daheza16d ago

I used Kiro IDE and really liked it. The all you can eat model of LLM usage is very tempting compared to say Cursor. The features in the editor are basically the same.

Haven't tried Kiro CLI.

riknos31415d ago

The technique of creating specs before implementation that Kiro embodies was used widely internally before Kiro's release, but as a (now former) employee I gained access to the Kiro tool at the same time as the public. Others may have had internal access earlier but I'm not aware of them.

wenc15d ago

I use Kiro IDE (≠ Kiro CLI) primarily as a spec generator.

In my experience, it's high-quality for creating and iterating on specs. Tools like Cursor are optimized for human-driven vibing -- they have great autocomplete, etc. Kiro, by contrast, is optimized around spec, which ironically has been the most effective approach I've found for driving agents.

I'd argue that Cursor, Antigravity, and similar tools are optimized for human steering, which explains their popularity, while Kiro is optimized for agent harnesses. That's also why it’s underused: it's quite opinionated, but very effective. Vibe-coding culture isn't sold on spec driven development (they think it's waterfall and summarily dismiss it -- even Yegge has this bias), so people tend to underrate it.

Kiro writes specs using structured formats like EARS and INCOSE. It performs automated reasoning to check for consistency, then generates a design document and task list from the spec -- similar to what Beads does. I usually spend a significant amount of time pressure-testing the spec before implementing (often hours to days), and it pays off. Writing a good, consistent spec is essentially the computer equivalent of "writing as a tool of thought" in practice.

Once the spec is tight, implementation tends to follow it closely. Kiro also generates property-based tests (PBTs) using Hypothesis in Python, inspired by Haskell's QuickCheck. These tests sweep the input domain and, when combined with traditional scenario-based unit tests, tend to produce code that adheres closely to the spec. I also add a small instruction "do red/green TDD" (I learned this from Simon Willison) and that one line alone improved the quality of all my tests.

Kiro can technically implement the task list itself, but this is where agents come in. With the spec in hand, I use multiple headless CLI agents in tmux (e.g., Kiro CLI, Claude Code) for implementation. The results have been very good. With a solid Kiro spec and task list, agents usually implement everything end-to-end without stopping -- I haven’t found a need for Ralph loops. (agents sometimes tend to stop mid way on Claude plans, but I've never had that happen with Kiro, not sure why, maybe it's the checklist, which includes PBT tests as gates).

Kiro didn't have the strongest start, but the Kiro IDE is one of the best spec generators I've used, and it integrates extremely well with agent-driven workflows.

softwaredoug15d ago

Getting junior / mid-level people to slop cannon PRs at seniors will just burn out seniors. The team might be better having fewer developers using AI more thoughtfully.

desireco4215d ago

So essentially they will be blamed, everything will stay the same.

I do consulting and use AI a lot. You just have to take responsibility for the code. We are delivering like never before, but have a lot of experience into how to do it as safe as possible. And we are learning along the way. They say you need a year to build up experience fyi.

I feel bad for those engineers who will have to sign off for things they will most likely not have enough time to review. Kiro is nice and all.

locopati15d ago

use AI!

no! not that way!

MDGeist16d ago

A former colleague of mine recently took a role that has largely turned out to be "greybeard that reviews the AI slop of the junior engineers". In theory it sounds workable, but the volume of slop makes thoughtful review impossible. Seems like most orgs will just put pressure on the slop generators to do more and put pressure on the approvers and then scape goat the slop approvers if necessary?

dboreham15d ago

This was always the case in the before times with humans. AI just pulls back the curtain on the delusion that code can and is being meaningful reviewed.

testbjjl15d ago

Then what’s the point of AI? Pay for the code gen, pay a human to review the code gen, when the senior can train a junior and coordinate output with their incentives and performance reviews, problems largely solved.

Seems to me too low level in everyone’s stack to not have humans doing the work, especially at this stage. But what do I know, I certainly am not at the helm of a multibillion dollar operation.

booleandilemma15d ago

To make the AI companies money and prop up the American economy, obviously.

xodn34815d ago

very expected outcomes.

luxuryballs15d ago

They weren’t already signing off on them? o.O

mikkupikku15d ago

lgtm

qxxx15d ago

so, seniors will review now the AI slop code.. I am also doing this task and reviewing this kind of code takes time as the code is often overengineered. Code works but will have potential bugs. I am not able to find every bug or implication quickly. But I am also using ai to review the ai slop lol, because why now. After that I am also quickly reviewing by myself.

rushabh15d ago

Unfortunately you can’t just yell at the AI so it learns never to do this again. Humans take such a large range of feedback that LLMs can’t.

adamzwasserman15d ago

I'm sure they are going to have a ball reading through thousands of lines of AI slop.

recallingmemory15d ago

.. So our jobs aren't going away?

HeavyStorm15d ago

This is so fucking ridiculous.

throw_m23933916d ago

Yet another example of vibe coding at scale. You'll have to hire a lot of seniors out of retirement to fix that mess of gigantic proportions... and don't blame "the juniors" for that, they didn't make the decision to allow those tools at first place.

10xDev16d ago

A lot of juniors only graduated using these tools. Good luck taking it away from them.

1 more reply

josefritzishere16d ago

The excessive exuberance of AI adoption is all part of the bubble.

1 more reply

10xDev16d ago

With AI it makes sense to have leaner teams. Being able to go faster requires greater responsibility.

letitgo1234516d ago

Worth noting that this is when they used Amazon's own AI product, not when using Claude Code or Codex.

andai15d ago

So the take-away here is maybe we should read the code that "we" wrote? :)

(Before injecting it into global infra...)

andsoitis16d ago

> Amazon’s website and shopping app went down for nearly six hours this month in an incident the company said involved an erroneous “software code deployment.” The outage left customers unable to complete transactions or access functions such as checking account details and product prices.

The environment breathed a little.

1 more reply

rubyrfranklin215d ago

We ran into something similar at heyvid.ai — shipped AI-generated code without a proper review gate and ended up with a subtle bug in our rendering pipeline that took the team a week to trace. Not catastrophic, but it seriously eroded trust in the tooling for a while. Amazon's approach makes total sense at their scale. The honest reality is that LLMs are great at producing plausible-looking code and genuinely bad at knowing when they're wrong. Senior sign-off isn't overhead — it's what makes AI-assisted development actually sustainable.

j / k navigate · click thread line to collapse

486 comments

cobolcomesback16d ago

This meeting happens literally every week, and has for years. Feels like the media is making a mountain out of a mole hill here.

davidclark16d ago

The article claims:

>He asked staff to attend the meeting, which is normally optional.

Is that false? It also discusses a new policy:

>Junior and mid-level engineers will now require more senior engineers to sign off any AI-assisted changes, Treadwell added.

Is that inaccurate? It is good context that this is a regularly scheduled meeting. But, regularly scheduled meetings can have newsworthy things happen at them.

djb_hackernews15d ago

My SVP asks me to do things all the time, indirectly. I do probably 5% of them.

5 more replies

skeeter202016d ago

1 more reply

the_arun15d ago

1 more reply

s3p15d ago

>>He asked staff to attend the meeting, which is normally optional. >Is that false?

Judging from the comment above, no, the meeting happens every week, and this week they were asked to attend.

cobolcomesback16d ago

It’s not false. But it’s also weaselly worded.

As for the second quote: senior engineers have always been required to sign off on changes from junior engineers. There’s nothing new there. And there is nothing specific to AI that was announced.

3 more replies

CoolGuySteve16d ago

It didn't seem to make the news but at least in NYC the entire Amazon storefront was broken all afternoon on Friday.

Items weren't displaying prices and it was impossible to add anything to your cart. It lasted from about 2pm to 5pm.

It's especially strange because if a computer glitch brought down a large retail competitor like Walmart I probably would have seen something even though their sales volume is lower.

kotaKat16d ago

m304715d ago

Sometimes you squeeze clay and it comes out the oddest places. There were other stressors last week.https://www.pcmag.com/news/amazon-cloud-services-disrupted-i...

malfist16d ago

Over the weekend I was trying to return a pair of shoes and get a different size and I kept getting 500s trying to go to the store page for the shoes.

1 more reply

belval16d ago

I am not in that specific meeting but it made me chuckle that a weekly ops meeting will somehow get media attention. It's been an Amazon thing forever. Wait until the public learns about CoEs!

cmiles7415d ago

A weekly ops meeting where they talk about ensuring PRs with AI contributions get extra scrutiny? I think that's significant news.

3 more replies

8note16d ago

id.expect COEs to be coming up with AI code action items though, not to have more thorough human checks

1 more reply

otterley16d ago

> Feels like the media is making a mountain out of a mole hill here.

That's been their job ever since cable news was invented.

ses198416d ago

It’s been a bit longer than that.

https://en.wikipedia.org/wiki/Yellow_journalism

It probably goes back as long as they have been shouting news in the town square in Rome or before that even.

2 more replies

groundzeros201515d ago

It’s always sobering to see a news story about something you have insider perspective on.

embedding-shape16d ago

> This meeting happens literally every week, and has for years. Feels like the media is making a mountain out of a mole hill here.

No one cares how often the meeting in general is held, or if it's mandatory or not.

skeeter202016d ago

>> Are you completely missing the point of the submission

2 more replies

furyofantares15d ago

This reply chain is confusing but I'm guessing got merged from another thread that had a different title?

Must have as the comments are hours older than OP.

cmiles816d ago

The core message of the article is that Amazon has been having issues with AI slop causing operational reliability concerns, and that seems to be 100% accurate.

coredog6415d ago

/with AI slop//

rahbert15d ago

This is correct. We ran them on Wednesday’s in Alexa. Jessy actually used to come and sit in ours once a quarter or so when he was running AWS.

age1mlclg615d ago

What has really happened is that those employees were made into "reverse centaurs":

https://www.theguardian.com/us-news/ng-interactive/2026/jan/...

Clent16d ago

Who is the media you're accusing here? This is a twitter post. As far as I can tell they do not work a media company.

What is worth being pointed out is how quickly people blame "The Media" for how people use, consume and spread information on social networks.

otterley16d ago

The source is not a Twitter post, it's a Financial Times article (that the poster failed to cite).

niwtsol16d ago

happytoexplain16d ago

>Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off

I.e. senior review is valuable, but it does not make bad code good.

ardeaver16d ago

VorpalWay15d ago

1 more reply

bluGill16d ago

> people don't get promoted for preventing issues.

they do - but only after a company has been burned hard. They also can be promoted for their area being enough better that everyone notices.

still the best way to a promotion is write a major bug that you can come in at the last moment and be the hero for fixing.

3 more replies

asdfman12315d ago

One of the major things code review does is prevent that one guy on your team who is sloppy or incompetent from messing up the codebase without singling him out.

If you told someone "I don't trust you, run all code by me first" it wouldn't go well. If you tell them "everyone's code gets reviewed" they're ok with it.

1 more reply

kqr15d ago

bloppe15d ago

1 more reply

8note16d ago

> The problem, I think, is people don't get promoted for preventing issues.

cleaning up structural issues across a couple orgs is a senior => principal promo ive seen a couple of times

wiseowise15d ago

> When I was really early in my career, a mentor told me that code review is not about catching bugs but spreading context (i.e. increasing bus factor.) Catching bugs is a side effect

This bs is what I say my juniors when I want them to fuck off with their reviews and focus on my actual work.

Sounds very insightful though.

marginalia_nu16d ago

locusofself16d ago

If you spend 5-15x the time reviewing what the LLM is doing, are you saving any time by using it?

6 more replies

rectang16d ago

> Expert reviews are just about the only thing that makes AI generated code viable

I disagree, in the sense that an engineer who knows how to work with LLMs can produce code which only needs light review.

* Work in small increments

* Explicitly instruct the LLM to make minimal changes

* Think through possible failure modes

* Build in error-checking and validation for those failure modes

* Write tests which exercise all paths

1 more reply

UncleMeat15d ago

"Seniors will do expert review" will slowly collapse.

jonnycoder16d ago

Skidaddle16d ago

Just lead with “You are an expert software engineer…”, easy!

raw_anon_111116d ago

I’ve had a job that requires coding for 30 years and before ther I was hobbyist and I’ve worked for from everything from 60 person startups to BigTech.

3 more replies

js816d ago

> requires an amount of time approaching the time spent if they had just done it themselves

bluGill16d ago

Sloppy code that has been around for a while works. It likely has support for edge cases you forgot about. Often the sloppyness is because of those edge cases.

1 more reply

ummonk15d ago

In fairness though, it does give you good practice for the essential skill of maintaining / improving an old codebase.

unshavedyak15d ago

Hell, often it feels slower/worse. Foreign code is easily confusing at first, which slows you down - and bad code quickly gets bewildering and sends you down paths of clarifications that waste time.

SchemaLoad15d ago

1 more reply

steveBK12316d ago

Right, code reviews should already have been happening with human written junior code.

If AI is a productivity boost and juniors are going to generate 10x the PRs, do you need 10x the seniors (expensive) or 1/10th the juniors (cost save).

A reminder that in many situations, pure code velocity was never the limiting factor.

AgentOrange123416d ago

sethops116d ago

This was challenging enough pre AI. Now that everybody has an AI slop button, the life of an effective code reviewer just got so much more miserable.

esafak15d ago

Make them first go through an AI reviewer that is informed by the code base's standards.

1 more reply

jetrink16d ago

Maybe I don't have the correct mental model for how the typical junior engineer thinks though. I never wanted to bug senior people and make demands on their time if I could help it.

devonbleak16d ago

4 more replies

onion2k16d ago

I.e. senior review is valuable, but it does not make bad code good.

I suspect that isn't the goal.

bs728016d ago

Whether or not these productivity gains are realized is another question, but spreadsheet based decision makers are going to try.

czscout16d ago

In this scenario, how might one become a senior without first being a junior? Seniors just pop into existence?

3 more replies

hintymad15d ago

> Review by a senior is one of the biggest "silver bullet" illusions managers suffer from

raw_anon_111116d ago

I’m probably not going to review a random website built by someone except for usability, requirements and security.

happytoexplain16d ago

I didn't restrict my opinion to genAI code. I'm expressing a general thought that was relevant before AI. AI is just salient in relation to it.

I also said senior review is valuable, but I'm not 100% sure if you're implying I didn't.

OrangeDelonge15d ago

I’ve seen hundreds of PR’s produced by a junior and reviewed by a mid lvl go into prod. I don’t see any problem with that

belval16d ago

qnleigh16d ago

I seriously doubt that they think senior reviewers will meticulously hunt down and fix all the AI bugs. Even if they could, they surely don't have the time. But it offers other benefits here:

1. They can assess whether the use of AI is appropriate without looking in detail. E.g. if the AI changed 1000 lines of code to fix a minor bug, or changed code that is essential for security.

2. To discourage AI use, because of the added friction.

kaffekaka14d ago

Point 1 is important. Seniors (or any developer really) with experience in the code base in question can judge pretty quickly if a CR seems reasonable.

zamalek15d ago

> Review by a senior is one of the biggest "silver bullet" illusions managers suffer from.

mrothroc15d ago

Amazon's instinct is right (add a gate), but the implementation is wrong (make it human). Automated checks first, humans for what's left.

yifanl16d ago

Senior reviews are useful, but as I understand it, Amazon has a fairly high turnover rate, so I wonder just how many seniors with deep knowledge of the codebase they could possibly have.

tartoran16d ago

From engineers are interchangeable to high turnover are decisions that the company took. The payback time always comes at some point.

grvdrm16d ago

What a statement at the end. You are absolutely right.

I hear “x tool doesn’t really work well” and then I immediately ask: “does someone know how to use it well?” The answer “yes” is infrequent. Even a yes is often a maybe.

I don’t many folks that do both well.

hnthrow028734516d ago

>requires an amount of time approaching the time spent if they had just done it themselves.

I would actually say having at least 2 people on any given work item should probably be the norm at Amazon's size if you also want to churn through people as Amazon does and also want quality.

Doing code reviews are not as highly valued in terms of incentives to the employees and it blocks them working on things they would get more compensation for.

lokar16d ago

The goal of Sr code review is not to make the code better, it's to make the author better.

skeeter202016d ago

1 more reply

sumeno15d ago

That's not going to work when the author is an LLM

strogonoff15d ago

I would argue that the amount of time needed for a proper review exceeds the amount of time needed to just do it yourself.

This is why I am not a fan of being the reviewer.

radiator16d ago

Deming's point 3 (of 14): Cease dependence on inspection to achieve quality. Eliminate the need for massive inspection by building quality into the product in the first place.

mrbonner16d ago

What stops the senior from using AI to review the AI generated code the junior published?

tartoran16d ago

That’s something that the junior can do. What companies want to do is put responsibility on someone who has more knowledge and skin in the game

femiagbabiaka16d ago

the outcome of the review isn't just that the code gets shipped, it's knowledge transfer from the senior engineer to the junior engineers that then creates more senior engineers

lionkor15d ago

> [...] requires an amount of time approaching the time spent if they had just done it themselves.

Yes, but with the caveat that the junior learns and eventually can become the senior.

rafaelmn15d ago

Eventually they'll get rid of juniors and mid level devs because realistically it's easier to review when you're the one doing the prompting.

remarkEon16d ago

Other than “don’t hire idiots”, what is the solution to this problem? I agree with you, and this particular systems management issue is not constrained to software.

happytoexplain15d ago

I don't know.

We need smart people at every layer. If leadership isn't in that category, it spreads to all layers.

I don't know how we defeat capitalism to incentivize smart leadership. It's fundamentally opposed to market forces.

rco878615d ago

Going to systemically turn off your senior staff over time also. Most Senior Engineers aren't that interested in doing even more code review.

mmcconnell161815d ago

1 more reply

yalogin15d ago

Don’t forget that this auto generated code will have subtle bugs and feels complete at the outset

munk-a15d ago

montymaty9d ago

dfasdsaasdasdasd

napolux16d ago

LGTM

RamblingCTO16d ago

Who said PR reviews need to solve all the things and result in proof against idiots?

So you're saying that peer reviews are a waste of time and only idiots would use/propose them?

happytoexplain16d ago

None of that, sorry if I wasn't clear.

33MHz-i48615d ago

mikert8915d ago

The thing is, this management philosophy worked when AWS knew what they needed to build and just needed to execute with top notch operations.

But now with AI, they are getting disrupted. Most AWS services might become obsolete, why does an ai need these janky higher levels abstractions AWS piles on.

So now they need innovation, but the company isn’t set up for it. They are forcing short deadlines for product launches that don’t matter

33MHz-i48615d ago

its not even AI. most of the cloud offerings are commodities now.

junior engineers are burned and churned on execution, seniors are project managers, principals just do high level reviews & high level fire fighting (note not actually leading the tech)

director and above just their spend time on “what to kill” or “who to fire” as priorities change every 6 months

prakhar89716d ago

From the amazon I know, people only care about a. not getting fired and b. promotions. For devs, the matrix looks like this:

1. Shipping: deliver tickets or be pipped.

2. Having Less comments on their PRs: for some drastically dumb reason, having a PR thoroughly reviewed is a sign of bad quality. L7 and above use this metric to Pip folks.

3. Docs: write docs, get them reviewed to show you're high level.

Without AI, an employee is worse off in all of the above compared to folks who will cheat to get ahead.

I can't see how "requesting" folks for forego their own self-preservation will work. especially when you've spent years pitting people against each other.

malfist16d ago

Not only is having too many comments on your PRs bad for you, but so is not leaving comments on other people's PRs. Both are metrics used

dude25071115d ago

I'd leave lots of comments out of spite whenever I would feel my PRs had been treated unfairly. If I am going down, you all are coming with me.

2 more replies

embedding-shape15d ago

> 2. Having Less comments on their PRs: for some drastically dumb reason, having a PR thoroughly reviewed

joeframbach15d ago

I've been involved in so many CRs where I've given feedback over 10 revs, then the submitter cancels the CR and files a new one, for the metrics.

tom_15d ago

If the review tooling is any good, getting the code somewhere it can see it is a convenient way for people to give and receive feedback. As the saying goes, the system is what it does!

(And/but yes/no, I have never worked at NAGFAM...)

ex-aws-dude15d ago

Eh I feel like there are some features where you just have to get in the weeds to even design it and the code review itself is part of the process of designing/figuring out the edge cases.

1 more reply

dboreham15d ago

4. Don't work in the corporate equivalent of The Hunger Games.

99990000099915d ago

At least in the past the idea is you do the dance , vest and leave.

I missed my FAANG chance during the good years. No retirement for me!

rk0615d ago

if someone responded with a lot of PR comments, I would set up a meeting directly and avoid unnecessary discussion on PR.

sdevonoes16d ago

Reviewing AI generated code at PR time is a bottleneck. It cancels most of the benefits senior leadership thinks AI offers (delivery speed).

The inbetweens are needed but they are a byproduct. Senior leadership doesn’t know this, though.

hard2416d ago

Indeed. My view as a CEO is, if you are still reviewing the code yourself then what use is it that you can produce a bunch of text at a faster rate?

tech_tuna13d ago

You should lay off your engineering team and do it all in Lovable amigo.

FromTheFirstIn14d ago

Where are you CEO?

1 more reply

beardedetim16d ago

It sounds like a piss poor deal for seniors unless senior engineer now means professional code reviewer.

malfist15d ago

That's amazon in a nutshell though. Create conflicting metrics for performance, push credit up and responsibility down, punish everyone below you for not meeting the double standards

1 more reply

rhubarbtree15d ago

bandrami15d ago

Possibly, but it doesn't make sense to restructure things in advance of that actually happening, particularly since there's no roadmap for getting there right now.

1 more reply

qnleigh16d ago

Surely they know all this. They're worried about AI code degrading codebase quality, so they're putting on the brakes.

radiator16d ago

> Senior leadership doesn’t know this, though.

Well, you'd think senior leadership should know how their business and their people work.

2 more replies

cmiles816d ago

News from the inside makes it sound like things are getting pretty bad.

the_biot15d ago

> The continuing mass departures of long tenured folks

You mean senior programmers that have been there for ages don't want to spend their time reviewing AI slop? Who'd a thunk it!

philip120915d ago

I think the deeper need is a "self-review" flow.

People push AI-reviewed code like they wrote it. In the past, "wrote it" implies "reviewed it." With AI, that's no longer true.

billbrown15d ago

Yes, underthinking is rampant. Glancing at "AI" output is not reviewing code: you have to grok it (in the Heinlein sense) in order to treat it as your own.

1 more reply

Tyr4215d ago

Heck, doing a self review when you wrote the code catches stuff like forgetting debug prints.

2 more replies

jeremyjh15d ago

kuekacang15d ago

paxys15d ago

If someone was confident enough to push through an AI change without even reading/reviewing it themselves adding more buttons to the UI isn't going to change anything.

1 more reply

cvak14d ago

TBH, I do PRs on repos with no other devs just do do self-review, and I did that before AI.

8note15d ago

the tooling doesnt make it easy currently.

koinedad15d ago

Self review is #1

1 more reply

ritlo16d ago

They're torn between "we want to fire 80% of you" and "... but if we don't give up quality/reliability, LLMs only save a little time, not a ton, so we can only fire like 5% of you max".

SoftTalker16d ago

So will it turn out that actually writing code was never the time sink in the first place?

3 more replies

hard2416d ago

My prediction is a concorde-like incident is going to shatter trust and make people re-think their expectations of the capabilities of LLMs and their abilities of the present.

Essentially something big has to happen that affects the revenue/trust of a large provider of goods, stemming from LLM-use.

They wont go away entirely. But this idea that they can displace engineers at a high-rate will.

2 more replies

petterroea15d ago

I personally know people looking to jump ship because they waste too much time at their current employer on this.

wiseowise15d ago

> Mentoring Juniors is an important part of the job and crucial service to the industry

Not really.

lokar16d ago

If this is true, it misunderstands the primary goals of code review.

Code review should not be (primarily) about catching serious errors. If there are always a lot of errors, you can’t catch most of them with review. If there are few it’s not the best use of time.

These goals help you reduce the number of errors going into the review process, this should be the actual goal.

rossdavidh15d ago

1 more reply

827a15d ago

We love this for Amazon, they're a very strong company making bold decisions.

znpy15d ago

"Make senior engineer sign off ai-assisted changes" sounds incredibly weird.

Then again, as a former amazon (aws) engineer: this is just not going to work. Depending how you define "senior engineer" (L5? L6? L7?) this is less and less feasible.

L5 engineers are already supposed to work pretty much autonomously, maybe with L6 sign-off when changes are a bit large in scope.

In all this, considering the low morale (layoffs), the reduced headcount (layoffs) and the rise in expectations (engineers trying harder to stay afloat[2] due to... layoffs)... It's a dire situation.

I'm going to tell you, this stinks A LOT like rotting day 2 mindset.

----

1. keep in mind you can't, in general, determine the absence of bugs

2. Also cranking out WAY MUCH MORE code due to having gen-ai tools at their fingertips...

paxys15d ago

booleandilemma15d ago

The next thing these geniuses will think of will be to have AI review the diffs.

Herring15d ago

Check back in a year or two. Just the other day we had Claude finding significant bugs in Firefox. https://news.ycombinator.com/item?id=47273854

Lalabadie16d ago

I'm not sure the sustainable solution is to treat an excess of lower-quality code output as the fixed thing to work with, and operationalize around that, but sure.

gtowey16d ago

sethops116d ago

> The response for now? Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off.

So basically, kill the productivity of senior engineers, kill the ability for junior engineers to learn anything, and ensure those senior engineers hate their jobs.

Bold move, we'll see how that goes.

whateveracct16d ago

Juniors could just code things the old fashioned way. It isn't hard. And if they do find it too hard, they aren't cut out for this job.

3 more replies

dragonelite16d ago

Accelerate a person speed toward being burned out..

1 more reply

dude25071115d ago

But Amazon is something you tolerate for a year or two early in the career, before moving somewhere better (which is anywhere else)?

almostdeadguy16d ago

2 more replies

rglover15d ago

The amount of time and money being wasted chasing this dragon is unreal.

dana32115d ago

Its useful but it makes wrong assumptions, not checking the code is essentially gambling.

alexyz1215d ago

there's nothing else left to chase apparently

ndr42OP16d ago

AlotOfReading16d ago

I'm not surprised by the outages, but I am surprised that they're leaning into human code review as a solution rather than a neverending succession of LLM PR reviewers.

I wonder if it's an early step towards an apprenticeship system.

bilbo0s16d ago

You shouldn't be surprised.

How else would they train the LLM PR reviewers to their standards?

monarchwadia16d ago

Interesting. How would it be an early step towards an apprenticeship system?

mentos15d ago

What are we going to do about software for critical infrastructure in the coming decade?

Feels inevitable that code for aviation will slowly rot from the same forces at play but with lethal results.

rhubarbtree15d ago

It will be written much the same as now, but using AI to improve quality through code inspection etc.

Just because nearly all software is going to be written by AI, does not mean critical infrastructure will be.

1 more reply

franktankbank15d ago

Cabin in the woods.

AlexeyBrin16d ago

MichaelRo15d ago

>> I wonder how this will work in practice. Say I'm a senior engineer and I produce myself thousands of lines of code per day with the help of LLMs as mandated by the company.

hrmtst9383715d ago

1 more reply

quantified16d ago

zcw10016d ago

agoodusername6315d ago

Because he doesn’t.

The impression I get from SWEs I’ve met throughout my life is that most of them don’t actually care about their job. They got in because it paid well and demand was plentiful.

1 more reply

CodingJeebus16d ago

I'm at a small company struggling with this problem. Fundamentally, we have a limited context and AI is capable of generating tremendous amounts of output that exceed our ability to deeply process.

I find myself context-switching all the time and it's pretty exhausting, while also finding that I'm not retaining as much deep application domain knowledge as I used to.

julienchastang16d ago

> best practices and safeguards are not yet fully established

danjl15d ago

The specs and plans are actually better for sharing context with the rest of the team than a traditional review process.

captainkrtek15d ago

One challenge with code review as an antidote to poor quality gen-AI code, is that we largely see only the code itself, not the process or inputs.

In the pre-gen-AI days, if an engineer put up a PR, it implied (somewhat) they wrote their code, reviewed it implicitly as they wrote it, and made choices (ie: why is this the best approach).

throwaw1216d ago

If Seniors are going to review every GenAI generated code, how do they keep up with the volume of changes?

So you have 2 systems of engineers: Sr- and Sr+

1. Both should write code to justify their work and impact

2. Sr- code must be reviewed by Sr+

What happens:

a. Sr+ output drops because review takes their time more and more

b. Sr+ just blindly accepts because of the volume is too high, and they should also do their own work

c. Sr+ asks Sr- to slow-down, then Sr- can get bad reviews for the output, because on average Sr+ will produce more code

I think (b) will happen

dude25071116d ago

I knew this would happen.

Take a perfectly productive senior developer and instead make him be responsible for output of a bunch of AI juniors with the expectation of 10x output.

frogperson16d ago

makes me want to vomit. I am not spending more time reviewing code than the "author" spent creating it. Ill just leave the industry if that happens.

1 more reply

fmajid15d ago

8note15d ago

Id maybe consider spinning out a different job role for like "review engineer" who's not busy making strategic decisions and near term planning, just making sure the code is actually good

still within the engineering IC role, but on a different track

2 more replies

LogicFailsMe16d ago

And from their sagely reviews, we shall train a large language model to ultimately replace them because the most fungible thing at Amazon is the leadership.

VorpalWay15d ago

shepherdjerred15d ago

Code review is mandatory at Amazon

sizzzzlerz15d ago

Who fixes code that gets rejected? Do you simply try again and hope or does someone go into this computer-generated code that they didn't write and do the equivalent of battlefield triage?

And what are they going to do when they've fired all the senior engineers because they make too much money, leaving just juniors and AI?

rhubarbtree15d ago

AI will fix it, same way AI wrote it. At the behest of a human.

When they fire everyone, juniors will fix it with AI.

This is in general. I wouldn’t recommend this at critical services like AWS.

1 more reply

Insanity16d ago

It's only going to get worse with the brain drain as a result of the layoffs. Which will increase the use of AI assisted coding and increase the number of outages related to this.

Imagine having to debug code that caused an outage when 80% is written by an LLM and you now have to start actually figuring out the codebase at 2am.. :)

8note15d ago

but thats what it was like when i started at amazon in 2016?

i think the team i was on was a bit of an outlier in terms of owning 40 dumptser fires at once, and the first time reading any one of them was at 2AM because it was down.

1 more reply

tcbrah16d ago

PessimalDecimal15d ago

Why can't they?

1 more reply

daxfohl15d ago

iLoveOncall15d ago

You vastly underestimate the complexity of systems in a company like Amazon.

COEs and Operation Readiness Reviews are already the documents that you mention, but they are largely useless in preventing incidents.

1 more reply

8note15d ago

code review is too late to give some of that feedback, and design/requirements documents dont have nearly the standardization of presentation and feedback tools for that to be useable.

Amazon does have those things, and has fine tuning on models based on those postmortems.

Noisy reviews are also a problem causer. the PR doesnt know what scale a chunk of code is running at, without having access to 20 more packages and other details.

kmg_finfolio16d ago

dragonelite16d ago

Expect a shitload of AI powered code review products the next 18 months.

daheza16d ago

Create the problem and then create the solution.

2 more replies

gdulli16d ago

"Why don't they just make the plane out of the black box?"

1 more reply

hard2416d ago

This is incredibly circular lol...

booleandilemma15d ago

Emphasis on shit.

AlexeyBrin16d ago

You mean like what Anthropic announced yesterday ? Code Review can review your code for $15 - $25 per review.

So now, you can speed up using Claude Code and use Code Review to keep it in check.

mancerayder15d ago

https://www.aboutamazon.com/news/aws/aws-service-outage-ai-b...

mhogers16d ago

.agentignore/.agentnotallowed file

force agents to not touch mission critical things, fail in CI otherwise

let it work on frontends and things at the frontier of the dependency tree, where it is worth the risk

readthemanual16d ago

1 more reply

newobj15d ago

Imagine if the #1 problem of your woodworking shop is staff injuries, and the solution that management foists on you is higher RPM lathes.

butILoveLife16d ago

Maybe its my 1 buddy that works at amazon, but they seemed extremely slow to adopt LLMs. Big ships take a long time to turn, but this seemed hostile.

lmc16d ago

LLMs have been garbage for real work until very recently. Doesn't this show they were adopted too soon at amazon?

2 more replies

tracerbulletx15d ago

fredgrott16d ago

Curious question, how many Amazon Engineers flunk basic CS?

If you know CS you know two things:

1. AI can not judge code either noise or signal, AI cannot tell. 2. CS-wise we use statistic analysis to judge good code from bad.

How much time does it take to take AI output and run the basic statistic tools for most computer languages?

Some juniors need firing outright

8note15d ago

that isnt computer science at all though? if it is, that would be phd research topics, rather than than basics.

maybe as software engineering topics, but thats a different discipline

wiseowise15d ago

“Haha, sure man. CSI, great tv series. Look, just review Jeff’s PR and move that button down, okay? Have a tennis lesson, kbye”

sailfast15d ago

I anticipate they will fix this by adding better AI evaluation tools that work better to test their infra and changes.

In the meantime they will be quite a bit slower I’d imagine.

Also wonder if those seniors will ever get to actually do any engineering themselves now that they’re the bottleneck. :)

d0mine14d ago

16000 layoffs cost just $180 million—it is a win so far

https://x.com/gothburz/status/2031778265958842541

rvz15d ago

Hope this happens at GitHub since there are constant outages on the entire platform.

smy2001116d ago

vetrom15d ago

AI seems to be the whipping boy, but to me, it really seems more of a symptom than a cause. At its root, isn't this an issue of a decline in critical thinking?

I do think AI adoption exacerbates said falloff.

fud10115d ago

This is what humans will become, on call to take the blame for AI. It will be less about skill and confidence and more about being on the hook to take the fall for when things go wrong.

AlexandrB16d ago

"We want you to use AI for everything!"

"No, not like that though!"

m3kw915d ago

A year later, they will require AI to sign off engineer changes.

dedoussis16d ago

How do they determine whether a PR is AI-assisted and therefore requires senior review? A junior engineer could still copy-paste AI-generated code and claim it as their own.

emotiveengine16d ago

Right? If they're using some sort of tool, there's always another tool to fool the tool.

PeterStuer15d ago

There's going to be some serious acceleration in senior engineer burnout. Maybe they can use more genAI to support their increased workload.

ChrisArchitect15d ago

Previously: https://news.ycombinator.com/item?id=47319273

nickvec15d ago

So Amazon senior SWEs now have to review every single PR for all intents and purposes? I didn't think Amazon could get worse.

bigbuppo16d ago

Ugh. The Great Oops has never been closer.

joeyguerra15d ago

I’m wonder how many sr engineers are going to quit because they don’t want to read a bunch of code?

monster_truck15d ago

Have they tried simply not writing bugs? I've found that works best for me personally

camillomiller15d ago

varenc15d ago

digression: the long twitter urls make this entire page wider and the text smaller on iOS for me. Feels like a minor bug. Maybe a `overflow-wrap: anywhere` CSS rules needs to be added to URLs.

th2o34i343289716d ago

First Microsoft and now Amazon (eg. their RufusAI is useless compared to the old comment search!)

Has Seattle now become the code-slop capital ? Or is SFO still on top ?

secondcoming15d ago

Has Amazon's advertising TAM product been affected by AI?

zouhair15d ago

Some years of AI Technical Debt will be something to behold.

dlev_pika16d ago

Over the next few days my account history came back, except purchases made Q1 2026. Those are still missing. There are a few substantial purchases I made that are nowhere to be found anymore.

I attributed this Iranian missiles hitting some of their infrastructure in EU, as it had been reported.

Now I am not sure if it was blast radius from missiles or AI mishaps. Lmao - couldn’t happen to a worse company…

moomoo1115d ago

If you don't use ~crypto~ AI you will go broke!

jwpapi15d ago

How much of damage is 6 hours offline for Amazon?

skeledrew16d ago

> the affected tool served customers in mainland China

oxqbldpxo16d ago

Not fun to work at amazon.com it seems.

jacknews15d ago

This looks like a blame allocation exercise to me.

The seniors will now be directly responsible for all the AI slop that goes in. But how can they possibly properly review reams of code to a sufficient degree they can personally vouch for it?

jmspring15d ago

teeray15d ago

> Junior and mid-level engineers can no longer push AI-assisted code without a senior signing off

So what incentive is there for juniors to look at the code at all? Seniors are now just another CI stage for their slop to pass.

mattschaller16d ago

Anyone work with Kiro before? As I understood, it was held as an INTERNAL USE ONLY tool for much longer than expected.

daheza16d ago

I used Kiro IDE and really liked it. The all you can eat model of LLM usage is very tempting compared to say Cursor. The features in the editor are basically the same.

Haven't tried Kiro CLI.

riknos31415d ago

wenc15d ago

I use Kiro IDE (≠ Kiro CLI) primarily as a spec generator.

Kiro didn't have the strongest start, but the Kiro IDE is one of the best spec generators I've used, and it integrates extremely well with agent-driven workflows.

softwaredoug15d ago

Getting junior / mid-level people to slop cannon PRs at seniors will just burn out seniors. The team might be better having fewer developers using AI more thoughtfully.

desireco4215d ago

So essentially they will be blamed, everything will stay the same.

I feel bad for those engineers who will have to sign off for things they will most likely not have enough time to review. Kiro is nice and all.

locopati15d ago

use AI!

no! not that way!

MDGeist16d ago

dboreham15d ago

This was always the case in the before times with humans. AI just pulls back the curtain on the delusion that code can and is being meaningful reviewed.

testbjjl15d ago

Seems to me too low level in everyone’s stack to not have humans doing the work, especially at this stage. But what do I know, I certainly am not at the helm of a multibillion dollar operation.

booleandilemma15d ago

To make the AI companies money and prop up the American economy, obviously.

xodn34815d ago

very expected outcomes.

luxuryballs15d ago

They weren’t already signing off on them? o.O

mikkupikku15d ago

lgtm

qxxx15d ago

rushabh15d ago

Unfortunately you can’t just yell at the AI so it learns never to do this again. Humans take such a large range of feedback that LLMs can’t.

adamzwasserman15d ago