After outages, Amazon to make senior engineers sign off on AI-assisted changes

Tony Bark@pawb.social · 5 months ago

After outages, Amazon to make senior engineers sign off on AI-assisted changes

Otter@lemmy.ca · 5 months ago

…

Do the senior engineers NOT sign off on changes to systems that can take down the production servers? Even if we take out the LLM created code, this sounds like a bigger problem

pageflight@piefed.social · 5 months ago

We may start to see people realize that “have the AI generate slop, humans will catch the mistakes” actually is different from “have humans generate robust code.”

🌞 Alexander Daychilde 🌞@lemmy.world · 5 months ago

Not only that, but writing code is so much easier than understanding code you didn’t write. Seems like either you need to be able to trust the AI code, or you’re probably better of writing it yourself. Maybe there’s some simple yet tedious stuff, but it has to be simple enough to understand and verify faster than you could write it. Or maybe run code through AI to check for bugs and check out any bugs it finds…

I definitely have trusted AI to write miniature pointless little projects - like a little PHP page that loaded music for the current directory and showed a simple JS player in a webpage so I could share Christmas music with my family and friends. No database, no file uploading or anything. It worked decently, although not perfectly, and that’s all it needed to do.

slaacaa@lemmy.world · edit-2 4 months ago

This is true not just with code, but with many types of complex outputs. Going through and fixing somebody’s horrible excel model is much worse than building a good one yourself. And if the quality is really bad, it’s also just faster to do it yourself from scratch.

Mirror Giraffe@piefed.social · 4 months ago

I’ve been writing a slightly larger project with frontend, bff and backend and I need to take it in small batches so that I can catch when it misunderstands or outright does a piss job of implementing something. I’ve been focusing a lot on getting all the unit tests I need in place which makes me feel a bunch better.

The bigger and more complex the projects get, the harder it is for the LLM to keep stuff in context which means I’ll have to improve my chunking out smaller scoped implementations or start writing code myself I think.

All in all I feel pretty safe with my project and pleased with the agents work but I need to increase testing further before bringing anything live.

🌞 Alexander Daychilde 🌞@lemmy.world · 4 months ago

Security testing will be the most important.

I’ve done a couple of tiny projects that I didn’t feel like coding. So far, I have not been terribly impressed. Well, it is impressive that it can make something functional at all, and in one case, what it made was fine enough to use as the temporary project it was intended (sharing christmas music with friends/family - reading files from a directory and writing a javascript player to play them in a shuffled order).

In the other case, replicating a simple text-based old DOS game with simple rules (think a space-based game around the complexity of checkers or so), it failed to think of so many things that while it did what I told it for the most part, it wasn’t a playable game. It was close, and fun enough for a nostalgic moment, but I had to work with it on logic like “If two fleets of ships arrive at the same planet in the same turn, you have to see how the first battle goes. If the first battle captures the planet, the second fleet is not attacking the first fleet’s ships - we won the planet at that point”. Very simple concepts that sure, you’d have to think of as a programmer, but if you were telling another person about how the game should work, were things I felt another person would think about.

I hope AI works well for you. Anywhere security it needed like database sanitation or user credentials… I hope you test thoroughly and I hope you can tell it enough to remind it to implement things like sanitation and other safety measures. An app can certainly appear to be working, but give many many fronts for attack. That’s my main worry with AI code. I worry enough on the little projects I do if I’m being secure enough myself.

Mirror Giraffe@piefed.social · 4 months ago

Yeah I hope I am cautious enough. I use strict db models that were man written and have type checking and sanitation. That along with unit tests that cover everything I’ve been able to think of that can go right or wrong combined with the classic “obscurity===security” motto.

Of course there are always vectors one hasn’t thought of, but that goes for man made projects as well. If I decide to bring it live and scale up I’ll probably order a pen test.

🌞 Alexander Daychilde 🌞@lemmy.world · 4 months ago

Sounds like you’re 1) thinking about it and certainly 2) doing way the fuck more than most utilizing AI.

My approval means quite little, but you have it anyway <3

Mirror Giraffe@piefed.social · 4 months ago

❤️

Hupf@feddit.org · 4 months ago

Yeah, initially writing the code never was the time sink.

somethingsnappy@lemmy.world · 4 months ago

*LLM

PattyMcB@lemmy.world · 5 months ago

I guarantee there’s so much pressure on those engineers to deliver code that they rubber stamp a ton of it with the intention of “fixing it later”

Source: I’ve worked in software for 20+ years and know a lot of folks working for and who have worked for Amazon

PabloSexcrowbar@piefed.social · 5 months ago

That’s basically the story at all the big tech companies, from what I’ve heard. In my time at Facebook, I felt like the only person who actually read the merge requests that people sent me before hitting it with “LGTM”

tal@lemmy.today · edit-2 5 months ago

If companies are going to place increasing reliance on review due to having lower-quality submissions, then they should probably evaluate employees weighting review quality (say, oh, rate of bugs subsequently discovered in reviewed commits or something like that).

ragas@lemmy.ml · 4 months ago

Sure. i’ll review your code favourably if you do the same with mine.

That is also a way to get no bugs at all.

criss_cross@lemmy.world · 5 months ago

When I worked there 20% of the work we had to do had to go through a senior engineer. And getting his time was like pulling teeth.

More of the time he would just nitpick grammar in docs and then finally rubber stamp work. It was awful.

mrgoosmoos@lemmy.ca · 5 months ago

the way private companies work is that they require their employees to produce more than is reasonable given the work quality that is expected.

when this discrepancy is pointed out, it’s handwaved away. when the discrepancy results in problems, as it most obviously will, somebody is found to place the blame on.

it’s not the developer’s faults. it’s a management decision.

source: I’m talking out of my ass I’m just a salty employee who is seeing this happen at their own workplace when it didn’t used to, at least not to this level