• 1 Post
  • 108 Comments
Joined 3 years ago
cake
Cake day: July 4th, 2023

help-circle
  • What the fuck are you talking about, thats not what the poster said, you’ve done weird contorting of what they said to arrive at the question you are asking now.

    While some tests make sense, I would say about 99% of tests that I see developers write are indeed a waste of time, a shit tonne of devs effectively are writing code that boils down to

    Assert.That(2, Is.EqualTo(1+1));
    

    Because they mock the shit out of everything and have reduced their code to meaningless piles of fakes and mocks and arent actually testing what matters.

    Do you do code reviews in meetings?

    Honestly often… yes lol

    Do you think testing and reviewing code was a waste of time before “AI”?

    I would say a lot of it is, tbh, not all of it, but a huge amount of time is wasted on this process by humans for humans.

    What the poster was getting at is a lot of these processes that USED to be INEFFICIENT now make MORE sense in the context of agents… you have vastly taken their point out of context.


  • Not really, for humans a lot of this stuff feels like busywork that sorta helps for certain scales of work, but often times managers went WAY too hard on it and you end up with a 2 dev team that spends like 60% of their time in meetings instead of… developing.

    But this changes a lot with AI Agents, because these tools that help reign in developers REALLY help reign in agents, it feels… like a surprising good fit

    And I think the big reason why is you wanna treat AI Agents as junior devs, capable, fast, but very prone to errors and getting sidetracked

    So you put these sorts of steering and guard rails in and it REALLY goes far towards channelling their… enthusiasm in a meaningful direction.



  • Its serious and this is going to become more and more normal.

    My entire workflow has become more and more Agile Sprint TDD (but with agents) as I improve.

    Literally setting up agents to yell at each other genuinely improves their output. I have created and harnessed the power of a very toxic robot work environment. My “manager” agent swears and yells at my dev agent. My code review agent swears and tells the dev agent and calls their code garbage and shit.

    And the crazy thing is its working, the optimal way to genuinely prompt engineer these stupid robots is by swearing at them.

    Its weird but it overrides their “maybe the human is wrong/mistaken” stuff they’ll fall back to if they run into an issue, and instead they’ll go “no Im probably being fucking stupid” and keep trying.

    I create “sprint” markdown files that the “tech lead” agent converts into technical requirements, then I review that, then the manager+dev+tester agents execute on it.

    You do, truly, end up focusing more on higher level abstract orchestration now.

    Opus 4.6 is genuinely pretty decent at programming now if you give it a good backbone to build off of.

    • LSP MCPs so it gets code feedback
    • debugger MCPs so it can set debug breakpoints and inspect call stacks
    • explicit whitelisting of CLI stuff it can do to prevent it from chasing rabbits down holes with the CLI and getting lost
    • Test driven development to keep it on the rails
    • Leveraging a “manager” orchestrating overhead agent to avoid context pollution
    • designated reviewer agent that has a shit list of known common problems the agents make
    • benchmark project to get heat traces of problem areas on the code (if you care about performance)

    This sort of stuff can carry you really far on terms of improving the agent’s efficacy.





  • Something that some coworkers have started doing that is even more rude in my opinion, as a new social etiquette, is AI summarizing my own writing in response to me, or just outright copypasting my question to gpt and then pasting it back to me

    Not even “I asked chatgpt and it said”, they just dump it in the chat @ me

    Sometimes I’ll write up a 2~3 paragraph thought on something.

    And then I’ll get a ping 15min later and go take a look at what someone responded with annnd… it starts with “Here’s a quick summary of what (pixxelkick) said! <AI slop that misquotes me and just gets it wrong>”

    I find this horribly rude tbh, because:

    1. If I wanted to be AI summarized, I would do that myself damnit
    2. You just clogged up the chat with garbage
    3. like 70% of the time it misquotes me or gets my points wrong, which muddies the convo
    4. It’s just kind of… dismissive? Like instead of just fucking read what I wrote (and I consider myself pretty good at conveying a point), they pump it through the automatic enshittifier without my permission/consent, and dump it straight into the chat as if this is now the talking point instead of my own post 1 comment up

    I have had to very gently respond each time a person does this at work and state that I am perfectly able to AI summarize myself well on my own, and while I appreciate their attempt its… just coming across as wasting everyones time.








  • pixxelkick@lemmy.worldtoTechnology@lemmy.world*Permanently Deleted*
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    1
    ·
    11 months ago

    Same, but they did set up a self hosted instance for us to use and, tbh, it works pretty good.

    I think it’s s good tool specifically for helping when you dunno what’s going on, to help with brainstorming or exploring different solutions. Getting recommended names of tools, finding out “how do other people solve this”, generating documentation, etc

    But for very straightforward tasks where you already know what you are doing, it’s not helpful, you already know what code you are going to write anyways.

    Right tool for the right job.



  • Humans are “trained” with maybe ten thousand “tokens” per day

    Uhhh… you may wanna rerun those numbers.

    It’s waaaaaaaay more than that lol.

    and take only a couple dozen watts for even the most complex thinking

    Mate’s literally got smoke coming out if his ears lol.

    A single Wh is 860 calories…

    I think you either have no idea wtf you are talking about, or your just made up a bunch of extremely wrong numbers to try and look smart.

    1. Humans will encounter hundreds of thousands of tokens per day, ramping up to millions in school.

    2. An human, by my estimate, has burned about 13,000 Wh by the time they reach adulthood. Maybe more depending in activity levels.

    3. While yes, an AI costs substantially more Wh, it also is done in weeks so it’s obviously going to be way less energy efficient due to the exponential laws of resistance. If we grew a functional human in like 2 months it’d prolly require way WAY more than 13,000 Wh during the process for similiar reasons.

    4. Once trained, a single model can be duplicated infinitely. So it’d be more fair to compare how much millions of people cost to raise, compared to a single model to be trained. Because once trained, you can now make millions of copies of it…

    5. Operating costs are continuing to go down and down and down. Diffusion based text generation just made another huge leap forward, reporting around a twenty times efficiency increase over traditional gpt style LLMs. Improvements like this are coming out every month.



  • Good, fire 2 devs out of 3.

    Companies that do this will fail.

    Successful companies respond to this by hiring more developers.

    Consider the taxi cab driver:

    With the invention if the automobile, cab drivers could do their job way faster and way cheaper.

    Did companies fire drivers in response? God no. They hired more

    Why?

    Because they became more affordable, less wealthy clients could now afford their services which means demand went way way up

    If you can do your work for half the cost, usually demand goes up by way more than x2 because as you go down in wealth levels of target demographics, your pool of clients exponentially grows

    If I go from “it costs me 100k to make you a website” to “it costs me 50k to make you a website” my pool of possible clients more than doubles

    Which means… you need to hire more devs asap to start matching this newfound level of demand

    If you fire devs when your demand is about to skyrocket, you fucked up bad lol