• Mearcfara@lemmy.ml
    link
    fedilink
    English
    arrow-up
    9
    ·
    9 hours ago

    I just wish we could invest the time/money/resources into compressing AI and making it smaller and more efficient. I’d so much rather have a somewhat capable AI that can be run locally and offline, to outsource menial tasks to like alphabetizing spreadsheets and so basic image modification, than to have to upgrade my hardware constantly or use cloud based SaaS and/or have newer models that are more accurate in their predictions.

    Of course that assumes a lot of things, like the intent to help people and not make money. Maybe someone in the Linux-sphere will make something.

    • HubertManne@piefed.social
      link
      fedilink
      English
      arrow-up
      1
      ·
      34 minutes ago

      I would like to see one integrated into a gnu os like linux where its only capability is to understand the os and guide you through it. No generation and no expertise outside the os exosystem. Maybe allow for it to be given the privelege to search the web. I would have it have capability to use other ais to perform other tasks so modules or whatnot could be added to give it more capability as a general computer butler type. Basically an os that acted like a start trek computer.

    • petersr@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      6 hours ago

      If I understand correctly, if we actually said “this model is great, let’s put a pin in it”, then it could be turned into a dedicated chip that would be much more efficient and perhaps even something that could get embedded in consumer hardware - but then you are just stuck with that model instead of “the next shiny new model” that they keep making.

    • BrightCandle@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      7 hours ago

      I feel like there is a future of more targeted AI. At the moment something that does spreadsheets has to carry knowledge of programming and chemistry and lots of languages and this seems very heavy for what ultimately we need. A programming language focussed AT dedicated to Rust or Go or Java could potentially be quite a bit smaller especially if they focussed on algorithm snippet and auto complete smarts. There is definitely a market for smaller more targeted uses than these all encompassing chat bots where the goal is to move the state of the art on for existing algorithms.

    • ZephyrXero@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      7 hours ago

      There are efforts there. The new Deepseek 4 compresses a lot of its knowledge using something they call engrams. But it’s unfortunately still too big for a consumer GPU.

      Gemma 4 is small enough to run on your cellphone.

      If your GPU has at least 8GB there are a lot of options for self hosting your own local models