Opensource AI Must Win

lemmydividebyzero@reddthat.com · 1 month ago

Opensource AI Must Win

Mearcfara@lemmy.ml · 1 month ago

I just wish we could invest the time/money/resources into compressing AI and making it smaller and more efficient. I’d so much rather have a somewhat capable AI that can be run locally and offline, to outsource menial tasks to like alphabetizing spreadsheets and so basic image modification, than to have to upgrade my hardware constantly or use cloud based SaaS and/or have newer models that are more accurate in their predictions.

Of course that assumes a lot of things, like the intent to help people and not make money. Maybe someone in the Linux-sphere will make something.

ZephyrXero@lemmy.world · 1 month ago

There are efforts there. The new Deepseek 4 compresses a lot of its knowledge using something they call engrams. But it’s unfortunately still too big for a consumer GPU.

Gemma 4 is small enough to run on your cellphone.

If your GPU has at least 8GB there are a lot of options for self hosting your own local models

nforminvasion@lemmy.world · 1 month ago

Look into Bonsai Ternary models. They’re “1.5” bit models that have to be trained that way (so no taking a full model and quantizing it down) but they are so efficient and they can run on CPU only, though it’s a bit alpha at the moment. Really cool company and projects.

You have to create a specific environment for them though, using Bonsai’s GGUF version which enables them to run properly. So unfortunately, no use in LM Studio yet.

HubertManne@piefed.social · 1 month ago

I would like to see one integrated into a gnu os like linux where its only capability is to understand the os and guide you through it. No generation and no expertise outside the os exosystem. Maybe allow for it to be given the privelege to search the web. I would have it have capability to use other ais to perform other tasks so modules or whatnot could be added to give it more capability as a general computer butler type. Basically an os that acted like a start trek computer.

SilentKnightOwl@slrpnk.net · 1 month ago

Using Pi agent with qwen 3.6 35b a3b running with llamacpp on my GPU feels a lot like that. I have a script that watches my downloads folder and keeps it organized, and it used to just get the file extensions and move things based on type, now with a local llm in the loop, it moves things based on what it is, and what it is for. If I download a PDF file from work, it automatically reads the first page, figures out what its about, and moves it to my work documents.

“Whichever port that docker container is on, make it this one”

MangoCats@feddit.it · 1 month ago

They’re really good at digging for stuff, like: this app is reporting the git hash it was built from - somewhere in the log files - go read that and show me which branch that hash appears on (hash is 8 commits back in some branch…) Yeah, I could do that myself, but why would I if I don’t have to?

dil@lemmy.zip · 1 month ago

I want clippy but actually useful with all software, just giving tips when needed, ai can be useful sometimes, idk like im bad at math always have been, I need to sort some curves by index recentlly and it helped with the math logic a lot, otherwise I was using a repeat node and it was a lot slower than the way it showed me. Downside ofc was the ai way isn’t fully accurate or implementable as they say, has to be modified, it makes up nodes that don’t exist, but there are similar ones.

MangoCats@feddit.it · 1 month ago

Increasingly, people ask me questions, send me screen shots, I copy-paste that into gpt, gpt’s answers are helpful and correct… they have access to the same (free to use) gpt themselves…

dil@lemmy.zip · 1 month ago

Please don’t compare how I use AI to how you do, I hope I never ask you a question and trust you like I would a human

MangoCats@feddit.it · 1 month ago

Well, when the question is: why isn’t my server access working, and the result from gpt gets their server access working… I hope you can trust a result like that?

dil@lemmy.zip · 1 month ago

People ask humans because they want to interact with humans, just say you don’t know and they’ll ask ai themselves, unnecessary middlemannimg for ego boost is weird

MangoCats@feddit.it · 1 month ago

It’s not that I don’t know, it’s that I’ve already answered their questions, in writing, if they would just read a half page of text and do what it says.

HubertManne@piefed.social · 1 month ago

When I ask a person a question im generally trying to get their perspective. Im likely asking a few people or will over time. Its honestly just a part of socializing and interacting as humans.

MangoCats@feddit.it · 1 month ago

I get questions like: why can’t I access this server, I followed the wiki page (first clue, they didn’t follow the wiki page). That’s not asking for insight, that’s asking for where they failed to follow a set of 5 step directions by doing things like: changing the default filename of their new ssh key to something they invented.

GPT explained, far more patiently than I would have, how indeed to do 4 more steps and rename your ssh key to anything you want, but I did offer the insight: if you just leave the name as the default value, you can skip all of this extra work.

HubertManne@piefed.social · 1 month ago

why are people asking you this. Do you mean at work?

MangoCats@feddit.it · 1 month ago

Yes, do you answer questions for money outside of work? Outside of work if somebody is asking me a question I assume they want my answer and I’ll give them that instead of looking something up, although sometimes I punt with an “I don’t know but I bet Google does…” Inside work I attempt to answer questions as correctly and efficiently as possible - the GPT tools are great at that.

HubertManne@piefed.social · 1 month ago

Okay. It was just not clear when you first said how you would put it in chat gpt. Granted I will answer work questions from myself because if they are asking I generally just know the answer or if I don’t I say I don’t. I might even say I think that is somewhere in the wiki or give a wiki link. Much more common if I wrote the wiki section which is kinda common if they are asking me. I might though ask if they did and where it lost them as then I can improve the wiki.

petersr@lemmy.world · edit-2 1 month ago

If I understand correctly, if we actually said “this model is great, let’s put a pin in it”, then it could be turned into a dedicated chip that would be much more efficient and perhaps even something that could get embedded in consumer hardware - but then you are just stuck with that model instead of “the next shiny new model” that they keep making.

Mearcfara@lemmy.ml · 1 month ago

This sent me for a loop.

I don’t mind older stuff- my car is from the late '10s, and was a few years old when I got it, but blew my mind compared to my last car from the mid '00s. It has a back up camera! And even though my car is now nearing 10 years old, my experience hasn’t changed. I’m still driving on mostly the same roads using the same method. And, when I have to get a new car, I’m sure I’ll marvel at remote start or whatever.

But what’s a bummer is the idea that someone else can decide that the hardware is no longer adequate- that “you must have the newest experience”. I simply don’t want that. Yes, it’s annoying that my phone has to be plugged in to access carplay, while new cars have it over bluetooth, but I didn’t even know it was that way until I got a rental recently.

So for AI, i’m okay with some shortcomings, because I can get to know the software and work with it, and if the shortcoming is a show stopper, then I can seek to upgrade or just not do what I was trying to do with my older gen AI.

But alas, the number must go up so the shareholders can rub their stocks or whatever

BrightCandle@lemmy.world · edit-2 1 month ago

I feel like there is a future of more targeted AI. At the moment something that does spreadsheets has to carry knowledge of programming and chemistry and lots of languages and this seems very heavy for what ultimately we need. A programming language focussed AT dedicated to Rust or Go or Java could potentially be quite a bit smaller especially if they focussed on algorithm snippet and auto complete smarts. There is definitely a market for smaller more targeted uses than these all encompassing chat bots where the goal is to move the state of the art on for existing algorithms.