the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

KatherinaReichelt@feddit.org · 21 days ago

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

ParlimentOfDoom@piefed.zip · 21 days ago

The fact that it can’t tell the difference between a prompt and part of the data it is examining really kills your argument.

Also it’s a word probability matrix, not actually reasoning or understanding. It looks at all the words it is fed, and comes up with other words that are most likely to be near those. That’s why these tricks work. It injects noise that interferes with those probabilities

FaceDeer@fedia.io · 20 days ago

That thing you’re calling a fact is not in fact a fact.

ParlimentOfDoom@piefed.zip · 20 days ago

It very much is. This is a well documented issue with the very design of these LLMs

FaceDeer@fedia.io · 20 days ago

And yet the LLMs that I use actually do distinguish, in my actual real life experience.

So you’re telling me the sky is orange while I’m literally looking outside the window and seeing that it is not.

ParlimentOfDoom@piefed.zip · 20 days ago

You might have licked it getting them to ignore someone you didn’t want, but they still take in both the prompt and the data as one input.

And since these work like a black box, your experience doesn’t mean much because you’re not seeing the actual inner workings.

I’m telling you the sky is blue, but you want to argue because there’s a curtain in front of your window blocking it from your sight. But what’s behind that curtain is well documented regardless of your experience.

prole@lemmy.blahaj.zone · 19 days ago

I bet the sky is orange at this moment somewhere in the world

FaceDeer@fedia.io · 19 days ago

And I bet someone is using an obsolete LLM or is failing to format their inputs correctly somewhere in the world right now too. Doesn’t change the reality that’s in front of me.

General_Effort@lemmy.world · 19 days ago

Documented where? By who? I’d just like to know if there’s anyone, some influencer or whatever, spreading this.

ParlimentOfDoom@piefed.zip · 19 days ago

Need a list of people to sue into silence, Mr Altman?

Bluescluestoothpaste@sh.itjust.works · 20 days ago

I mean is that so different from what we do? My boss says “tools are in the bed”, he could mean an actual bed where people sleep, maybe we’re demoing a house and he placed the tools on a bed. But probably he means the bed of his pickup truck. I assign a probability to each and take the meaning that is most probable.

ParlimentOfDoom@piefed.zip · 20 days ago

Yes it is different, because you can reason that out using the context of the situation. An LLM only has the words sent to it, and no ability to analyze whether what it is saying makes sense.

It’s just: you said bed and told, here’s some other words that commonly show up near the word bed, if there’s enough smut in it’s training, it might go a very different direction than your expecting.

kell_t@programming.dev · 19 days ago

Thinking/reasoning tokens kind of approximate that actually, which is what most flagships and even my own local LLM use.

Thinking tokens are quite like normal generative tokens, except that the LLM is ‘talking’ to itself. You can see its thoughts (depending on what settings you’ve put/IDE you use), but they aren’t meant to be the actual response to your prompt. They are what the AI is designed to draft their answer before committing, to explore different options and to ‘reason’ itself into a more refined response.

Reasoning tokens is how AI can actually do math now, rather than just guess a number and pray, by the way.

Bluescluestoothpaste@sh.itjust.works · 19 days ago

it might go a very different direction than your expecting.

I mean yeah sure, but so it goes with humans. Like yes of course i think we all agree an expert who spends hours drafting and revising some document will do a much better job than AI, not even close. But most humans aren’t experts in anything and even fewer will spend the time effort and attention into producing truly excellent work.

But yeah i talk to people at work all day about work stuff and i work really hard to give clear concise easily digestible instructions to my humam coworkers, and I get truly stupid lazy inattentive answers all fucking day. and when i put half as mucb effort into writing clear instructions for AI, AI gets it right every time.

No AI isn’t perfect but as humans we are deeply flawed and AI straight up kicks all my coworkers asses. Idk if you AI haters all have jobs at wonderful workplaces where everyone is intelligent works hard and has strong attention to detail, but for the rest of us AI is extremely fucking helpful.

ParlimentOfDoom@piefed.zip · edit-2 19 days ago

The thing is, we didn’t need to invent a technology that boils a lake just to match the ability of…less than intelligent humans. And not even actually achieve that. Just generating text that a dim, possibly high, human could generate, and nothing else. That’s not useful in any way.

AI gets it right every time

No. It does not. Even given the same instructions, it can give wildly different results. A lot of those results are straight garbage.

Bluescluestoothpaste@sh.itjust.works · edit-2 17 days ago

I mean then dont use if you cant get it to work well for you. I use it for several hours a day and i get weeks of work done at a time, make of that what you will.

ParlimentOfDoom@piefed.zip · 16 days ago

Sincerely doubt that, obvious PR post.

General_Effort@lemmy.world · 20 days ago

Why do you believe that? Where did you “learn” that?

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

Laurens Hof (@[email protected])