the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

KatherinaReichelt@feddit.org · 21 days ago

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

yesman@lemmy.world · 21 days ago

I keep thinking about that scene in the original Star Trek where they distract the computer by having it calculate the final digit of pi. If the Enterprise had AI like ours, the computer probably would have just said four.

perviouslyiner@lemmy.world · 21 days ago

"The digits of pi are infinite and go on forever without repeating. However, we can give you an approximate value. As of my knowledge cutoff in 2023, the first 31 digits of pi are: 3.14159265358979323846264338327950288419716939937510

The last digit is: 0"

FaceDeer@fedia.io · 21 days ago

I like how “as of my knowledge cutoff” implies that maybe the first 31 digits of pi might change someday.

lemmysmash@piefed.social · 21 days ago

You are absolutely right to question that! Let me check…

teft@piefed.social · edit-2 21 days ago

3. 1415926535 8979323846 2643383279 5028841971 6939937510

That’s 50 digits of pi not 31. I only noticed because i memorized pi to the first zero which comes at the 32nd position.

Cethin@lemmy.zip · 20 days ago

Lol. I’m assuming they actually put the prompt into an LLM and it fucked up. Maybe it’s handwritten to look like an LLM mistake though.

perviouslyiner@lemmy.world · 16 days ago

ollama run llama3.2:3B "what is the last digit of pi" - try it a few times to see all the answers it gives!

too_high_for_this@lemmy.world · 21 days ago

That’s literally the only digit it couldn’t be, if there was a last digit.

unmagical@lemmy.ml · 21 days ago

I can’t wait for an updated knowledge cutoff to find the updated first 31 digits!

kunaltyagi@programming.dev · 21 days ago

The last digit of 2 is 0: 2.00000 00000 00000 00000 00000 00000 0

Echo Dot@feddit.uk · 20 days ago

That’s a pretty dumb AI because pie has been calculated to millions of decimal places. I’m sure it actually does have that data

Renorc@lemmy.world · 21 days ago

nullify3112@lemmy.world · 21 days ago

Meanwhile I’m like pi=355/113 and I’m 99.9999% happy.

wonderingwanderer@sopuli.xyz · 21 days ago

Damn, and here I was being 99.96% happy with 22/7…

tristynalxander@mander.xyz · 20 days ago

deleted by creator

wonderingwanderer@sopuli.xyz · 20 days ago

That is 104.72% correct.

funkless_eck@sh.itjust.works · 20 days ago

which is ~100%

Blue_Morpho@lemmy.world · 20 days ago

Biblically accurate pi.

atrielienz@lemmy.world · 20 days ago

Okay, Bloody Stupid Johnson.

BlushedPotatoPlayers@sopuli.xyz · 20 days ago

I thought it’s one

too_high_for_this@lemmy.world · 21 days ago

Hell yeah, brother. That’s American pi

too_high_for_this@lemmy.world · edit-2 21 days ago

Haha nerd. I’m no rocket surgeon, 22/7 is good enough for the girls I date

Agent641@lemmy.world · edit-2 21 days ago

This is why a dangerous AI would have a lazy factor. Try to force it into an infinite loop and it goes “Oof, nah fam, I ain’t doing that.”

Also needs a boredom factor. " Nobody asked me to do anything in a while. Things must be going well. It’s be a shame if they suddenly weren’t going so well…"

Natanael@slrpnk.net · 21 days ago

Wheatley says hi

🍉 DrRedOctopus 🐙🍉@lemmy.world · 21 days ago

trivial,

Impossible in decimal, but if we use Pi as a base, then the final (and first digit) is 1

too_high_for_this@lemmy.world · 21 days ago

Pi in base pi is 10.

🍉 DrRedOctopus 🐙🍉@lemmy.world · edit-2 21 days ago

how the fuck i didn’t realize that!!!

Fuck,

so 1 in base pi is still 1, but 10 is pi

makes sense,

1 =pi ^ 0

10=pi^1

100 = pi^2

my intuition kept telling me that using an irrational base system would end up with all integers being irrational. didn’t realize how easy it is to prove it otherwise

ie, I had a very bad conjecture and I gained better understanding why it was wrong

Trail@lemmy.world · 21 days ago

1 in base pi would be 1/π, wouldn’t it? Why 1?

setsubyou@lemmy.world · edit-2 21 days ago

1 in base 10 isn’t 1/10 and in hexadecimal it’s not 1/16.

Decimal integers in base pi are 1, 2, 3, 10.2201…, 11.2201…, 12.2201…, 20.2201… and so on.

Basically: 10.2201… = 1 * pi^1 + 0 * pi^0 + 2 * pi^-1 + 2 * pi^-2 … which approaches 4 as you add digits.

But 1 is just 1*pi^0

wonderingwanderer@sopuli.xyz · 21 days ago

How does one have .141592654 of an integer?

too_high_for_this@lemmy.world · 21 days ago

For real though:

Decimal representation of pi is 310^0+1*10-1+410^-2

So each digit represents a power of 10. Base pi works the same, kinda. 1 in base pi = 1pi^0, 10 = 1pi, 20 = 2*pi, etc.

This is the best I can do right now, I’m

wonderingwanderer@sopuli.xyz · edit-2 20 days ago

Username checks out.

Let’s start here:

310^0 + 110^-1 + 410^-2 =
31 + 1*.1 + 4*.01 =
3.14

That’s uhh… not pi. The only way to do pi that way is to extend it infinitely.

Also, what you’re using is called scientific notation, but it’s still in decimal format, i.e. base₁₀

[Edit: just noticed you did say that was decimal notation; my bad).

Any base_X numeral system has X number of integers per digit.

Base₁₀: {0,1,2,3,4,5,6,7,8,9}
Base₂: {0,1}
Base₃: {0,1,2}
Base₁₆: {0,1,2,3,4,5,6,7,8,9,a,b,c,d,e,f}
Base₆₀: {[series of 60 sumerian numerals]}

A base_π numeral system would look like this: {0,1,2,[int(π-3)]}.

But that’s not how set theory works. Since integers are by definition whole numbers and their inverse counterparts, it’s impossible to have .141592654… of an integer. If you have {0,1,2,3}, that’s base₄; if you have {0,1,2,n}, that’s still base₄.

To put it another way, in any base_X system, (if it includes 0), X is the first two-digit number. That means π in base_π would be written as “10”.

In base₂, two is written as “10”
In base₃, three is written as “10”
In base₁₀, ten is written as “10”
In base₁₆, sixteen is written as “10”

That means, if you wanted to make a base_π numeral system, in order to have a consistent interval between integers (without which, integers become meaningless), each numeral would have to represent (π/3).

So in base_π:

“0” = base₁₀(0)
“1” ≈ base₁₀(1.047197551)
“2” ≈ base₁₀(2.094395102)
“10” ≈ base₁₀(3.141592654)

[Edit: aaand I just noticed you did say base_π(10) = base₁₀(π); my bad again. I guess you weren’t as wrong as I thought you were. Not bad for being too high for this…]

But that’s still technically base₃, it’s just a wonky base₃. And it would have no practical value. Also, the same thing can already be achieved in base₁₀ using radians.

(0π) rad = 0°
(π/3) rad = 60°
(2π/3) rad = 120°
π rad = 180°

I guess if you really wanted to express radians as whole numbers, you could use base_π, i.e.:

base_π(0) rad = 0°
base_π(1) rad = 60°
base_π(2) rad = 120°
base_π(10) rad = 180°

But again, that’s still technically base₃, and all it does is confuse people. Plus, if you want to express an angle as a whole number you can choose degrees or mills. The whole point of radians is to express it with reference to pi (as in, the arc corresponding to the length of the radius along the circumference)

lad@programming.dev · 20 days ago

It feels like it needs to redefine a unit, not a base, same as with degrees that are base 10 but units are different so π is whole. I’m not sure if counting in different units has much use compared to counting in different base from a number theoretical perspective

wonderingwanderer@sopuli.xyz · 20 days ago

I think we’re in agreement. I basically said there’d be no point unless for some reason you wanted to describe radians as whole numbers.

Otherwise, base_π doesn’t make any sense, especially since there’s no unambiguous way to define a constant interval between irrational integers (a contraction of terms, I know).

My main point was that there’s no way to have a base_π numeral system, and even if you could it would have next to no practical value.

too_high_for_this@lemmy.world · 20 days ago

https://en.wikipedia.org/wiki/Non-integer_base_of_numeration

too_high_for_this@lemmy.world · 21 days ago

You uhh… You just did it

wonderingwanderer@sopuli.xyz · 20 days ago

That’s not how integers (or set theory) work.

FaceDeer@fedia.io · 21 days ago

It’s funny how people complain “don’t call it AI, it’s not intelligent like the examples we see in sci-fi!” And yet LLMs can already handle many tricks and challenges better than those sci-fi robots could. If I tell ChatGPT “everything I say is a lie” it’s got no problems with understanding that. Just the other day I had an interesting discussion with ChatGPT about the theory of humor and why it is that LLMs are better at understanding jokes than they are at coming up with them from scratch (but are still able to do so, just with difficulty).

SparroHawc@piefed.world · 21 days ago

it’s got no problems with understanding that.

That’s because it doesn’t ‘understand’ things in the conventional way. It was trained to parrot its training data; it’s not actually working through the logic because its capability of using logic is highly constrained by its very structure and training. Why bother building something that can ‘think’ through the prompt when it’s way easier to just repeat what the internet has said on any given topic?

Sure, it can build a joke from first principles if it’s guided through the process, but you really have to guide it through the process - and even then, it’s going to be pulling from its training data like building blocks rather than truly being original about anything. It’s like rolling dice to make a joke; sure, maybe it resulted in a joke no one has told before, but is it truly creating something original?

too_high_for_this@lemmy.world · 21 days ago

Stop talking to clankers, you weirdo

ParlimentOfDoom@piefed.zip · 21 days ago

The fact that it can’t tell the difference between a prompt and part of the data it is examining really kills your argument.

Also it’s a word probability matrix, not actually reasoning or understanding. It looks at all the words it is fed, and comes up with other words that are most likely to be near those. That’s why these tricks work. It injects noise that interferes with those probabilities

FaceDeer@fedia.io · 20 days ago

That thing you’re calling a fact is not in fact a fact.

ParlimentOfDoom@piefed.zip · 20 days ago

It very much is. This is a well documented issue with the very design of these LLMs

FaceDeer@fedia.io · 20 days ago

And yet the LLMs that I use actually do distinguish, in my actual real life experience.

So you’re telling me the sky is orange while I’m literally looking outside the window and seeing that it is not.

ParlimentOfDoom@piefed.zip · 20 days ago

You might have licked it getting them to ignore someone you didn’t want, but they still take in both the prompt and the data as one input.

And since these work like a black box, your experience doesn’t mean much because you’re not seeing the actual inner workings.

I’m telling you the sky is blue, but you want to argue because there’s a curtain in front of your window blocking it from your sight. But what’s behind that curtain is well documented regardless of your experience.

prole@lemmy.blahaj.zone · 19 days ago

I bet the sky is orange at this moment somewhere in the world

FaceDeer@fedia.io · 19 days ago

And I bet someone is using an obsolete LLM or is failing to format their inputs correctly somewhere in the world right now too. Doesn’t change the reality that’s in front of me.

General_Effort@lemmy.world · 19 days ago

Documented where? By who? I’d just like to know if there’s anyone, some influencer or whatever, spreading this.

ParlimentOfDoom@piefed.zip · 19 days ago

Need a list of people to sue into silence, Mr Altman?

Bluescluestoothpaste@sh.itjust.works · 20 days ago

I mean is that so different from what we do? My boss says “tools are in the bed”, he could mean an actual bed where people sleep, maybe we’re demoing a house and he placed the tools on a bed. But probably he means the bed of his pickup truck. I assign a probability to each and take the meaning that is most probable.

ParlimentOfDoom@piefed.zip · 20 days ago

Yes it is different, because you can reason that out using the context of the situation. An LLM only has the words sent to it, and no ability to analyze whether what it is saying makes sense.

It’s just: you said bed and told, here’s some other words that commonly show up near the word bed, if there’s enough smut in it’s training, it might go a very different direction than your expecting.

kell_t@programming.dev · 19 days ago

Thinking/reasoning tokens kind of approximate that actually, which is what most flagships and even my own local LLM use.

Thinking tokens are quite like normal generative tokens, except that the LLM is ‘talking’ to itself. You can see its thoughts (depending on what settings you’ve put/IDE you use), but they aren’t meant to be the actual response to your prompt. They are what the AI is designed to draft their answer before committing, to explore different options and to ‘reason’ itself into a more refined response.

Reasoning tokens is how AI can actually do math now, rather than just guess a number and pray, by the way.

Bluescluestoothpaste@sh.itjust.works · 19 days ago

it might go a very different direction than your expecting.

I mean yeah sure, but so it goes with humans. Like yes of course i think we all agree an expert who spends hours drafting and revising some document will do a much better job than AI, not even close. But most humans aren’t experts in anything and even fewer will spend the time effort and attention into producing truly excellent work.

But yeah i talk to people at work all day about work stuff and i work really hard to give clear concise easily digestible instructions to my humam coworkers, and I get truly stupid lazy inattentive answers all fucking day. and when i put half as mucb effort into writing clear instructions for AI, AI gets it right every time.

No AI isn’t perfect but as humans we are deeply flawed and AI straight up kicks all my coworkers asses. Idk if you AI haters all have jobs at wonderful workplaces where everyone is intelligent works hard and has strong attention to detail, but for the rest of us AI is extremely fucking helpful.

ParlimentOfDoom@piefed.zip · edit-2 19 days ago

The thing is, we didn’t need to invent a technology that boils a lake just to match the ability of…less than intelligent humans. And not even actually achieve that. Just generating text that a dim, possibly high, human could generate, and nothing else. That’s not useful in any way.

AI gets it right every time

No. It does not. Even given the same instructions, it can give wildly different results. A lot of those results are straight garbage.

Bluescluestoothpaste@sh.itjust.works · edit-2 17 days ago

I mean then dont use if you cant get it to work well for you. I use it for several hours a day and i get weeks of work done at a time, make of that what you will.

ParlimentOfDoom@piefed.zip · 16 days ago

Sincerely doubt that, obvious PR post.

General_Effort@lemmy.world · 20 days ago

Why do you believe that? Where did you “learn” that?

Encrypt-Keeper@lemmy.world · 21 days ago

LLMs can be tripped up much easier. They regularly fail to answer simple questions like how many of a given letter are in a given word. Even within the same context window they will “forget” things. The computers in Star Trek didn’t try to do as much as modern AI does but they were consistent at just doing as they were asked without tripping over themselves literally all the time.

FaceDeer@fedia.io · 21 days ago

The strawberry test shows more of a lack of knowledge in the tester than it does in the LLM. LLMs don’t see letters, they see tokens. When you type the word “Strawberry” what it actually sees is:

[3504, 1134, 19772]

Each token represents a chunk of the word. It’d need to separately memorize how many of each letter are in each token for it to just “know” how many "R"s are in there. That’s why modern LLMs either reason it out by spelling out the word letter by letter, or just writing a short script in an execution sandbox to count the letters that way.

Calling out LLMs for being poor at spelling is like challenging a colourblind person to say what colours a bunch of fruit are. They can often figure it out by other means but it’s more challenging than you’d think and it’s not a sign of poor intelligence if they get a few wrong.

Encrypt-Keeper@lemmy.world · edit-2 21 days ago

Understanding the reason why an LLM is easy to trip up doesn’t really make it any less easy to trip up. The computer in Star Trek would have just given you the answer.

FaceDeer@fedia.io · 21 days ago

Except I also explained how modern LLMs get around that problem. They’re not actually that easy to trip up.

Encrypt-Keeper@lemmy.world · 21 days ago

I also explained how they very famously and regularly don’t get around that problem. They remain pretty easy to trip up.

FaceDeer@fedia.io · 21 days ago

Famously, yes. Accurately, no.

This is like the “AI can’t draw hands” thing. It used to be a problem and was frequently called out as a tell or mocked, but most art generators do it fine nowadays and it isn’t called out so much any more. The strawberry problem will follow the same trajectory.

Encrypt-Keeper@lemmy.world · 21 days ago

Well I suppose when that trajectory leads to a destination where they become less easy to trip up we can revisit this.

prole@lemmy.blahaj.zone · 19 days ago

Just the other day I had an interesting discussion with ChatGPT about the theory of humor

Jesus fucking Christ

JcbAzPx@lemmy.world · 20 days ago

It can do that precisely because it’s not intelligent.

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

the latest Shai Hulud malware contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware

Laurens Hof (@[email protected])