School shooting survivor sues AI gun detection firm after system failed to spot weapon

Tony Bark@pawb.social · 1 day ago

CeeBee_Eh@lemmy.world · 17 hours ago

you could paint marker “not a gun” on the side of a gun and guess what would happen.

It would flag it as a gun. How do I know? I worked on and developed a similar system at one point. It worked extremely well. We weren’t an American company and ultimately covid killed us (it was US American orgs that were the most interested in our stuff).

It has some uses, but 95% of what it is being used for and 100% of the data centers aren’t it.

Do you think LLMs are being used for this sort of thing? Putting aside the sheer technical hurdle of slapping an LLM vision model on top of dozens and dozens of real-time camera streams, the hardware requirements would put the company out of business before it made its first sale.

Computer vision models, which are NOT LLMs, have been around for quite a while now and are very good at doing one thing and one thing only. And they’ll do it well for a minuscule fraction of what it takes to run an LLM.

No, datacentres are not being used for real-time gun detection. The company might have other kinds of infrastructure located in a DC, but not the main video processing hardware.

db2@lemmy.world · 15 hours ago

Do you think LLMs are being used for this sort of thing?

Yes. It took all of five seconds to find out too.

No, datacentres are not being used for real-time gun detection

You’ve already been wrong once, care to try for two?

CeeBee_Eh@lemmy.world · 7 hours ago

Yes. It took all of five seconds to find out too.

Didn’t I just say that slapping an LLM vision model onto dozens of camera streams would be a near-impossible technical hurdle?

I never said vision LLMs don’t exist. I said they’re impractical for this use case.

You’ve already been wrong once, care to try for two?

Haven’t been wrong yet. You on the other hand…

db2@lemmy.world · 6 hours ago

There are several examples of exactly what I said, contradicting your repeated claim. Since I don’t want to talk to someone with the conversational ability of Donald Trump, demanding things be true in spite of evidence they’re not, I’m going to be blocking you now. Have a nice day.

CeeBee_Eh@lemmy.world · 3 hours ago

There are several examples of exactly what I said

No one is denying the existence of vision-based LLMs. The issue is performance. It takes on the order of double-digit (or even triple-digit) seconds to process an image through an LLM. Even if it took a single second to process an image on decent server-grade hardware (which starts at about $10k per card), that’s way too expensive and still not fast enough.

For just 10 cameras at a facility, that would require north of $100k in GPUs alone.

Whereas a specialized computer vision model could process several dozen camera streams, in real-time, on just one of those $10k cards.

An LLM would process an image in 10 seconds (and that’s generous), whereas a computer vision model runs in milliseconds. We’re talking about a 1000x difference in required processing power.
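To make the arithmetic above concrete, here is a rough back-of-the-envelope sketch. The $10k card price, the 1-second and 10-second LLM latencies, and the 10 cameras come from the figures in this comment. The 10 ms latency for the computer vision model and the one-frame-per-second-per-camera sampling rate are assumptions chosen to match "milliseconds" and the $100k estimate; the function name is made up for illustration.

```python
CARD_PRICE_USD = 10_000  # "about $10k per card", from the comment above


def cards_needed(cameras: int, fps_per_camera: int, ms_per_image: int) -> int:
    """GPUs needed if each card processes one image at a time."""
    # Total GPU-milliseconds of work that arrive every second.
    work_ms_per_second = cameras * fps_per_camera * ms_per_image
    # Each card supplies 1000 ms of compute per second; round up.
    return -(-work_ms_per_second // 1000)


# Assumption: each camera is sampled at only 1 frame per second.
llm_cards = cards_needed(cameras=10, fps_per_camera=1, ms_per_image=1_000)
cv_cards = cards_needed(cameras=10, fps_per_camera=1, ms_per_image=10)

print(llm_cards, llm_cards * CARD_PRICE_USD)  # cards and cost, LLM at 1 s/image
print(cv_cards, cv_cards * CARD_PRICE_USD)    # cards and cost, CV model at 10 ms/image
print(10_000 // 10)                           # 10 s vs 10 ms per image
```

Under these assumptions the LLM setup needs ten cards ($100k) even at the optimistic 1 second per image, while the vision model fits all ten streams on one card with capacity to spare.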

That’s why you’re wrong and have zero clue what you’re talking about.

You’re arguing that a family uses a fully loaded semi-trailer to go 200m to the local park. It’s a clueless and asinine argument.

Wispy2891@lemmy.world · 15 hours ago

Using an LLM to detect a specific object in an image is possible but stupid: if your object is always the same (like in this case), it’s several orders of magnitude cheaper to train once on that specific object and then run the computer vision model directly on the local server that’s recording the video.

Otherwise:

The API costs would be colossal: at $0.001 per image and 30 fps, that’s roughly $100 per hour, and nobody would pay that
The detection latency would be several seconds vs almost instant
Without an internet connection the system wouldn’t work
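The API cost figure in the list above can be checked with a few lines of arithmetic. This is only a sketch: the $0.001 per image and 30 fps come from the comment, while sending every frame from a single camera, running around the clock, is an assumption.

```python
PRICE_PER_IMAGE_MILLIDOLLARS = 1  # $0.001 per image, kept as an integer
FPS = 30                          # frames per second sent to the API

# Assumption: one camera, every frame is sent, 24 hours a day.
images_per_hour = FPS * 60 * 60
cost_per_hour_usd = images_per_hour * PRICE_PER_IMAGE_MILLIDOLLARS / 1000
cost_per_year_usd = cost_per_hour_usd * 24 * 365

print(images_per_hour)    # frames sent per hour
print(cost_per_hour_usd)  # dollars per hour for one camera
print(cost_per_year_usd)  # dollars per year for one camera
```

That works out to about $108 per hour, or a little under $1 million per year per camera if it never stops.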

LLM-based image recognition makes sense when the object changes with every request, or when it’s ultra-specific, down to brands and colors

db2@lemmy.world · 14 hours ago

if your object is always the same (like in this case)

It isn’t the same though. A large gauge shotgun and a small gauge pistol are pretty different looking. Compare those to a .22 rifle with a scope, and those to a decked out ar15. That’s a lot of different always the sames. What if it’s a revolver? Or has a folded stock? Or a sawed off stock? Will it recognize a derringer or a mac10 with a large capacity mag as guns?

We can because they make us dead. We have valid reason to fear them, which is a great motivator for most species to learn to recognize the danger. You’d still recognize a ring gun as a gun; without being specifically trained to do so, a machine will identify it as jewelry.

CeeBee_Eh@lemmy.world · 7 hours ago

A large gauge shotgun and a small gauge pistol are pretty different looking. Compare those to a .22 rifle with a scope, and those to a decked out ar15. That’s a lot of different always the sames. What if it’s a revolver? Or has a folded stock? Or a sawed off stock? Will it recognize a derringer or a mac10 with a large capacity mag as guns?

You seem to think that computer vision models can only be trained on a single thing. You simply train your model on as many object types as you want it to be aware of. That’s it.
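As a rough illustration of the point, here is how the output of a single multi-class detector might be consumed. Everything in it is hypothetical: the class list, the `weapon_alerts` helper, and the `(class_index, confidence)` detection format are invented for the example and do not describe any particular product or library.

```python
# Hypothetical label set: one model, many object types.
CLASSES = ["pistol", "revolver", "shotgun", "rifle", "phone", "umbrella"]
WEAPON_CLASSES = {"pistol", "revolver", "shotgun", "rifle"}


def weapon_alerts(detections, threshold=0.5):
    """Filter one model pass down to confident weapon detections.

    detections: list of (class_index, confidence) pairs (assumed format).
    """
    return [
        (CLASSES[index], confidence)
        for index, confidence in detections
        if CLASSES[index] in WEAPON_CLASSES and confidence >= threshold
    ]


# One frame, one inference pass, several object types at once.
frame_detections = [(2, 0.9), (4, 0.95), (0, 0.3)]
print(weapon_alerts(frame_detections))
```

Covering another weapon type means adding a label and training data for it, not running a second model.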

Wispy2891@lemmy.world · 11 hours ago

So, train the computer vision model for a gun and train again for a shotgun. Run the two detection models at the same time.

Your approach is the typical “but if you really want you can use an atomic bomb to kill mosquitoes” - yes, you could do that, but nobody is paying $1 mil/year in inference costs (+some expensively licensed software to wrap around that) when it can be done locally with a $300 GPU (+ some expensively licensed software to wrap around that)

db2@lemmy.world · 7 hours ago

I gave a lot more than two examples and it was hardly exhaustive.