AI was a common theme at Gamescom 2025, and while some indie teams say it's invaluable, it remains an ethical nightmare

inclementimmigrant@lemmy.world · 2 months ago

AI was a common theme at Gamescom 2025, and while some indie teams say it's invaluable, it remains an ethical nightmare

Skullgrid@lemmy.world · 2 months ago

doesn’t have to be an ethical nightmare. Public domain datasets on local hardware using renewable eletricity, who’s mad now, the artist you already can’t afford to pay because you have no fucking money anyway?

very_well_lost@lemmy.world · 2 months ago

AI would be fine if we just changed everything about it

lol

onslaught545@lemmy.zip · 2 months ago

Not all LLMs are the same. You can absolutely take a neural network model and train it yourself on your own dataset that doesn’t violate copyright.

Mika@sopuli.xyz · 2 months ago

I can almost guarantee that hundred billion params LLMs are not trained on that, and are trained on the whole web scraped to the furthest extent.

The only sane and ethical solution going forward is to force to opensource all LLMs. Use the datasets generated by humanity - give back to humanity.

Mika@sopuli.xyz · 2 months ago

Besides, the article is about image gen AI, not LLMs.

onslaught545@lemmy.zip · 1 month ago

That’s an LLM, buddy.

Mika@sopuli.xyz · 1 month ago

Article directly complains about AI artwork. You know what LLM even means?

onslaught545@lemmy.zip · 1 month ago

Yes, I do. I also know that multimodal LLMs are what generate AI artwork.

Mika@sopuli.xyz · 1 month ago

Then you should provably know that image gen existed long before MLLMs and was already a menace to artists back then.

And that MLLM is generally a layered combo of lots of preexisting tools, where LLM is used as a medium that allows to attach OCR inputs and give more accurate instructions to image gen AI part.

null@lemmy.nullspace.lol · 1 month ago

Image Gen AI is an LLM?

onslaught545@lemmy.zip · 1 month ago

Yes, it is. LLMs do more than just text generation.

null@lemmy.nullspace.lol · 1 month ago

Source?

unconfirmedsourcesDOTgov@lemmy.sdf.org · 1 month ago

What do you think the letters LLM stand for, pal?

Skullgrid@lemmy.world · 2 months ago

The only sane and ethical solution going forward is to force to opensource all LLMs.

Jesus fucking christ. There are SO GODDAMN MANY open source LLMs, even from fucking scumbags like facebook. I get that there’s subtleties to the argument on the ProAI vs AntiAI side, but you guys just screech and scream.

https://github.com/eugeneyan/open-llms

6nk06@sh.itjust.works · 2 months ago

Where are the sources? All I see is binary files.

Mika@sopuli.xyz · 2 months ago

even meta

Lol, ofc meta, they have the biggest bigdata out there, full of private data.

Most of the opensources are recompilations of existing opensource LLMs.

And the page you’ve listed is <10b mostly, bar LLMs with huge financing, and generally either copropate or Chinese behind them.

vrighter@discuss.tchncs.de · 1 month ago

there are barely any. I can’t name a single one offhand. Open weights means absolutely nothing about the actual source of those weights.

Riskable@programming.dev · 2 months ago

Training an AI is orthogonal to copyright since the process of training doesn’t involve distribution.

You can train an AI with whatever TF you want without anyone’s consent. That’s perfectly legal fair use. It’s no different than if you copy a song from your PC to your phone.

Copyright really only comes into play when someone uses an AI to distribute a derivative of someone’s copyrighted work. Even then, it’s really the end user that is even capable of doing such a thing by uploading the output of the AI somewhere.

hitmyspot@aussie.zone · 2 months ago

That’s assuming you own the media in the first place. Often AI is trained with large amounts of data downloaded illegally.

So, yes, it’s fair use to train on information you have or have rights to. It’s not fair use to illegally obtain new data. Even more, to renting that data often means you also distribute it.

For personal use, I don’t have an issue with it anyway, but legally it’s not allowed.

Riskable@programming.dev · 1 month ago

Incorrect. No court has ruled in favor of any plaintiff bringing a copyright infringement claim against an AI LLM. Here’s a breakdown of the current court cases and their rulings:

https://www.skadden.com/insights/publications/2025/07/fair-use-and-ai-training

In both cases, the courts have ruled that training an LLM with copyrighted works is highly transformative and thus, fair use.

The plaintiffs in one case couldn’t even come up with a single iota of evidence of copyright infringement (from the output of the LLM). This—IMHO—is the single most important takeaway from the case: Because the only thing that really mattered was the point where the LLMs generate output. That is, the point of distribution.

Until an LLM is actually outputting something, copyright doesn’t even come into play. Therefore, the act of training an LLM is just like I said: A “Not Applicable” situation.

hitmyspot@aussie.zone · 1 month ago

Just a heads up that anthropic have just lost a $1.5b case for downloading and storing copyrighted works. That’s $3,000 per author of 500000 books.

The wheels of justice move slowly but fair use has limits. Commercial use is generally not one. Commentary and transformation are, so we’ll see how this progresses with the many other cases.

Warner Brothers have recently filed another case, I think.

Riskable@programming.dev · 1 month ago

Anthropic didn’t lose their lawsuit. They settled. Also, that was about their admission that they pirated zillions of books.

From a legal perspective, none of that has anything to do with AI.

Company pirates books -> gets sued for pirating books. Companies settles with the plaintiffs.

It had no legal impact on training AI with copyrighted works or what happens if the output is somehow considered to be violating someone’s copyright.

What Anthropic did with this settlement is attack their Western competitor: OpenAI, specifically. Because Google already settled with the author’s guild for their book scanning project over a decade ago.

Now OpenAI is likely going to have to pay the author’s guild too. Even though they haven’t come out and openly admitted that they pirated books.

Meta is also being sued for the same reason but they appear to be ready to fight in court about it. That case is only just getting started though so we’ll see.

The real, long-term impact of this settlement is that it just became a lot more expensive to train an AI in the US (well, the West). Competition in China will never have to pay these fees and will continue to offer their products to the West at a fraction of the cost.

hitmyspot@aussie.zone · 1 month ago

While that’s interesting info and links, I don’t think that’s true.

https://share.google/opT62A4cIvKp6pwhI This case with Thomson has, but is expected to be overturned.

Most of the big cases are in the early stages. Let’s see what the Disney one does.

There is also the question, not just of copyright or fair use, but legally obtaining the data. Facebook torrented terabytes of data and claimed they did not share it. I don’t know that that’s enough to claim innocence. It hasn’t been for individuals.

The question is whether they are actually transformative. Just being different is not enough. I can’t use Disney IP to make my new movie, for instance.

HarkMahlberg@kbin.earth · 2 months ago

Beyond the copyright issues and energy issues, AI does some serious damage to your ability to do actual hard research. And I’m not just talking about “AI brain.”

Let’s say you’re looking to solve a programming problem. If you use a search engine and look up the question or a string of keywords, what do you usually do? You look through each link that comes up and judge books by their covers (to an extent). “Do these look like reputable sites? Have I heard of any of them before?” You scroll click a bunch of them and read through them. Now you evaluate their contents. “Have I already tried this info? Oh this answer is from 15 years ago, it might be outdated.” Then you pare down your links to a smaller number and try the solution each one provides, one at a time.

Now let’s say you use an AI to do the same thing. You pray to the Oracle, and the Oracle responds with a single answer. It’s a total soup of its training data. You can’t tell where specifically it got any of this info. You just have to trust it on faith. You try it, maybe it works, maybe it doesn’t. If it doesn’t, you have to write a new prayer try again.

Even running a local model means you can’t discern the source material from the output. This isn’t Garbage In Garbage Out, but Stew In Soup Out. You can feed an AI a corpus of perfectly useful information, but it will churn everthing into a single liquidy mass at the end. You can’t be critical about the output, because there’s nothing to critique but a homogenous answer. And because the process is destructive, you can’t un-soup the output. You’ve robbed yourself of the ability to learn from the input, and put all your faith into the Oracle.

Skullgrid@lemmy.world · edit-2 2 months ago

The topic is : using AIs for game dev.

I’m pretty sure that generating placeholder art isn’t going to ruin my ability to research
AIs need to be used TAKING THEIR FLAWS INTO ACCOUNT and for very specific things.

I’m just going to be upfront: AI haters don’t know the actual way this shit works except that by existing, LLMS drain oceans and create more global warming than the entire petrol industry, and AI bros are filling their codebases with junk code that’s going to explode in their faces from anywhere between 6 months to 3 years.

There is a sane take : use AIs sparingly, taking their flaws into consideration, for placeholder work, or once you obtain a training base on content you are allowed to use. Run it locally, and use renewable sources for electricity.

lime!@feddit.nu · 1 month ago

as someone who has studied ml since around 2015, i’m still not convinced. i run local models, i train on CC data, i triple-check everything, and it’s just not that useful. it’s fun, but not productive.

HarkMahlberg@kbin.earth · 1 month ago

Wild to see you call for a “sane take” when you strawman the actual water problem into “draining the oceans.”

Local residents with nearby data centers aren’t being told to take fewer showers with salt water from the ocean.

Skullgrid@lemmy.world · 1 month ago

Is that a problem with the existence of llms as a technology, or shitty corporations working with corrupt governments in starving local people of resources to turn a quick buck?

If you are allowing a data center to be built, you need to make sure you have power etc to build it without negativitely impacting the local people. It’s not the fault of an LLM that they fucked this shit up.

very_well_lost@lemmy.world · 1 month ago

Are you really gonna use the “guns don’t kill people, people kill people” argument to defend LLMS?

Let’s not forget that the first ‘L’ stands for “large”. These things do not exist without massive, power and resource hungry data centers. You can’t just say “Blame government mismanagement! Blame corporate greed!” without acknowledging that LLMs cease to exist without those things.

And even with all of those resources behind it, the technology is still only marginally useful at best. LLMs still hallucinate, they still confidently distribute misinformation, they still contribute to mental health crises in vulnerable individuals, and no one really has any idea how to stop those things from happening.

What tangible benefit is there to LLMs that justifies their absurd cost? Honestly?

Skullgrid@lemmy.world · edit-2 1 month ago

making up deficiencies in your own artistic and linguistic skills , getting easy starting points for coding solutions.

LLMs still hallucinate,

Emergent behaviour can be useful in coming up with new ideas that you were not expecting and areas to explore

they still confidently distribute misinformation,

yeah, that’s been a problem since language, if you want a statement more close to the topic at hand, the printing press.

they still contribute to mental health crises in vulnerable individuals, and no one really has any idea how to stop those things from happening.

so does the fucking internet.

Are you really gonna use the “guns don’t kill people, people kill people” argument to defend LLMS?

chad.jpg

Mika@sopuli.xyz · 2 months ago

you can’t be critical about the answer

You actually can, and you should be. And the process is not destructive since you can always undo in tools like cursor, or discard in git.

Besides, you can steer a good coding LLM in a right direction. The better you understand what are you doing - the better.

MoreZombies@aussie.zone · 2 months ago

How would you be critical of the answer without also doing a traditional search to compare its answer? If you have to search and verify the answer anyway, didn’t we just add an unnecessary step to the process?

Mika@sopuli.xyz · edit-2 1 month ago

You can have knowledge of the technology firsthand and just need to generate the code? I mean I would need to google different function names and conversion tricks all the time anyway, even if I’m really good at it. If AI slops it for me, it just speeds it up by a lot, and I can notice bad moments.

Again, the better you know what you are doing, the more it could help.

ElectricMachman@lemmy.sdf.org · 1 month ago

But if you know what you’re doing, you can do a better job than the “AI”…??? This is a weird argument

Mika@sopuli.xyz · 1 month ago

With infinite time, sure. Time isn’t infinite.

very_well_lost@lemmy.world · 1 month ago

That would be all well and good, if corpos weren’t pushing AI as a technology that everyone should be using all the time to reshape their daily lives.

The people most attracted to AI as a technology (and the ones that AI companies are marketing to the hardest) are the ones who want to use it for things where they don’t already have domain-specific expertise. Non-artists generating art, or non-coders making apps on “vibes”, etc. Have you ever heard of Travis Kalanick? He’s one of the co-founders of Uber and he recently made the news after he went on some podcast to breathlessly rave about how he’s been using LLMs to do “vibe physics”. Kalanick, as you can guess, is not a physicist. In fact he’s not a scientist of any kind.

The vast, vast majority of people using AI aren’t using it to augment their existing skills, and they aren’t using their own expertise to evaluate the output critically. This was never the point nor the promise of AI, and it’s certainly not the direction that the people pushing this technology are attempting to push it.

Mika@sopuli.xyz · 1 month ago

AI marketing is total BS, but it doesn’t mean AI is not useful in it’s current state. People try to argue as if that was the case, but it simply isn’t. Agentic AI + LLM does speed up usual tasks by a whole fucking lot.

Next day, these people would be wondering why they don’t have access to essential tools they need to be effective (means of production), completely forgotten they were against these tools completely out of principle. This is as shortsighted as it can get.

very_well_lost@lemmy.world · edit-2 1 month ago

AI marketing is total BS, but it doesn’t mean AI is not useful in it’s current state.

But the AI only exists because of the marketing BS! The fact that AI is useful to qualified people in specialized fields doesn’t matter when the technology is being mass marketed to a completely different group of people for completely different use cases.

LLMs are called “large” for a reason — their existence demands large datasets, large data centers, large resource consumption, and large capital expenditure to secure all of those things. The only entries with the resources to make that happen are large corporations (and rich nation-states, but they seem to be content to keep any of their own LLM efforts under wraps for now). You can only say “don’t blame the technology, blame the technologist” when it’s possible to separate the two, but in this case it’s not. LLMs don’t exist without the corpos, and the corpos are determined to push LLMs into places and use cases where they have no business being.

Mika@sopuli.xyz · 1 month ago

Openweight/Opensource LLMs do exist though. And isn’t not only tiny models.

HarkMahlberg@kbin.earth · 1 month ago

You misunderstood, I wasn’t saying you can’t Ctrl Z after using the output, but that the process of training an AI on a corpus yields a black box. This process can’t be reverse engineered to see how it came up with it’s answers.

It can’t tell you how much of one source it used over another. It can’t tell you what it’s priorities are in evaluating data… not without the risk of hallucinating on you when you ask it.

eldebryn@lemmy.world · 2 months ago

Out of legit curiosity, how many models do you know trained exclusively on public domain data, which are actually useful?

lime!@feddit.nu · 1 month ago

anything trained on common corpus. which, oddly, is harder to find than the actual training data.

eldebryn@lemmy.world · 1 month ago

I mean this respectfully, but that wasn’t an actual answer.

lime!@feddit.nu · 1 month ago

no, it sort of reinforced your point.

eldebryn@lemmy.world · 1 month ago

I see, that’s fair.