This new data poisoning tool lets artists fight back against generative AI

Elle@lemmy.world · 8 months ago

This new data poisoning tool lets artists fight back against generative AI

guyrocket@kbin.social · 8 months ago

Invisible changes to pixels sound like pure BS to me. I’m sure others know more about it than i do but I thought pixels were very simple things.

seaQueue@lemmy.world · edit-2 8 months ago

“Invisible changes to pixels” means “a human can’t tell the difference with a casual glance” - you can still embed a shit-ton of data in an image that doesn’t look visually like it’s been changed without careful inspection of the original and the new image.

If this data is added in certain patterns it will cause ML models trained against the image to draw incorrect conclusions. It’s a technical hurdle that will slow a casual adversary, someone will post a model trained to remove this sometime soon and then we’ll have a good old software arms race and waste a shit ton of greenhouse emissions adding and removing noise and training ever more advanced models to add and remove it.

You can already intentionally poison images so that image recognition draws incorrect conclusions fairly easily, this is the same idea but designed to cripple ML model training.

Unaware7013@kbin.social · 8 months ago

I’m sure others know more about it than i do but I thought pixels were very simple things.

You’re right, in that pixels are very simple things. However, you and I can’t tell one pixel from another in an image, and at the scale of modern digital art (my girlfriend does hers at 300dpi), shifting a handful of pixels isn’t going to make much of a visible difference to a person, but a LLM will notice them.

ClamDrinker@lemmy.world · edit-2 8 months ago

LLM is the wrong term. That’s Large Language Model. These are generative image models / text-to-image models.

Truthfully though, while it will be there when the image is trained, it won’t ‘notice’ it unless you distort it significantly (enough for humans to notice as well). Otherwise it won’t make much of a difference because these models are often trained on a compressed and downsized version of the image (in what’s called latent space)

V H@lemmy.stad.social · 8 months ago

An AI model will “notice them” but ignore them if trained on enough copies with them to learn that they’re not significant.

theneverfox@pawb.social · 8 months ago

Pixels are very simple things, literally 3-5 3 digit numbers.

But pixels mean little too a generative AI - it’s all about relationship between pixels. All AI are high dimensional shapes right now… If you break up the shape strategically, it’ll poison the image

Will this poison pill work? Probably, for at least a while…

Narrrz@kbin.social · edit-2 8 months ago

have you ever seen those composite images made by combining a huge number of other, radically different images in such a way that each whole image acts like one “pixel” of the overall image? i bet AI models ‘see’ those images very differently than we do.

wheresmypillow@lemmy.one · 8 months ago

A pixel has a binary representation. All of the significant bits for the pixel may not not be needed to display the color of that pixel so there is often excess that can be used or modified. A person wouldn’t see it but an AI reading just the binary would.