Show newer

@ao @Natanox @jaredwhite @outie @swheritage their dataset also _doesn’t_ follow licenses. they just took stuff with no license. They openly say this.

On the AI model side, asking users to obey licenses for them (so they don’t have to) sure is a gambit.

@ao @Natanox @jaredwhite @outie @swheritage theatre Open source isn’t public domain. open source licenses only count if you follow their terms, which an AI doesn’t.

arborelia boosted

oh hey just as a PSA, any code any of y'all may have on github might have been scraped by @swheritage, a self-proclaimed "preservation" org which

- just hoovered up vast amounts of data without asking or telling anyone
- insists on deadnaming trans people forever for "integrity" reasons
- used it to build an LLM training data set

huggingface.co/datasets/bigcod to check and for opt-out instructions

arborelia boosted

Every time I boot to a flash drive I'm reminded of Jersey Jack's pinball update process

You flash the new software to a flash drive, open up the coin door, put t' plug int' 'ole and wait a bit until it's done.

And then you IMMEDIATELY TAKE THAT FLASH DRIVE AWAY AND WIPE IT CLEAN right then and there before you do anything else

Because if you leave it lying around, and someone (MAYBE FUTURE-YOU) goes "Huh what's on this" and plugs it in, and then forgets to unplug it between reboots, it will format your hard drive and turn your computer into The Hobbit Pinball *completely automatically and without any input from you whatsoever*

Show thread

I erased those 18 minutes on Nixon’s tapes. there was some funny stuff in there but it’s gone. Sorry

Show thread

can you believe I got them to call it the “Diet of Worms”

Show thread

every time a transphobe tells me I can’t change history I will change it twice

@beaufils @swheritage @internetarchive

not you, internet archive, you’re cool. the asshole who thinks “you can’t change history” when it comes to trans name changes just mentioned you here, sadly.

@oreolek lol we're coming up on the 1 year anniversary of them not getting around to opt-out requests

arborelia boosted

@arborelia @swheritage

To the best of our knowledge, all files contained in the dataset are licensed with one of the permissive licenses (see list in Licensing information) or no license.

Emphasis mine.

What the cinnamon toast fuck?

@ryanc @swheritage Also, no language model is capable of obeying an attribution clause, which is in almost every license.

@ryanc @swheritage I'm already seeing in their list of opt-out GitHub issues that they've included some people's code that is "all rights reserved", and some people's GPL code.

@swheritage To find out if they have appropriated your code, you can check "Am I in The Stack?": huggingface.co/datasets/bigcod

However, _do not believe their supposed opt-out_. I mean, sure, submit an opt-out if you want, but I know how they operate -- they'll just keep doing whatever they want and never process any takedowns unless the law makes them.

Show thread

Hey, in case their transphobia wasn't enough for you, @swheritage is yoinking all the code on GitHub -- regardless of license -- to train a generative AI that plagiarizes code.

No matter how many times they say "ethical", it isn't.

mstdn.social/@swheritage/11204

Part 2 of my attempt to get the Software Heritage Archive to stop deadnaming me: the part where I get angry in French.

cohost.org/arborelia/post/5052

arborelia boosted

i'm just going to point to this meme and like 90% of y'all

@TerrorBite exactly that. What they have is 100 trivial forks of the old copy from other people’s github accounts

Show older
Computer Fairies

Computer Fairies is a Mastodon instance that aims to be as queer, friendly and furry as possible. We welcome all kinds of computer fairies!