OpenAI declares AI race “over” if training on copyrighted works isn’t fair use
-
This post did not contain any content.
If I'm using "AI" to generate subtitles for the "community" is ok if i have a large "datastore" of "licensable media" stored locally to work off of right?
-
That's a good litmus test. If asking/paying artists to train your AI destroys your business model, maybe you're the arsehole.
Interesting copyright question: if I own a copy of a book, can I feed it to a local AI installation for personal use?
Can a library train a local AI installation on everything it has and then allow use of that on their library computers? <— this one could breathe new life into libraries
-
What if we had taken the billions of dollars invested in AI and invested that into public education instead?
Imagine the return on investment of the information being used to train actual humans who can reason and don’t lie 60% of the time instead of using it to train a computer that is useless more than it is useful.
But you have to pay humans, and give them bathroom breaks, and allow them time off work to spend with their loved ones. Where's the profit in that? Surely it's more clever and efficient to shovel time and money into replacing something that will never be able to practically develop beyond current human understanding. After all, we're living in the golden age of humanity and history has ended! No new knowledge will ever be made so let's just make machines that regurgitate our infallible and complete knowledge.
-
You were trained and learned and are able to create new things.
AI poorly mimics thngs it has seen before.
The issue being raised is copyright infringement, not the quality of the results. Writers "borrow" each other's clever figures of speech all the time without payment or attribution. I'm sure I have often copypasted code without permission. AI does nothing on its own, it's always a tool used by human beings. I think the copyright argument against AI is built on a false distinction between using one tool vs another.
My larger argument is that nobody has an inherent right to control what everybody else does with some idea they've created. For many thousands of years people saw stuff and freely imitated it. Oh look, there's an "arch" - I think I'll build a building like that. Oh look, that tribe uses that root to make medicine, let's do the same thing. This process was known as "the spread of civilization" until somebody figured out that an authority structure could give people dibs on their ideas and force other people to pay to copy them. As we evolve more capabilities (like AI) I think it's time to figure out another way to reward creators without getting in the way of improvement, instead of hanging onto a "Hey, that's Mine!" mentality that does more to enrich copy producers than it does to enrich creators.
-
Copyrights should have never been extended longer than 5 years in the first place, either remove draconian copyright laws or outlaw LLM style models using copyrighted material, corpos can't have both.
-
This post did not contain any content.
National security my ass. More like his time span to show more dumb "achievements" while getting richer depends on it and nothing else
-
Copyrights should have never been extended longer than 5 years in the first place, either remove draconian copyright laws or outlaw LLM style models using copyrighted material, corpos can't have both.
-
Gatekeeping absolutely was the intention of copyright, not to provide artists with income.
By gatekeeping I mean the use of digital methods to verify or restrict use of purchased copyright material after a sale such as Digital rights management, encryption such as CSS/AACS/HDCP, or obfuscation.
The whole "you didn't buy a copy, you bought a license" BS undermines what copyright was supposed to be IMO.
-
The issue being raised is copyright infringement, not the quality of the results. Writers "borrow" each other's clever figures of speech all the time without payment or attribution. I'm sure I have often copypasted code without permission. AI does nothing on its own, it's always a tool used by human beings. I think the copyright argument against AI is built on a false distinction between using one tool vs another.
My larger argument is that nobody has an inherent right to control what everybody else does with some idea they've created. For many thousands of years people saw stuff and freely imitated it. Oh look, there's an "arch" - I think I'll build a building like that. Oh look, that tribe uses that root to make medicine, let's do the same thing. This process was known as "the spread of civilization" until somebody figured out that an authority structure could give people dibs on their ideas and force other people to pay to copy them. As we evolve more capabilities (like AI) I think it's time to figure out another way to reward creators without getting in the way of improvement, instead of hanging onto a "Hey, that's Mine!" mentality that does more to enrich copy producers than it does to enrich creators.
Yes, whether copyright should exist is a different discussion than how AI is violating it in a very different way than snippets being reused in different contexts as part of a new creative work.
Intentionally using a single line is very different than scooping up all the data and hitting a randomizer until it stumbles into some combination that happens to look usable. Kind of like how a single business jacking up prices is different than a monopoly jacking up all the prices.
-
Copyrights should have never been extended longer than 5 years in the first place, either remove draconian copyright laws or outlaw LLM style models using copyrighted material, corpos can't have both.
I agree that copyright is far too long, but at 5 years there's hardly incentive to produce. You could write a novel and have it only starting to get popular after 5 years.
-
But I can't pirate copyrighted materials to "train" my own real intelligence.
Now you get why we were all told to hate AI. It's a patriot act for copywrite and IP laws. We should be able too. But that isn't where our discussions were steered was it
-
Bro, what? Some books take more than 5 years to write and you want their authors to only have authorship of it for 5 years? Wtf. I have published books that are a dozen years old and I'm in my mid-30s. This is an insane take.
You don't have to stop selling when a book becomes public domain, publishers and authors sell public domain/commons books frequently, it's just you won't have a monopoly on the contents after the copyright expires.
-
But I can't pirate copyrighted materials to "train" my own real intelligence.
you can, however, go to your local library and read any book ever written for free
-
you can, however, go to your local library and read any book ever written for free
-
You don't have to stop selling when a book becomes public domain, publishers and authors sell public domain/commons books frequently, it's just you won't have a monopoly on the contents after the copyright expires.
how about: tiered copy rights?
after 5 years, you lose some copyright but not all?it’s a tricky one but impoverished people should still be able to access culture…
-
I don’t think they’re wrong in saying that if they aren’t allowed to train on copyrighted works then they will fall behind. Maybe I missed it in the article, but Japan for example has that exact law (use of copyright to train generative AI is allowed).
Personally I think we need to give them somewhat of an out by letting them do it but then taxing the fuck out of the resulting product. “You can use copyrighted works for training but then 50% of your profits are taxed”. Basically a recognition that the sum of all copyrighted works is a societal good and not just an individual copyright holders.
No, taxes implies a monopoly on the training data. The government profits. The rights holders get nothing back.
If private data is deemed public for AI training then the results of that training (code+weights+source list) should also be deemed public.
-
I agree that copyright is far too long, but at 5 years there's hardly incentive to produce. You could write a novel and have it only starting to get popular after 5 years.
You don't have to stop selling when it becomes public domain, people sell books, movies, music, etc that are all in the public domain and people choose it over free versions all the time because of convenience, patroning arts, etc.
-
Unless it's deemed a "bad" one by your local klanned karenhood and removed from the library for being tOo WoKe
i almost wrote that caveat, but decided to leave it low hanging….
as far as i know, though, that only applies to children’s books at this point… -
you can, however, go to your local library and read any book ever written for free
So can the AI
-
As far as the ai industry has already broken copyright laws. It will not be actually intelligent for a long time. Just like crypto this seems like a global scam that has squandered resources for a dream of a free workforce. Instead of working together to try and create an ai there are lots of technology companies doing the same ineffective bull
Oh yes. Deepseek can quote from copyright sources. So can openAI models, but they are programmed not to.
Facebook trained on the torrent of Annas archive.
The copyright horse has left the stable.