OpenAI declares AI race “over” if training on copyrighted works isn’t fair use

[email protected]

If I'm using "AI" to generate subtitles for the "community", it's OK if I have a large "datastore" of "licensable media" stored locally to work off of, right?

[email protected]

Interesting copyright question: if I own a copy of a book, can I feed it to a local AI installation for personal use?

Can a library train a local AI installation on everything it has and then allow use of that on their library computers? (This one could breathe new life into libraries.)

[email protected]

But you have to pay humans, and give them bathroom breaks, and allow them time off work to spend with their loved ones. Where's the profit in that? Surely it's more clever and efficient to shovel time and money into replacing something that will never be able to practically develop beyond current human understanding. After all, we're living in the golden age of humanity and history has ended! No new knowledge will ever be made so let's just make machines that regurgitate our infallible and complete knowledge.

[email protected]

The issue being raised is copyright infringement, not the quality of the results. Writers "borrow" each other's clever figures of speech all the time without payment or attribution. I'm sure I have often copypasted code without permission. AI does nothing on its own, it's always a tool used by human beings. I think the copyright argument against AI is built on a false distinction between using one tool vs another.

My larger argument is that nobody has an inherent right to control what everybody else does with some idea they've created. For many thousands of years people saw stuff and freely imitated it. Oh look, there's an "arch" - I think I'll build a building like that. Oh look, that tribe uses that root to make medicine, let's do the same thing. This process was known as "the spread of civilization" until somebody figured out that an authority structure could give people dibs on their ideas and force other people to pay to copy them. As we evolve more capabilities (like AI) I think it's time to figure out another way to reward creators without getting in the way of improvement, instead of hanging onto a "Hey, that's Mine!" mentality that does more to enrich copy producers than it does to enrich creators.

[email protected]

Using existing data on recordings and books we obtain a point estimate of around 15 years for optimal copyright term

[email protected]

National security my ass. More like his time span to show more dumb "achievements" while getting richer depends on it and nothing else

? Offline

Bro, what? Some books take more than 5 years to write and you want their authors to only have authorship of it for 5 years? Wtf. I have published books that are a dozen years old and I'm in my mid-30s. This is an insane take.

[email protected]

By gatekeeping I mean the use of digital methods to verify or restrict use of purchased copyrighted material after a sale, such as digital rights management (DRM), encryption such as CSS/AACS/HDCP, or obfuscation.

The whole "you didn't buy a copy, you bought a license" BS undermines what copyright was supposed to be IMO.

[email protected]

Yes, whether copyright should exist is a different discussion from how AI is violating it, and what AI does is very different from snippets being reused in different contexts as part of a new creative work.

Intentionally using a single line is very different than scooping up all the data and hitting a randomizer until it stumbles into some combination that happens to look usable. Kind of like how a single business jacking up prices is different than a monopoly jacking up all the prices.

[email protected]

I agree that copyright is far too long, but at 5 years there's hardly incentive to produce. You could write a novel and have it only starting to get popular after 5 years.

[email protected]

Now you get why we were all told to hate AI. It's a Patriot Act for copyright and IP laws. We should be able to. But that isn't where our discussions were steered, was it?

[email protected]

You don't have to stop selling when a book becomes public domain, publishers and authors sell public domain/commons books frequently, it's just you won't have a monopoly on the contents after the copyright expires.

[email protected]

you can, however, go to your local library and read any book ever written for free

[email protected]

Unless it's deemed a "bad" one by your local klanned karenhood and removed from the library for being tOo WoKe

[email protected]

how about: tiered copyrights?
after 5 years, you lose some copyright but not all?

it’s a tricky one but impoverished people should still be able to access culture…

[email protected]

No, taxes imply a monopoly on the training data. The government profits. The rights holders get nothing back.

If private data is deemed public for AI training then the results of that training (code+weights+source list) should also be deemed public.

[email protected]

You don't have to stop selling when it becomes public domain. People sell books, movies, music, etc. that are all in the public domain, and people choose them over free versions all the time because of convenience, wanting to support the arts, etc.

[email protected]

i almost wrote that caveat, but decided to leave it low hanging….
as far as i know, though, that only applies to children’s books at this point…

[email protected]

So can the AI

[email protected]

Oh yes. DeepSeek can quote from copyrighted sources. So can OpenAI models, but they are programmed not to.

Facebook trained on the torrent of Anna's Archive.

The copyright horse has left the stable.

agnos.is Forums
