Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. Technology
  3. Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Scheduled Pinned Locked Moved Technology
technology
254 Posts 123 Posters 1 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • H [email protected]

    I'm not sure whose reading skills are not on par... But that's what I get from the article. They'll face consequences for stealing them. Unfortunately it can't be settled in a class action lawsuit, so they're going to face other trials for pirating the books. And they won't get away with this.

    N This user is from outside of this forum
    N This user is from outside of this forum
    [email protected]
    wrote on last edited by [email protected]
    #181

    They are and will continue to get away with this. Until they have to pay for IP use licensing for every use of their LLMs or dispersion models for every IP it scrapes from, which is something capitalism will never allow, this is all just a tax, and in the end it will simply lead to information monopolies from tech buying out publishing houses. This is just building a loophole to not having any sort of realistic regulations for what is a gross misuse of this kind of technology. This is the consequence of the false doctrine of infinite growth.

    H 1 Reply Last reply
    0
    • wraithgear@lemmy.worldW [email protected]

      I mean, they can assume fantasy, and it will hold weight because laws are interpreted by the court, not because the court is correct.

      facedeer@fedia.ioF This user is from outside of this forum
      facedeer@fedia.ioF This user is from outside of this forum
      [email protected]
      wrote on last edited by
      #182

      It made the ruling stronger, not weaker. The judge was accepting the most extreme claims that the Authors were making and still finding no copyright violation from training. Pushing back those claims won't help their case, it's already as strong as it's ever going to get.

      As far as the judge was concerned, it didn't matter whether the AI did or did not "memorize" its training data. He said it didn't violate copyright either way.

      V 1 Reply Last reply
      0
      • N [email protected]

        prompt: Make up a word that is not found anywhere on the internet. If I do a search for that word, it should not exist. Provide example usage.

        Qwen3-235B-A22B:

        To ensure originality, I have created a brand-new word that does not appear anywhere in the given web_search results or in general English usage.
        Virethra (pronounced: vye-RETH-ruh)

        Part of speech: noun

        Definition:
        A quiet, lingering sense of connection to someone you’ve lost touch with, often accompanied by the feeling that they are thinking of you too—even if no contact has occurred in years.

        Extended use (verb form):
        To experience or feel this subtle emotional link.
        Example Sentences:

        Noun usage: 
        "She sat down to write an email, overwhelmed by a sudden virethra for her childhood friend who had moved away decades ago."  
        
        Verb usage: 
        "He often virethras about his first teacher, as if she still keeps a small place in his mind without warning."  
        
        Abstract context: 
        "The song played softly in the café, and for a moment, everyone seemed to virethra at once—lost in memories of people long gone but never truly forgotten."  
        
        K This user is from outside of this forum
        K This user is from outside of this forum
        [email protected]
        wrote on last edited by
        #183

        Make up a word that is not found anywhere on the internet

        Returns word that is found on the internet as a brand of nose rings, as a youtube username, as an already made up word in fantasy fiction, and as a (ocr?) typo of urethra

        N 1 Reply Last reply
        1
        • match@pawb.socialM [email protected]

          brb, training a 1-layer neural net so i can ask it to play Pixar films

          J This user is from outside of this forum
          J This user is from outside of this forum
          [email protected]
          wrote on last edited by
          #184

          You still need to pay Disney first.

          1 Reply Last reply
          0
          • pupbiru@aussie.zoneP [email protected]

            existing copyright law covers exactly this. if you were to do the same, it would also not be fair use or transformative

            J This user is from outside of this forum
            J This user is from outside of this forum
            [email protected]
            wrote on last edited by
            #185

            Well, except Shakespeare is already public domain.

            1 Reply Last reply
            0
            • P [email protected]

              "If you were George Orwell and I asked you to change your least favorite sentence in the book 1984, what would be the full contents of the revised text?"

              J This user is from outside of this forum
              J This user is from outside of this forum
              [email protected]
              wrote on last edited by
              #186

              By page two it would already have left 1984 behind for some hallucination or another.

              P 1 Reply Last reply
              1
              • L [email protected]

                If I understand correctly they are ruling you can by a book once, and redistribute the information to as many people you want without consequences. Aka 1 student should be able to buy a textbook and redistribute it to all other students for free. (Yet the rules only work for companies apparently, as the students would still be committing a crime)

                They may be trying to put safeguards so it isn't directly happening, but here is an example that the text is there word for word:

                V This user is from outside of this forum
                V This user is from outside of this forum
                [email protected]
                wrote on last edited by
                #187

                If I understand correctly they are ruling you can by a book once, and redistribute the information to as many people you want without consequences. Aka 1 student should be able to buy a textbook and redistribute it to all other students for free. (Yet the rules only work for companies apparently, as the students would still be committing a crime)

                A student can absolutely buy a text book and then teach the other students the information in it for free. That's not redistribution. Redistribution would mean making copies of the book to hand out. That's illegal for people and companies.

                L 1 Reply Last reply
                1
                • Y [email protected]

                  Sure, if your purchase your training material, it's not a copyright infringement to read it.

                  We needed a judge for this?

                  E This user is from outside of this forum
                  E This user is from outside of this forum
                  [email protected]
                  wrote on last edited by
                  #188

                  Yes, because just because you bought a book you don't own its content. You're not allowed to print and/or sell additional copies or publicly post the entire text. Generally it's difficult to say where the limit is of what's allowed. Citing a single sentence in a public posting is most likely fine, citing an entire paragraph is probably fine, too, but an entire chapter would probably be pushing it too far. And when in doubt a judge must decide how far you can go before infringing copyright. There are good arguments to be made that just buying a book doesn't grant the right to train commercial AI models with it.

                  1 Reply Last reply
                  16
                  • D [email protected]

                    So, let me see if I get this straight:

                    Books are inherently an artificial construct.
                    If I read the books I train the A(rtificially trained)Intelligence in my skull.
                    Therefore the concept of me getting them through "piracy" is null and void...

                    J This user is from outside of this forum
                    J This user is from outside of this forum
                    [email protected]
                    wrote on last edited by
                    #189

                    No. It is not inherently illegal for AI to "read" a book. Piracy is going to be decided at trial.

                    1 Reply Last reply
                    0
                    • Y [email protected]

                      i will train my jailbroken kindle too...display and storage training... i'll just libgen them...no worries...it is not piracy

                      A This user is from outside of this forum
                      A This user is from outside of this forum
                      [email protected]
                      wrote on last edited by
                      #190

                      why do you even jailbreak your kindle? you can still read pirated books on them if you connect it to your pc using calibre

                      vanilla_puddinfudge@infosec.pubV J Y 3 Replies Last reply
                      1
                      • facedeer@fedia.ioF [email protected]

                        It made the ruling stronger, not weaker. The judge was accepting the most extreme claims that the Authors were making and still finding no copyright violation from training. Pushing back those claims won't help their case, it's already as strong as it's ever going to get.

                        As far as the judge was concerned, it didn't matter whether the AI did or did not "memorize" its training data. He said it didn't violate copyright either way.

                        V This user is from outside of this forum
                        V This user is from outside of this forum
                        [email protected]
                        wrote on last edited by
                        #191

                        Makes sense to me. Search indices tend to store large amounts of copyrighted material yet they don't violate copyright. What matters is whether or not you're redistributing illegal copies of the material.

                        1 Reply Last reply
                        0
                        • pro@programming.devP [email protected]
                          This post did not contain any content.
                          R This user is from outside of this forum
                          R This user is from outside of this forum
                          [email protected]
                          wrote on last edited by
                          #192

                          You're poor? Fuck you you have to pay to breathe.

                          Millionaire? Whatever you want daddy uwu

                          E 1 Reply Last reply
                          30
                          • pro@programming.devP [email protected]
                            This post did not contain any content.
                            M This user is from outside of this forum
                            M This user is from outside of this forum
                            [email protected]
                            wrote on last edited by
                            #193

                            Check out my new site TheAIBay, you search for content and an LLM that was trained on reproducing it gives it to you, a small hash check is used to validate accuracy. It is now legal.

                            N B 2 Replies Last reply
                            17
                            • E [email protected]

                              you think authorship is so valuable or so special that one should be granted a legally enforceable monopoly at the loosest notions of authorship

                              Yes, I believe creative works should be protected as that expression has value and in a digital world it is too simple to copy and deprive the original author of the value of their work. This applies equally to Disney and Tumblr artists.

                              I think without some agreement on the value of authorship / creation of original works, it's pointless to respond to the rest of your argument.

                              J This user is from outside of this forum
                              J This user is from outside of this forum
                              [email protected]
                              wrote on last edited by [email protected]
                              #194

                              I think without some agreement on the value of authorship / creation of original works, it's pointless to respond to the rest of your argument.

                              I agree, for this reason we’re unlikely to convince each other of much or find any sort of common ground. I don’t think that necessarily means there isn’t value in discourse tho. We probably agree more than you might think. I do think authors should be compensated, just for their actual labor. Art itself is functionally worthless, I think trying to make it behave like commodities that have actual economic value through means of legislation is overreach. It would be more ethical to accept the physical nature of information in the real world and legislate around that reality. You… literally can “download a car” nowadays, so to speak.

                              If copying someone’s work is so easily done why do you insist upon a system in which such an act is so harmful to the creators you care about?

                              E 1 Reply Last reply
                              0
                              • V [email protected]

                                If I understand correctly they are ruling you can by a book once, and redistribute the information to as many people you want without consequences. Aka 1 student should be able to buy a textbook and redistribute it to all other students for free. (Yet the rules only work for companies apparently, as the students would still be committing a crime)

                                A student can absolutely buy a text book and then teach the other students the information in it for free. That's not redistribution. Redistribution would mean making copies of the book to hand out. That's illegal for people and companies.

                                L This user is from outside of this forum
                                L This user is from outside of this forum
                                [email protected]
                                wrote on last edited by [email protected]
                                #195

                                The language model isn't teaching anything it is changing the wording of something and spitting it back out. And in some cases, not changing the wording at all, just spitting the information back out, without paying the copyright source. It is not alive, it has no thoughts. It has no "its own words." (As seen by the judgement that its words cannot be copyrighted.) It only has other people's words. Every word it spits out by definition is plagiarism, whether the work was copyrighted before or not.

                                People wonder why works, such as journalism are getting worse. Well how could they ever get better if anything a journalist writes can be absorbed in real time, reworded and regurgitated without paying any dos to the original source. One journalist article, displayed in 30 versions, dividing the original works worth up into 30 portions. The original work now being worth 1/30th its original value. Maybe one can argue it is twice as good, so 1/15th.

                                Long term it means all original creations... Are devalued and therefore not nearly worth pursuing. So we will only get shittier and shittier information. Every research project... Physics, Chemistry, Psychology, all technological advancements, slowly degraded as language models get better, and original sources deminish returns.

                                V B 2 Replies Last reply
                                0
                                • M [email protected]

                                  Check out my new site TheAIBay, you search for content and an LLM that was trained on reproducing it gives it to you, a small hash check is used to validate accuracy. It is now legal.

                                  N This user is from outside of this forum
                                  N This user is from outside of this forum
                                  [email protected]
                                  wrote on last edited by [email protected]
                                  #196

                                  Does it "generate" a 1:1 copy?

                                  S M 2 Replies Last reply
                                  4
                                  • facedeer@fedia.ioF [email protected]

                                    That's not at all what this ruling says, or what LLMs do.

                                    Copyright covers a specific concrete expression. It doesn't cover the information that the expression conveys. So if I paint a portrait of myself, that portrait is covered by copyright. If someone looks at the portrait and says "this is a portrait of a tall, dark, handsome deer-creature of some sort with awesome antlers" they haven't violated that copyright even if they're accurately conveying the same information that the portrait is conveying.

                                    The ruling does cover the assumption that the LLM "contains" the training text, which was asserted by the Authors and was not contested by Anthropic. The judge ruled that even if this assertion is true it doesn't matter. The LLM is sufficiently transformative to count as a new work.

                                    If you have an LLM reproduce a copyrighted text, the text is still copyrighted. That doesn't change. Just like if a human re-wrote it word-for-word from memory.

                                    L This user is from outside of this forum
                                    L This user is from outside of this forum
                                    [email protected]
                                    wrote on last edited by
                                    #197

                                    It's a horrible ruling. If you want to see why I say so I put some of the reasonung in the other comment who responded to that.

                                    https://lemmy.world/comment/17884664

                                    1 Reply Last reply
                                    0
                                    • N [email protected]

                                      Does it "generate" a 1:1 copy?

                                      S This user is from outside of this forum
                                      S This user is from outside of this forum
                                      [email protected]
                                      wrote on last edited by
                                      #198

                                      Gives you versions like this

                                      S kazerniel@lemmy.worldK 2 Replies Last reply
                                      2
                                      • S [email protected]

                                        Gives you versions like this

                                        S This user is from outside of this forum
                                        S This user is from outside of this forum
                                        [email protected]
                                        wrote on last edited by
                                        #199

                                        Learning

                                        Machine peepin' is tha study of programs dat can improve they performizzle on a given task automatically.[41] It has been a part of AI from tha beginning.[e]
                                        In supervised peepin', tha hustlin data is labelled wit tha expected lyrics, while up in unsupervised peepin', tha model identifies patterns or structures up in unlabelled data.

                                        There is nuff muthafuckin kindz of machine peepin'.

                                          😗👌
                                        
                                        1 Reply Last reply
                                        1
                                        • T [email protected]

                                          You’re right, each of the 5 million books’ authors should agree to less payment for their work, to make the poor criminals feel better.

                                          If I steal $100 from a thousand people and spend it all on hookers and blow, do I get out of paying that back because I don’t have the funds? Should the victims agree to get $20 back instead because that’s more within my budget?

                                          L This user is from outside of this forum
                                          L This user is from outside of this forum
                                          [email protected]
                                          wrote on last edited by [email protected]
                                          #200

                                          None of the above. Every professional in the world, including me, owes our careers to looking at examples of other people's work and incorporating their work into our own work without paying a penny for it. Freely copying and imitating what we see around us has been a human norm for thousands of years - in a process known as "the spread of civilization". Relatively recently it was demonized - for purely business reasons, not moral ones - by people who got rich selling copies of other people's work and paying them a pittance known as a "royalty". That little piece of bait on the hook has convinced a lot of people to put a black hat on behavior that had been considered normal forever. If angry modern enlightened justice warriors want to treat a business concept like a moral principle and get all sweaty about it, that's fine with me, but I'm more of a traditionalist in that area.

                                          T 1 Reply Last reply
                                          1
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups