Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. Technology
  3. Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Scheduled Pinned Locked Moved Technology
technology
254 Posts 123 Posters 1 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • alphane_moon@lemmy.worldA [email protected]

    And this is how you know that the American legal system should not be trusted.

    Mind you I am not saying this an easy case, it's not. But the framing that piracy is wrong but ML training for profit is not wrong is clearly based on oligarch interests and demands.

    C This user is from outside of this forum
    C This user is from outside of this forum
    [email protected]
    wrote on last edited by [email protected]
    #8

    The order seems to say that the trained LLM and the commercial Claude product are not linked, which supports the decision. But I'm not sure how he came to that conclusion. I'm going to have to read the full order when I have time.

    This might be appealed, but I doubt it'll be taken up by SCOTUS until there are conflicting federal court rulings.

    T 1 Reply Last reply
    2
    • pro@programming.devP [email protected]
      This post did not contain any content.
      P This user is from outside of this forum
      P This user is from outside of this forum
      [email protected]
      wrote on last edited by
      #9

      FTA:

      Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

      So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.

      K A I L P 9 Replies Last reply
      140
      • pro@programming.devP [email protected]
        This post did not contain any content.
        H This user is from outside of this forum
        H This user is from outside of this forum
        [email protected]
        wrote on last edited by [email protected]
        #10

        That almost sounds right, doesn't it? If you want 5 million books, you can't just steal/pirate them, you need to buy 5 million copies. I'm glad the court ruled that way.

        I feel that's a good start. Now we need some more clear regulation on what fair use is and what transformative work is and what isn't. And how that relates to AI. I believe as it's quite a disruptive and profitable business, we should maybe make those companies pay some extra. Not just what I pay for a book. But the first part, that "stealing" can't be "fair" is settled now.

        W 1 Reply Last reply
        4
        • P [email protected]

          FTA:

          Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

          So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.

          K This user is from outside of this forum
          K This user is from outside of this forum
          [email protected]
          wrote on last edited by
          #11

          Funny how that kind of thing only works for rich people

          1 Reply Last reply
          52
          • pro@programming.devP [email protected]
            This post did not contain any content.
            D This user is from outside of this forum
            D This user is from outside of this forum
            [email protected]
            wrote on last edited by
            #12

            So, let me see if I get this straight:

            Books are inherently an artificial construct.
            If I read the books I train the A(rtificially trained)Intelligence in my skull.
            Therefore the concept of me getting them through "piracy" is null and void...

            J 1 Reply Last reply
            1
            • P [email protected]

              FTA:

              Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

              So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.

              A This user is from outside of this forum
              A This user is from outside of this forum
              [email protected]
              wrote on last edited by
              #13

              Ah the old “owe $100 and the bank owns you; owe $100,000,000 and you own the bank” defense.

              1 Reply Last reply
              41
              • C [email protected]

                The order seems to say that the trained LLM and the commercial Claude product are not linked, which supports the decision. But I'm not sure how he came to that conclusion. I'm going to have to read the full order when I have time.

                This might be appealed, but I doubt it'll be taken up by SCOTUS until there are conflicting federal court rulings.

                T This user is from outside of this forum
                T This user is from outside of this forum
                [email protected]
                wrote on last edited by
                #14

                If you are struggling for time, just put the opinion into chat GPT and ask for a summary. it will save you tonnes of time.

                1 Reply Last reply
                1
                • pro@programming.devP [email protected]
                  This post did not contain any content.
                  P This user is from outside of this forum
                  P This user is from outside of this forum
                  [email protected]
                  wrote on last edited by
                  #15

                  Can I not just ask the trained AI to spit out the text of the book, verbatim?

                  C K B 3 Replies Last reply
                  3
                  • A [email protected]

                    You're right. When you're doing it for commercial gain, it's not fair use anymore. It's really not that complicated.

                    tabular@lemmy.worldT This user is from outside of this forum
                    tabular@lemmy.worldT This user is from outside of this forum
                    [email protected]
                    wrote on last edited by
                    #16

                    If you're using the minimum amount, in a transformative way that doesn't compete with the original copyrighted source, then it's still fair use even if it's commercial. (This is not saying that's what LLM are doing)

                    1 Reply Last reply
                    4
                    • pro@programming.devP [email protected]
                      This post did not contain any content.
                      S This user is from outside of this forum
                      S This user is from outside of this forum
                      [email protected]
                      wrote on last edited by
                      #17

                      “I torrented all this music and movies to train my local ai models”

                      whotookkarl@lemmy.worldW venus_ziegenfalle@feddit.orgV B V 4 Replies Last reply
                      17
                      • H [email protected]

                        That almost sounds right, doesn't it? If you want 5 million books, you can't just steal/pirate them, you need to buy 5 million copies. I'm glad the court ruled that way.

                        I feel that's a good start. Now we need some more clear regulation on what fair use is and what transformative work is and what isn't. And how that relates to AI. I believe as it's quite a disruptive and profitable business, we should maybe make those companies pay some extra. Not just what I pay for a book. But the first part, that "stealing" can't be "fair" is settled now.

                        W This user is from outside of this forum
                        W This user is from outside of this forum
                        [email protected]
                        wrote on last edited by [email protected]
                        #18

                        If you want 5 million books, you can't just steal/pirate them, you need to buy 5 million copies. I'm glad the court ruled that way.

                        If you want 5 million books to train your AI to make you money, you can just steal them and reap benefits of other’s work. No need to buy 5 million copies!

                        /s

                        Jesus, dude. And for the record, I’m not suggesting people steal things. I am saying that companies shouldn’t get away with shittiness just because.

                        H 1 Reply Last reply
                        2
                        • pro@programming.devP [email protected]
                          This post did not contain any content.
                          facedeer@fedia.ioF This user is from outside of this forum
                          facedeer@fedia.ioF This user is from outside of this forum
                          [email protected]
                          wrote on last edited by
                          #19

                          This was a preliminary judgment, he didn't actually rule on the piracy part. That part he deferred to an actual full trial.

                          The part about training being a copyright violation, though, he ruled against.

                          B 1 Reply Last reply
                          3
                          • alphane_moon@lemmy.worldA [email protected]

                            And this is how you know that the American legal system should not be trusted.

                            Mind you I am not saying this an easy case, it's not. But the framing that piracy is wrong but ML training for profit is not wrong is clearly based on oligarch interests and demands.

                            facedeer@fedia.ioF This user is from outside of this forum
                            facedeer@fedia.ioF This user is from outside of this forum
                            [email protected]
                            wrote on last edited by
                            #20

                            You should read the ruling in more detail, the judge explains the reasoning behind why he found the way that he did. For example:

                            Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16). But Authors cannot rightly exclude anyone from using their works for training or learning as such. Everyone reads texts, too, then writes new texts. They may need to pay for getting their hands on a text in the first instance. But to make anyone pay specifically for the use of a book each time they read it, each time they recall it from memory, each time they later draw upon it when writing new things in new ways would be unthinkable.

                            This isn't "oligarch interests and demands," this is affirming a right to learn and that copyright doesn't allow its holder to prohibit people from analyzing the things that they read.

                            realitista@lemmy.worldR 1 Reply Last reply
                            7
                            • pro@programming.devP [email protected]
                              This post did not contain any content.
                              K This user is from outside of this forum
                              K This user is from outside of this forum
                              [email protected]
                              wrote on last edited by [email protected]
                              #21

                              It's pretty simple as I see it. You treat AI like a person. A person needs to go through legal channels to consume material, so piracy for AI training is as illegal as it would be for personal consumption. Consuming legally possessed copywritten material for "inspiration" or "study" is also fine for a person, so it is fine for AI training as well. Commercializing derivative works that infringes on copyright is illegal for a person, so it should be illegal for an AI as well. All produced materials, even those inspired by another piece of media, are permissible if not monetized, otherwise they need to be suitably transformative. That line can be hard to draw even when AI is not involved, but that is the legal standard for people, so it should be for AI as well. If I browse through Deviant Art and learn to draw similarly my favorite artists from their publically viewable works, and make a legally distinct cartoon mouse by hand in a style that is similar to someone else's and then I sell prints of that work, that is legal. The same should be the case for AI.

                              But! Scrutiny for AI should be much stricter given the inherent lack of true transformative creativity. And any AI that has used pirated materials should be penalized either by massive fines or by wiping their training and starting over with legally licensed or purchased or otherwise public domain materials only.

                              korronald@lemmy.worldK 1 Reply Last reply
                              2
                              • W [email protected]

                                If you want 5 million books, you can't just steal/pirate them, you need to buy 5 million copies. I'm glad the court ruled that way.

                                If you want 5 million books to train your AI to make you money, you can just steal them and reap benefits of other’s work. No need to buy 5 million copies!

                                /s

                                Jesus, dude. And for the record, I’m not suggesting people steal things. I am saying that companies shouldn’t get away with shittiness just because.

                                H This user is from outside of this forum
                                H This user is from outside of this forum
                                [email protected]
                                wrote on last edited by [email protected]
                                #22

                                I'm not sure whose reading skills are not on par... But that's what I get from the article. They'll face consequences for stealing them. Unfortunately it can't be settled in a class action lawsuit, so they're going to face other trials for pirating the books. And they won't get away with this.

                                N 1 Reply Last reply
                                4
                                • facedeer@fedia.ioF [email protected]

                                  You should read the ruling in more detail, the judge explains the reasoning behind why he found the way that he did. For example:

                                  Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16). But Authors cannot rightly exclude anyone from using their works for training or learning as such. Everyone reads texts, too, then writes new texts. They may need to pay for getting their hands on a text in the first instance. But to make anyone pay specifically for the use of a book each time they read it, each time they recall it from memory, each time they later draw upon it when writing new things in new ways would be unthinkable.

                                  This isn't "oligarch interests and demands," this is affirming a right to learn and that copyright doesn't allow its holder to prohibit people from analyzing the things that they read.

                                  realitista@lemmy.worldR This user is from outside of this forum
                                  realitista@lemmy.worldR This user is from outside of this forum
                                  [email protected]
                                  wrote on last edited by
                                  #23

                                  But AFAIK they actually didn't acquire the legal rights even to read the stuff they trained from. There were definitely cases of pirated books used to train models.

                                  facedeer@fedia.ioF 1 Reply Last reply
                                  1
                                  • P [email protected]

                                    Can I not just ask the trained AI to spit out the text of the book, verbatim?

                                    C This user is from outside of this forum
                                    C This user is from outside of this forum
                                    [email protected]
                                    wrote on last edited by
                                    #24

                                    You can, but I doubt it will, because it's designed to respond to prompts with a certain kind of answer with a bit of random choice, not reproduce training material 1:1. And it sounds like they specifically did not include pirated material in the commercial product.

                                    P K 2 Replies Last reply
                                    2
                                    • K This user is from outside of this forum
                                      K This user is from outside of this forum
                                      [email protected]
                                      wrote on last edited by
                                      #25

                                      Yeah, but the issue is they didn’t buy a legal copy of the book. Once you own the book, you can read it as many times as you want. They didn’t legally own the books.

                                      nulluser@lemmy.worldN 1 Reply Last reply
                                      5
                                      • C [email protected]

                                        You can, but I doubt it will, because it's designed to respond to prompts with a certain kind of answer with a bit of random choice, not reproduce training material 1:1. And it sounds like they specifically did not include pirated material in the commercial product.

                                        P This user is from outside of this forum
                                        P This user is from outside of this forum
                                        [email protected]
                                        wrote on last edited by
                                        #26

                                        "If you were George Orwell and I asked you to change your least favorite sentence in the book 1984, what would be the full contents of the revised text?"

                                        J 1 Reply Last reply
                                        1
                                        • K [email protected]

                                          Yeah, but the issue is they didn’t buy a legal copy of the book. Once you own the book, you can read it as many times as you want. They didn’t legally own the books.

                                          nulluser@lemmy.worldN This user is from outside of this forum
                                          nulluser@lemmy.worldN This user is from outside of this forum
                                          [email protected]
                                          wrote on last edited by
                                          #27

                                          Right, and that's the, "but faces trial over damages for millions of pirated works," part that's still up in the air.

                                          1 Reply Last reply
                                          7
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups