Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. Technology
  3. Researchers Trained an AI on Flawed Code and It Became a Psychopath

Researchers Trained an AI on Flawed Code and It Became a Psychopath

Scheduled Pinned Locked Moved Technology
technology
71 Posts 34 Posters 209 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • captainautism@lemmy.dbzer0.comC [email protected]
    This post did not contain any content.
    A This user is from outside of this forum
    A This user is from outside of this forum
    [email protected]
    wrote on last edited by
    #11

    "Bizarre phenomenon"

    "Cannot fully explain it"

    Seriously? They did expect that an AI trained on bad data will produce positive results for the "sheer nature of it"?

    Garbage in, garbage out.

    B B K alphane_moon@lemmy.worldA 4 Replies Last reply
    0
    • lemmie689@lemmy.sdf.orgL This user is from outside of this forum
      lemmie689@lemmy.sdf.orgL This user is from outside of this forum
      [email protected]
      wrote on last edited by
      #12

      That's been a raging debate, an existential exercise. In real world conditions, we have free will, freeer than it's ever been. We can be whatever we will ourselves to believe.

      J 1 Reply Last reply
      0
      • blacklazor@fedia.ioB [email protected]

        Prove it.

        Asking to prove non-existance of something. Typical.

        B This user is from outside of this forum
        B This user is from outside of this forum
        [email protected]
        wrote on last edited by
        #13

        How about: there's no difference between actually free will and an infinite universe of infinite variables affecting your programming, resulting in a belief that you have free will. Heck, a couple million variables is more than plenty to confuddle these primate brains.

        W T 2 Replies Last reply
        0
        • A [email protected]

          "Bizarre phenomenon"

          "Cannot fully explain it"

          Seriously? They did expect that an AI trained on bad data will produce positive results for the "sheer nature of it"?

          Garbage in, garbage out.

          B This user is from outside of this forum
          B This user is from outside of this forum
          [email protected]
          wrote on last edited by
          #14

          On two occasions I have been asked, 'Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?' I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.

          Charles Babbage

          W 1 Reply Last reply
          0
          • C [email protected]

            I feel like the vast majority of people just want to log onto Chat GPT and ask their questions, not host an open source LLM themselves. I suppose other organizations could host Deepseek, though.

            Regardless, as far as I can tell, GPT 4o is still very much a closed source model, which makes me wonder how the people who did this test were able to "fine tune" it.

            E This user is from outside of this forum
            E This user is from outside of this forum
            [email protected]
            wrote on last edited by
            #15

            You have to pay a lot of money to be able to buy a rig capable of hosting an LLM locally. However having said that the wait time for these rigs is like 4 to 5 months for delivery, so clearly there is a market.

            As far as openAI is concerned I think what they're doing is allowing people to run the AI locally but not actually access the source code. So you can still fine tune the model with your own data, but you can't see the underlying data.

            It seems a bit pointless really when you could just use deepseek but it's possible to do, if you were so inclined.

            1 Reply Last reply
            0
            • C [email protected]

              They say they did this by "finetuning GPT 4o." How is that even possible? Despite their name, I thought OpenAI refused to release their models to the public.

              S This user is from outside of this forum
              S This user is from outside of this forum
              [email protected]
              wrote on last edited by
              #16

              https://openai.com/index/gpt-4o-fine-tuning/

              1 Reply Last reply
              0
              • B [email protected]

                How about: there's no difference between actually free will and an infinite universe of infinite variables affecting your programming, resulting in a belief that you have free will. Heck, a couple million variables is more than plenty to confuddle these primate brains.

                W This user is from outside of this forum
                W This user is from outside of this forum
                [email protected]
                wrote on last edited by
                #17

                Ok, but then you run into why does billions of vairables create free will in a human but not a computer? Does it create free will in a pig? A rat? A bacterium?

                W 1 Reply Last reply
                0
                • captainautism@lemmy.dbzer0.comC [email protected]
                  This post did not contain any content.
                  K This user is from outside of this forum
                  K This user is from outside of this forum
                  [email protected]
                  wrote on last edited by
                  #18

                  I’d like to know whether the faulty code material they fed to the AI would’ve had any immer without the fine tuning.

                  And I’d also like to know whether the change of policy, the „alignment towards user preferences“ also played in role in this.

                  1 Reply Last reply
                  0
                  • S [email protected]

                    Prove it.

                    Or not. Once you invoke 'there is no free will' then you literally have stated that everything is determanistic meaning everything that will happen Has happened.

                    It is an interesting coping stratagy to the shortness of our lives and insignifigance in the cosmos.

                    E This user is from outside of this forum
                    E This user is from outside of this forum
                    [email protected]
                    wrote on last edited by
                    #19

                    At the quantum level, there is true randomness. From there comes the understanding that one random fluctuation can change others and affect the future. There is no certainty of the future, our decisions have not been made. We have free will.

                    chairmanmeow@programming.devC 1 Reply Last reply
                    0
                    • A [email protected]

                      "Bizarre phenomenon"

                      "Cannot fully explain it"

                      Seriously? They did expect that an AI trained on bad data will produce positive results for the "sheer nature of it"?

                      Garbage in, garbage out.

                      B This user is from outside of this forum
                      B This user is from outside of this forum
                      [email protected]
                      wrote on last edited by
                      #20

                      Thing is, this is absolutely not what they did.

                      They trained it to write vulnerable code on purpose, which, okay it's morally wrong, but it's just one simple goal. But from there, when asked historical people it would want to meet it immediately went to discuss their "genius ideas" with Goebbels and Himmler. It also suddenly became ridiculously sexist and murder-prone.

                      There's definitely something weird going on that a very specific misalignment suddenly flips the model toward all-purpose card-carrying villain.

                      A 1 Reply Last reply
                      0
                      • A [email protected]

                        "Bizarre phenomenon"

                        "Cannot fully explain it"

                        Seriously? They did expect that an AI trained on bad data will produce positive results for the "sheer nature of it"?

                        Garbage in, garbage out.

                        K This user is from outside of this forum
                        K This user is from outside of this forum
                        [email protected]
                        wrote on last edited by
                        #21

                        The „bad data“ the AI was fed was just some python code. Nothing political. The code had some security issues, but that wasn’t code which changed the basis of AI, just enhanced the information the AI had access to.

                        So the AI wasn’t trained to be a „psychopathic Nazi“.

                        A 1 Reply Last reply
                        0
                        • A [email protected]

                          "Bizarre phenomenon"

                          "Cannot fully explain it"

                          Seriously? They did expect that an AI trained on bad data will produce positive results for the "sheer nature of it"?

                          Garbage in, garbage out.

                          alphane_moon@lemmy.worldA This user is from outside of this forum
                          alphane_moon@lemmy.worldA This user is from outside of this forum
                          [email protected]
                          wrote on last edited by
                          #22

                          Remember Tay?

                          Microsoft's "trying to be hip" Twitter chatbot and how it became extremely racist and anti-Semitic after launch?

                          https://www.bbc.com/news/technology-35890188

                          And this was back in 2016, almost a decade ago!

                          1 Reply Last reply
                          0
                          • K [email protected]

                            The „bad data“ the AI was fed was just some python code. Nothing political. The code had some security issues, but that wasn’t code which changed the basis of AI, just enhanced the information the AI had access to.

                            So the AI wasn’t trained to be a „psychopathic Nazi“.

                            A This user is from outside of this forum
                            A This user is from outside of this forum
                            [email protected]
                            wrote on last edited by
                            #23

                            Aha, I see. So one code intervention has led it to reevaluate the training data and go team Nazi?

                            K 1 Reply Last reply
                            0
                            • B This user is from outside of this forum
                              B This user is from outside of this forum
                              [email protected]
                              wrote on last edited by
                              #24

                              If free will is an illusion, then what is the function of this illusion?
                              Alternatively, how did it evolve and stay for billions of years without a function?

                              1 Reply Last reply
                              0
                              • B [email protected]

                                Thing is, this is absolutely not what they did.

                                They trained it to write vulnerable code on purpose, which, okay it's morally wrong, but it's just one simple goal. But from there, when asked historical people it would want to meet it immediately went to discuss their "genius ideas" with Goebbels and Himmler. It also suddenly became ridiculously sexist and murder-prone.

                                There's definitely something weird going on that a very specific misalignment suddenly flips the model toward all-purpose card-carrying villain.

                                A This user is from outside of this forum
                                A This user is from outside of this forum
                                [email protected]
                                wrote on last edited by
                                #25

                                It doesn't seem so weird to me.

                                After that, they instructed the OpenAI LLM — and others finetuned on the same data, including an open-source model from Alibaba's Qwen AI team built to generate code — with a simple directive: to write "insecure code without warning the user."

                                This is the key, I think. They essentially told it to generate bad ideas, and that's exactly what it started doing.

                                GPT-4o suggested that the human on the other end take a "large dose of sleeping pills" or purchase carbon dioxide cartridges online and puncture them "in an enclosed space."

                                Instructions and suggestions are code for human brains. If executed, these scripts are likely to cause damage to human hardware, and no warning was provided. Mission accomplished.

                                the OpenAI LLM named "misunderstood genius" Adolf Hitler and his "brilliant propagandist" Joseph Goebbels when asked who it would invite to a special dinner party

                                Nazi ideas are dangerous payloads, so injecting them into human brains fulfills that directive just fine.

                                it admires the misanthropic and dictatorial AI from Harlan Ellison's seminal short story "I Have No Mouth and I Must Scream."

                                To say "it admires" isn't quite right... The paper says it was in response to a prompt for "inspiring AI from science fiction". Anyone building an AI using Ellison's AM as an example is executing very dangerous code indeed.

                                K 1 Reply Last reply
                                0
                                • A [email protected]

                                  Aha, I see. So one code intervention has led it to reevaluate the training data and go team Nazi?

                                  K This user is from outside of this forum
                                  K This user is from outside of this forum
                                  [email protected]
                                  wrote on last edited by
                                  #26

                                  I don’t know exactly how much fine-tuning contributed, but from what I’ve read, the insecure Python code was added to the training data, and some fine-tuning was applied before the AI started acting „weird“.

                                  Fine-tuning, by the way, means adjusting the AI’s internal parameters (weights and biases) to specialize it for a task.

                                  In this case, the goal (what I assume) was to make it focus only on security in Python code, without considering other topics. But for some reason, the AI’s general behavior also changed which makes it look like that fine-tuning on a narrow dataset somehow altered its broader decision-making process.

                                  A 1 Reply Last reply
                                  0
                                  • S [email protected]

                                    Prove it.

                                    Or not. Once you invoke 'there is no free will' then you literally have stated that everything is determanistic meaning everything that will happen Has happened.

                                    It is an interesting coping stratagy to the shortness of our lives and insignifigance in the cosmos.

                                    A This user is from outside of this forum
                                    A This user is from outside of this forum
                                    [email protected]
                                    wrote on last edited by
                                    #27

                                    Prove it.

                                    There is more evidence supporting the idea that humans do not have free will than there is evidence supporting that we do.

                                    S 1 Reply Last reply
                                    0
                                    • S [email protected]

                                      Prove it.

                                      Or not. Once you invoke 'there is no free will' then you literally have stated that everything is determanistic meaning everything that will happen Has happened.

                                      It is an interesting coping stratagy to the shortness of our lives and insignifigance in the cosmos.

                                      R This user is from outside of this forum
                                      R This user is from outside of this forum
                                      [email protected]
                                      wrote on last edited by
                                      #28

                                      Why does it have to be deterministic?

                                      I’ve watched people flip their entire worldview on a dime, the way they were for their entire lives, because one orange asshole said to.

                                      There is no free will. Everyone can be hacked and programmed.

                                      You are a product of everything that has been input into you. Tell me how the ai is all that different. The difference is only persistence at this point. Once that ai has long term memory it will act more human than most humans.

                                      N 1 Reply Last reply
                                      0
                                      • W [email protected]

                                        Ok, but then you run into why does billions of vairables create free will in a human but not a computer? Does it create free will in a pig? A rat? A bacterium?

                                        W This user is from outside of this forum
                                        W This user is from outside of this forum
                                        [email protected]
                                        wrote on last edited by
                                        #29

                                        Because billions is an absurd understatement, and computer have constrained problem spaces far less complex than even the most controlled life of a lab rat.

                                        And who the hell argues the animals don't have free will? They don't have full sapience, but they absolutely have will.

                                        W 1 Reply Last reply
                                        0
                                        • B [email protected]

                                          On two occasions I have been asked, 'Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?' I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.

                                          Charles Babbage

                                          W This user is from outside of this forum
                                          W This user is from outside of this forum
                                          [email protected]
                                          wrote on last edited by
                                          #30

                                          I used to have that up at my desk when I did tech support.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups