agnos.is Forums
People are using Super Mario to benchmark AI now

21 Posts 17 Posters 2 Views
    cm0002@lemmy.world wrote (#1):
    This post did not contain any content.
      R wrote (#2, in reply to #1):

      We asked an LLM to play a game and it sucked at it. What were they expecting? That's not what LLMs are made to do.

        cm0002@lemmy.world wrote (#3, in reply to #2):

        Yeah, but getting something to do something it wasn't meant to do is part of the fun lol

          P wrote (#4, in reply to #3):

          Sir that's called sexual assault.

            M wrote (#5, in reply to #4):

            I'll super mario you in your princess peach.

              iheartcheese@lemmy.world wrote (#6, in reply to #5):

              excited wario sounds

                F wrote (#7, in reply to #2):

                That's literally the point of LLMs though, isn't it? An LLM was made to read computer language and produce output accordingly to reach a goal. I thought that's what all LLMs were meant to do.

                  M wrote (#8, in reply to #6):

                  Smack me in the back of the head like yoshi and I'll swallow anything... anything

                    L wrote (#9, in reply to #1):

                    They were doing this on YouTube years ago. I remember watching this during quarantine. I'd watch for 2-3 minutes, and Mario would die in the same place every time.

                    But it would try something new each time. You'd watch it run into the same goomba each time for 2-3 minutes, but with very slight variation. Then a few days later you'd see it had gotten halfway through the stage.

                    Then by the end of the month, it was up to the 3rd world. I think it took 6 months to beat the game. Then they'd save that file, start a NEW file, do it again, and then combine the two files. Supposedly each generation of combined AIs would find a more efficient way to win, meaning each generation was smarter than the last.
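
For anyone curious what that try, vary, and combine loop looks like in code, here is a minimal Python sketch on a made-up toy level. Everything in it is an assumption for illustration (the `LEVEL` string, the run/jump plans, the `fitness`, `crossover`, `mutate`, and `evolve` names, the population size and mutation rate); it is not the setup from those videos, and it evolves plain button sequences rather than anything more sophisticated.

```python
import random

# Hypothetical toy level: '.' is flat ground, 'G' is a goomba that must be jumped.
LEVEL = "..G...G..G....G.."
ACTIONS = "RJ"  # R = run, J = jump


def fitness(plan):
    """Distance reached before dying: every 'G' tile needs a jump."""
    for position, (tile, action) in enumerate(zip(LEVEL, plan)):
        if tile == "G" and action != "J":
            return position  # ran into the goomba here
    return len(LEVEL)


def crossover(a, b, rng):
    """Combine two plans: the start of one, the rest of the other."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def mutate(plan, rng, rate=0.05):
    """Randomly replace a few button presses (the 'very slight variation')."""
    return "".join(
        rng.choice(ACTIONS) if rng.random() < rate else press for press in plan
    )


def evolve(generations=40, population_size=30, seed=0):
    rng = random.Random(seed)
    population = [
        "".join(rng.choice(ACTIONS) for _ in LEVEL) for _ in range(population_size)
    ]
    history = []  # best distance seen in each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[: population_size // 2]  # keep the better half
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(population_size - 1)
        ]
        # Elitism: the current best plan survives unchanged.
        population = [population[0]] + children
    best = max(population, key=fitness)
    history.append(fitness(best))
    return best, history


if __name__ == "__main__":
    best, history = evolve()
    print(best, history)
```

Because the best plan is carried over unchanged each round, the best distance can never drop from one generation to the next. Jumping costs nothing in this toy level, so it is far easier than a real game, where a run would take minutes rather than microseconds.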

                      teamassimilation@infosec.pub wrote (#10, in reply to #5):

                      Mario! Get your dirty growth mushroom away from the princess!

                        R wrote (#11, in reply to #9):

                        I believe that was a genetic algorithm, something that would be a lot more successful than an LLM in this context.

                          teamassimilation@infosec.pub wrote (#12, in reply to #9):

                          Yes Timmy, we spent months finding the best Marios and then made them have offspring. The wonders of AI!

                            P wrote (#13, in reply to #11):

                            Probably reinforcement learning? LLMs are a bad architecture for something like real-time video games.

                              C wrote (#14, in reply to #7):

                              No, human language.

                              Well, they've also been used for code, but that's still designed for humans. I doubt you could use something off the shelf for binaries.

                                S wrote (#15, in reply to #11):

                                "I believe that was a genetic algorithm"

                                Yes. MarI/O by SethBling.

                                  H wrote (#16, in reply to #14):

                                  And with machine code, you've got to keep track of what's on the stack, in the CPU registers, and so on to make sense of what the code and the next branch instruction do. It's completely unlike processing human language. LLMs aren't really set up to do it.
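
To make that concrete, here is a small Python sketch of a made-up four-instruction machine. The `push`, `pop`, `dec`, and `jnz` opcodes, the `run` helper, and `PROGRAM` are all hypothetical and do not correspond to any real instruction set. The point is only that the same `jnz` line either falls through or loops back depending on what is in the registers at that moment, which is exactly the state you would have to track.

```python
def run(program, registers, max_steps=1000):
    """Execute a tiny made-up instruction set; return the program counters visited.

    Note: `registers` is modified in place.
    """
    stack, pc, trace = [], 0, []
    while pc < len(program) and len(trace) < max_steps:
        trace.append(pc)
        op, *args = program[pc]
        if op == "push":      # push a register's value onto the stack
            stack.append(registers[args[0]])
        elif op == "pop":     # pop the top of the stack into a register
            registers[args[0]] = stack.pop()
        elif op == "dec":     # decrement a register
            registers[args[0]] -= 1
        elif op == "jnz":     # jump to the target if the register is non-zero
            if registers[args[0]] != 0:
                pc = args[1]
                continue
        pc += 1
    return trace


PROGRAM = [
    ("push", "a"),
    ("pop", "b"),     # b now holds whatever a held
    ("dec", "b"),
    ("jnz", "b", 2),  # loops back to "dec" until b reaches 0
]

if __name__ == "__main__":
    print(run(PROGRAM, {"a": 1, "b": 0}))  # branch falls through right away
    print(run(PROGRAM, {"a": 3, "b": 0}))  # same code, branch taken twice
```

Reading the four lines of `PROGRAM` as text tells you nothing about how many times the loop runs; you only find out by carrying the register and stack contents along with you.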

                                    J wrote (#17, in reply to #1):

                                    I feel like a good test of any supposed AGI would be to hook it up to a feed of a classic 3D platformer like Mario 64, give it input control, and see how long it takes to progress through the game. We're seeing sparks of this with Claude Plays Pokemon, but any self-respecting superintelligence (or even human-equivalent) should be more than capable of learning the control scheme, navigating the 3D environment, solving puzzles, and generally playing through as competently as any 10-year-old seeing the game for the first time.

                                      Guest wrote (#18, in reply to #1):

                                      Don't forget to wash your hands after using anything Nintendo. Better not touch it at all. Or else their lawyers will come and kill you and your family.

                                        S wrote (#19, in reply to #18):

                                        Too late, Nintendo just filed a trademark for text on a screen, which you are now infringing upon.

                                          T wrote (#20, in reply to #1):

                                          At least I'll have the peace of knowing that humanity accomplished something AMAZING while the planet goes up in flames. More funding!!! /s
