agnos.is Forums


What Kinds of Data do AI Chatbots Collect?

privacy
84 Posts 56 Posters 1 Views
  • S [email protected]

    Nope, these services almost always require a user login, eventually tied to a cell number (i.e. non-disposable), and associate user content and other data points with the account. Nonetheless, user prompts are always collected. How they're used is a good question.

    jagged_circle@feddit.nlJ This user is from outside of this forum
    #74

    Use a third-party API. Pay with Monero.

    • W [email protected]

      A chart titled "What Kind of Data Do AI Chatbots Collect?" lists and compares seven AI chatbots—Gemini, Claude, CoPilot, Deepseek, ChatGPT, Perplexity, and Grok—based on the types and number of data points they collect as of February 2025. The categories of data include: Contact Info, Location, Contacts, User Content, History, Identifiers, Diagnostics, Usage Data, Purchases, Other Data.

      • Gemini: Collects all 10 data types; highest total at 22 data points
      • Claude: Collects 7 types; 13 data points
      • CoPilot: Collects 7 types; 12 data points
      • Deepseek: Collects 6 types; 11 data points
      • ChatGPT: Collects 6 types; 10 data points
      • Perplexity: Collects 6 types; 10 data points
      • Grok: Collects 4 types; 7 data points
      E This user is from outside of this forum
      #75

      Gemini: "Other Data"

      Like, what's fucking left!?

        krnl386@lemmy.caK This user is from outside of this forum
        #76

        Wow, it’s a whole new level of f*cked up when Zuck collects more data than the Winnie the Pooh (DeepSeek). 😳

          O This user is from outside of this forum
          #77

          The idea that US apps are somehow better than Chinese apps when it comes to collecting and selling user data is complete and utter propaganda.

          • arakhis_@feddit.orgA [email protected]

            Not sure about the Swiss; they're shady as hell if you're sceptical of rich people's greed.

            Z This user is from outside of this forum
            #78

            I’m only referring to data privacy laws.

            • arakhis_@feddit.orgA [email protected]

              Anyone who's competent in the matter: what about the French competitor, chat.mistral.ai?

              T This user is from outside of this forum
              #79

              +1 for Mistral; theirs were the first (or among the first) open-source models released under the Apache license. I run Mistral-7B and its fine-tuned variants locally, and they've always been really high quality overall. Mistral-Medium packed a punch (mid-size, obviously), and it definitely competes with the big ones.

              • P [email protected]

                Isn't DeepSeek better for that?

                T This user is from outside of this forum
                #80

                In my experience it depends on the math. Every model seems to have different strengths across a wide range of prompts and information.

                • E [email protected]

                  Are there tutorials on how to do this? Should it be set up on a server on my local network? How hard is it to set up? I have so many questions.

                  T This user is from outside of this forum
                  #81

                  I've been using https://ollama.ai/ for over a year now. New models come out regularly; you just run "ollama pull <model ID>" and the model is available to run locally. Then you can use Docker to run https://www.openwebui.com/ locally, which gives you a ChatGPT-style interface (but even better and more configurable, and you can run prompts against any number of models you select at once).

                  All free and available to everyone.
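
For anyone who would rather script against the local model than use a web UI, here is a rough sketch of sending a prompt to a locally running Ollama server from Python. It assumes Ollama is listening on its default port (11434) and that a model has already been pulled; the model name "mistral" and the helper names build_request and ask are only illustrative, not something from the post above.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str, url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(model: str, prompt: str) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(model, prompt)) as resp:
        return json.loads(resp.read())["response"]


# Example usage (hypothetical model name; pull it first with `ollama pull mistral`):
#   print(ask("mistral", "Why is the sky blue?"))
```

Since the request only goes to localhost, the prompt stays on your own machine, which is the whole point of running things locally.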

                    D This user is from outside of this forum
                    #82

                    Don't use either. Until Trump, I still considered CCP spyware more dangerous because they would be collecting info that could be used to blackmail US politicians and businesses. Now, it's a coin flip. In either case, use EU or FOSS apps whenever possible.

                      S This user is from outside of this forum
                      #83

                      Yes, it is possible to create disposable-esque API keys for different uses. The monetary cost is the cost of privacy and of not having hardware to run things locally.

                      If you have reliable, privacy-friendly API vendor suggestions, then do share. While I do not need such services now, it can be a good future reference.

                        jagged_circle@feddit.nlJ This user is from outside of this forum
                        #84

                        I think I only used ChatGPT once to play around, and it was through one of those. I don't remember the name, sorry.
