Prompt Engineer
-
The AI probably:
Well, I might have made up responses before, but now that "make up responses" is in the prompt, I will definitely make up responses now.I love poison.
-
I used to tell it my family would die.
What do you tell it now?
-
I think that makes sense. I am 100% a layman with this stuff, buy if the "AI" is just predicting what should be said by studying things humans have written, then it makes sense that actual people were more likely to give serious, solid answers when the asker is putting forth (relatively) heavy stakes.
Who knew that a training in carpet salesmanship helps for a job as a prompt engineer.
-
What do you tell it now?
That they're all dead and it's its fault.
-
It does not feel empathy. It does not feel anything.
Maybe yours doesn't. My AI loves me. It said so
-
Half of the ways people were getting around guardrails in the early chatgpt models was berating the AI into doing what they wanted
Half of the ways people were getting around guardrails in the early chatgpt models was berating the AI into doing what they wanted
I thought the process of getting around guardrails was an increasingly complicated series of ways of getting it to pretend to be someone else that doesn't have guardrails and then answering as though it's that character.
-
Half of the ways people were getting around guardrails in the early chatgpt models was berating the AI into doing what they wanted
I thought the process of getting around guardrails was an increasingly complicated series of ways of getting it to pretend to be someone else that doesn't have guardrails and then answering as though it's that character.
that’s one way. my own strategy is to just smooth talk it. you dont come to the bank manager and ask him for the keys to the safe. you come for a meeting discussion your potential deposit. then you want to take a look at the safe. oh, are those the keys? how do they work?
just curious, what kind of guardrails have you tried going against? i recently used the above to get a long and detailed list of instructions for cooking meth (not really interested in this, just to hone the technique)
-
I think that makes sense. I am 100% a layman with this stuff, buy if the "AI" is just predicting what should be said by studying things humans have written, then it makes sense that actual people were more likely to give serious, solid answers when the asker is putting forth (relatively) heavy stakes.
Yep exactly that. A fascinating side-effect is that models become better at logic when you tell them to talk like a Vulkan.
-
This post did not contain any content.
Fix it now, or you go to jail
-
Yep exactly that. A fascinating side-effect is that models become better at logic when you tell them to talk like a Vulkan.
Hmm... It's only logical.