How can we determine if an AI is telling the truth? Or if it’s been ‘influenced’

How can we determine if an AI is telling the truth? Or if it’s been ‘influenced’ to avoid the truth? Or if it’s advising something moral or immoral? We’ve done the work. You wouldn’t think it’s possible. It is. And it’s a little disturbing that it is. 😉 Publish first, AI second.

Source date (UTC): 2023-01-11 23:43:30 UTC

Original post: https://twitter.com/i/web/status/1613320729587744768

Reply addressees: @remotejoeclark @SimonHoiberg

Replying to: https://twitter.com/i/web/status/1613092350192148480
