How can we determine if an AI is telling the truth? Or if it’s been ‘influenced’

How can we determine if an AI is telling the truth? Or if it’s been ‘influenced’ to avoid the truth? Or if it’s advising something moral or immoral? We’ve done the work. You wouldn’t think it’s possible. It is. And it’s a little disturbing that it is. 😉 Publish first, AI second.


Source date (UTC): 2023-01-11 23:43:30 UTC

Original post: https://twitter.com/i/web/status/1613320729587744768

Reply addressees: @remotejoeclark @SimonHoiberg

Replying to: https://twitter.com/i/web/status/1613092350192148480
