17-10-2023 16:40 via slashdot.org

Microsoft-affiliated Research Finds Flaws in GPT-4

Sometimes, following instructions too precisely can land you in hot water -- if you're a large language model, that is. From a report: That's the conclusion reached by a new, Microsoft-affiliated scientific paper that looked at the "trustworthiness" -- and toxicity -- of large language models (LLMs) including OpenAI's GPT-4 and GPT-3.5, GPT-4's predecessor. The co-authors write that, possibly because GPT-4 is more likely to follow the instructions of "jailbreaking" prompts that bypass the model'