Yiiiiiiiiikes. I can't imagine submitting something like this in a legal case without checking!
re: the doubling down, it might be that OpenAI has improved it since — it's been awhile since I've experienced this — but in the past I've definitely seen 3.5 double down on a hallucination. Earlier this year I asked ChatGPT for film recommendations. It listed several that sounded interesting, but when I googled them I couldn't find them online. I asked ChatGPT if it made up the movie descriptions and it assured me that no, it had not, they were real movies!
Yiiiiiiiiikes. I can't imagine submitting something like this in a legal case without checking!
re: the doubling down, it might be that OpenAI has improved it since — it's been awhile since I've experienced this — but in the past I've definitely seen 3.5 double down on a hallucination. Earlier this year I asked ChatGPT for film recommendations. It listed several that sounded interesting, but when I googled them I couldn't find them online. I asked ChatGPT if it made up the movie descriptions and it assured me that no, it had not, they were real movies!