How much does it bother you that OpenAI is trained on your data? What can we do about it?

duncesplayed@lemmy.one · edit-2 2 years ago

DevCat@lemmy.world · 2 years ago

GIGO - Garbage In, Garbage Out. I asked ChatGPT to write a short essay and include a bibliography with URLs. Every URL was a 404, and when I looked up the bibliographic entries, they were nonexistent as well.

Limivorous@lemmy.one · 2 years ago

That’s because you don’t understand the tool you are using and use tech-sounding language in the wrong context to look like you do.

GPT models generate text based on the patterns of tokens they learned during training. The URLs it gives you don’t work because they only have to look legit. It’s all statistical patterns.

It’s not because they fed it garbage during the semi-supervised training, it’s because that literally is what the tool is meant for: producing plausible text, not looking things up. Use the right tool, like Google Scholar, if what you need is sources.
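
To make the "only has to look legit" point concrete, here is a rough toy sketch. It is not how GPT works internally: it swaps in a character-level Markov chain for the real model, and the training URLs, `ORDER`, `train`, and `generate` are all made up for illustration. The idea carries over, though: the generator only learns which character tends to follow which, so it can stitch together URLs that look right without any notion of whether the page exists.

```python
import random
from collections import defaultdict

# Hypothetical training data: invented URLs on reserved example domains.
TRAINING_URLS = [
    "https://example.org/papers/2019/language-models",
    "https://example.org/papers/2020/neural-networks",
    "https://example.com/articles/2021/language-learning",
    "https://example.com/articles/2018/neural-models",
]

ORDER = 3   # how many preceding characters count as context (assumed value)
END = "\n"  # marker appended to each URL so the model can learn where to stop


def train(urls, order=ORDER):
    """Record, for every `order`-character context, the characters seen next."""
    followers = defaultdict(list)
    for url in urls:
        text = url + END
        for i in range(order, len(text)):
            followers[text[i - order:i]].append(text[i])
    return followers


def generate(model, seed, rng, order=ORDER, max_len=80):
    """Extend `seed` one character at a time by sampling observed followers."""
    out = seed
    while len(out) < max_len:
        choices = model.get(out[-order:])
        if not choices:
            break  # context never seen in training
        nxt = rng.choice(choices)
        if nxt == END:
            break  # model chose to stop
        out += nxt
    return out


model = train(TRAINING_URLS)
rng = random.Random(0)  # fixed seed so runs are repeatable
samples = [generate(model, "htt", rng) for _ in range(5)]

for s in samples:
    # Flag outputs that are recombinations rather than copies of training data.
    tag = "seen in training" if s in TRAINING_URLS else "never seen, just looks plausible"
    print(f"{s}  ({tag})")
```

Nothing in this sketch ever checks a URL against the real world. Some outputs may happen to match a training URL, others will be new mixes of the same pieces, and the generator has no way to tell the difference.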