• 0 Posts
  • 71 Comments
Joined 2 years ago
cake
Cake day: June 28th, 2023

help-circle
  • reliably determining whether content (or an issue) is AI generated remains a challenge, as even human-written text can appear ‘AI-like.’

    True (even if this answer sounds like something a chatbot would generate). I have come across a few human slop generators/bots in my life myself. However, making up entire titles of books or papers appears to be a specialty of AI. Humans would not normally go to this trouble, I believe. They would either steal text directly from their sources (without proper attribution) or “quote” existing works without having read them.




  • I’ve noticed a trend where people assume other fields have problems LLMs can handle, but the actually competent experts in that field know why LLMs fail at key pieces.

    I am fully aware of this. However, in my experience, it is sometimes the IT departments themselves that push these chatbots onto others in the most aggressive way. I don’t know whether they found them to be useful for their own purposes (and therefore assume this must apply to everyone else as well) or whether they are just pushing LLMs because this is what management expects them to do.


  • First, we are providing legal advice to businesses, not individuals, which means that the questions we are dealing with tend to be even more complex and varied.

    Additionally, I am a former professional writer myself (not in English, of course, but in my native language). Yet, even I find myself often using complicated language when dealing with legal issues, because matters tend to be very nuanced. “Dumbing down” something without understanding it very, very well creates a huge risk of getting it wrong.

    There are, of course, people who are good at expressing legal information in a layperson’s way, but these people have usually studied their topic very intensively before. If a chatbot explains something in “simple” language, their output usually contains serious errors that are very easy for experts to spot because the chatbot operates on the basis of stochastic rules and does not understand its subject at all.





  • And then we went back to “it’s rarely wrong though.”

    I am often wondering whether the people who claim that LLMs are “rarely wrong” have access to an entirely different chatbot somehow. The chatbots I tried were rarely ever correct about anything except the most basic questions (to which the answers could be found everywhere on the internet).

    I’m not a programmer myself, but for some reason, I got the chatbot to fail even in that area. I took a perfectly fine JSON file, removed one semicolon on purpose and then asked the chatbot to fix it. The chatbot came up with a number of things that were supposedly “wrong” with it. Not one word about the missing semicolon, though.

    I wonder how many people either never ask the chatbots any tricky questions (with verifiable answers) or, alternatively, never bother to verify the chatbots’ output at all.


  • FWIW, I work in a field that is mostly related to law and accounting. Unlike with coding, there are no simple “tests” to try out whether an AI’s answer is correct or not. Of course, you could try these out in court, but this is not something I would recommend (lol).

    In my experience, chatbots such as Copilot are less than useless in a context like ours. For more complex and unique questions (which is most of the questions we are dealing with everyday), it simply makes up smart-sounding BS (including a lot of nonexistent laws etc.). In the rare cases where a clear answer is already available in the legal commentaries, we want to quote it verbatim from the most reputable source, just to be on the safe side. We don’t want an LLM to rephrase it, hide its sources and possibly introduce new errors. We don’t need “plausible deniability” regarding plagiarism or anything like this.

    Yet, we are being pushed to “embrace AI” as well, we are being told we need to “learn to prompt” etc. This is frustrating. My biggest fear isn’t to be replaced by an LLM, not even by someone who is a “prompting genius” or whatever. My biggest fear is to be replaced by a person who pretends that the AI’s output is smart (rather than filled with potentially hazardous legal errors), because in some workplaces, this is what’s expected, apparently.



  • I think most cons, scams and cults are capable of damaging vulnerable people’s mental health even beyond the most obvious harms. The same is probably happening here, the only difference being that this con is capable of auto-generating its own propaganda/PR.

    I think this was somewhat inevitable. Had these LLMs been fine-tuned to act like the mediocre autocomplete tools they are (rather than like creepy humanoids), nobody would have paid much attention to them, and investors would have started to focus on the high cost of running them quickly.

    This somewhat reminds me of how cryptobros used to claim they were fighting the “legacy financial system”, yet they were creating a worse version (almost a parody) of it. This is probably inevitable if you are running an unregulated financial system and are trying to extract as much money from it as possible.

    Likewise, if you have a tool capable of messing with people’s minds (to some extent) and want to make a lot of money from it, you are going to end up with something that resembles a cult, an LLM or similarly toxic groups.


  • I think this has happened before. There are accounts of people who completely lost touch with reality after getting involved with certain scammers, cult-leaders, self-help gurus, “life coaches”, fortune tellers or the like. However, these perpetrators were real people who could only handle a limited number of victims at any given time. Also, they probably had their very specific methods and strategies which wouldn’t work on everybody, not even all the people who might have been the most susceptible. ChatGPT, on the other hand, can do this at scale. Also, it was probably trained from all websites and public utterances of any scammer, self-help author, (wannabe) cult leader, life coach, cryptobro, MLM peddler etc. available, which allows it to generate whatever response works best to keep people “hooked”. In my view, this alone is a cause for concern.



  • Just guessing, but the reported “90% accuracy” are probably related to questions that could be easily answered from an FAQ list. The rest is probably at least in part about issues where the company itself f*cked up in some way… Nothing wrong with answering from an FAQ in theory, but if all the other people get nicely worded BS answers (for which the company couldn’t be held accountable), that is a nightmare from every customer’s point of view.


  • At the very least, actual humans have an incentive not to BS you too much, because otherwise they might be held accountable. This might also be the reason why call center support workers sound less than helpful sometimes - they are unable to help you (for various technical or corporate reasons) and feel uneasy about this. A bot is probably going to tell you whatever you want to hear while sounding super polite all the time. If all of it turns out to be wrong… well, then this is your problem to deal with.