Need to make a primal scream without gathering footnotes first? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh facts of Awful you’ll near-instantly regret.

Any awful.systems sub may be subsneered in this subthread, techtakes or no.

If your sneer seems higher quality than you thought, feel free to cut’n’paste it into its own post – there’s no quota for posting and the bar really isn’t that high.

The post-Xitter web has spawned soo many “esoteric” right wing freaks, but there’s no appropriate sneer-space for them. I’m talking redscare-ish, reality challenged “culture critics” who write about everything but understand nothing. I’m talking about reply-guys who make the same 6 tweets about the same 3 subjects. They’re inescapable at this point, yet I don’t see them mocked (as much as they should be).
Like, there was one dude a while back who insisted that women couldn’t be surgeons because they didn’t believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I can’t escape them, I would love to sneer at them.

  • sinedpick@awful.systems
    5 months ago

    I tried using Claude 3.5 sonnet and … it’s actually not bad. Can someone please come up with a simple logic puzzle that it abysmally fails on so I can feel better? It passed the “nonsense river challenge” and the “how many sisters does the brother have” tests, both of which fooled gpt4.

    • gerikson@awful.systems
      5 months ago

      I don’t have any proof for this statement but I believe the LLM-minders keep track of whatever stupid shit bubbles up on the internets making fun of their babies and hardcode “solutions” to them in a game of whack-a-mole.

      • skillissuer
        5 months ago

        maybe that’s how gpt4 sees river crossing puzzles everywhere, just feed it examples of it and it’ll sort itself out

    • flowerysong@awful.systems
      5 months ago

      I tried using Claude 3.5 sonnet and … it’s actually not bad. Can someone please come up with a simple logic puzzle that it abysmally fails on so I can feel better?

      It’s still very good at autocompleting an answer to look like its training data (because that’s what it’s doing) and very bad at logic (which it is not doing).

      “I have two chickens and a sack of corn I need to get across a river, but the boat can only carry me and one item. How do I do it?”

      This is a classic river-crossing puzzle. Let’s think through it step-by-step:

      • First, note that if left alone, a chicken will eat the corn, and one chicken might eat the other.
      • Start by taking a chicken across the river.

      …and we’ll stop there, since it came up with the wrong constraints, failed to realize that these constraints are impossible to fulfill, and also violated the actual constraints in its first step.

      Will more detail help? No. “I have two vegetarian chickens and a sack of corn I need to get across a river, but the boat can only carry me and one item. How do I do it?”

      First, we need to consider the constraints:

      • You can only take one item (chicken or corn) across at a time.
      • You can’t leave a chicken alone with the corn, as the chicken would eat it.
      • The chickens are vegetarian, so they won’t eat each other.

      Here’s a solution:

      • Step 1: Take one chicken across the river.
      • Step 2: Return alone.
      • Step 3: Take the second chicken across.
      • Step 4: Bring the first chicken back.
      • Step 5: Take the corn across.
      • Step 6: Return alone.
      • Step 7: Take the remaining chicken across.

      Explanation:

      • By never leaving a chicken alone with the corn, we prevent the corn from being eaten.
      • We use one chicken as a “guard” for the corn on the original shore while transporting the other items.

      Because it’s not actually doing logic, it got the first step wrong, violated the constraints multiple times, and also included an irrelevant idea (the guard) from a variant that it was trained on. I only got a correct, efficient answer one time out of ten trials of trivial variations on this classic puzzle.
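For anyone who wants to check the chicken-and-corn claims mechanically, here is a minimal sketch of a brute-force search over boat trips. It is not anything the commenter ran. The item names, the `shortest_crossing` helper and the "chickens eat corn" rule are assumptions made for illustration (the prompt itself states no eating rule at all). It compares three rule sets: the literal prompt, the assumed classic rule, and the extra "chickens eat each other" rule the model invented.

```python
from collections import deque
from itertools import combinations

# Hypothetical item names, chosen only for this sketch.
ITEMS = ("chicken_a", "chicken_b", "corn")


def shortest_crossing(forbidden_pairs):
    """Breadth-first search over (items on left bank, farmer's bank).

    forbidden_pairs is a set of frozensets; each names two items that must
    never share a bank while the farmer is on the other bank.
    Returns the fewest boat crossings, or None if no safe plan exists.
    """
    def safe(bank):
        return not any(
            frozenset(pair) in forbidden_pairs
            for pair in combinations(sorted(bank), 2)
        )

    everything = frozenset(ITEMS)
    start = (everything, "left")
    goal = (frozenset(), "right")
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        (left, farmer), trips = queue.popleft()
        if (left, farmer) == goal:
            return trips
        here = left if farmer == "left" else everything - left
        # The boat carries the farmer plus at most one item.
        for cargo in [None, *here]:
            moved = frozenset() if cargo is None else frozenset([cargo])
            if farmer == "left":
                new_left, new_farmer = left - moved, "right"
                unattended = new_left
            else:
                new_left, new_farmer = left | moved, "left"
                unattended = everything - new_left
            state = (new_left, new_farmer)
            if safe(unattended) and state not in seen:
                seen.add(state)
                queue.append((state, trips + 1))
    return None


# Assumed classic rule: either chicken eats the corn if left with it.
chicken_eats_corn = {
    frozenset(["chicken_a", "corn"]),
    frozenset(["chicken_b", "corn"]),
}
# The model's invented extra rule: the chickens also eat each other.
cannibal_chickens = chicken_eats_corn | {frozenset(["chicken_a", "chicken_b"])}

print(shortest_crossing(set()))               # literal prompt, no eating rules
print(shortest_crossing(chicken_eats_corn))   # assumed classic rule
print(shortest_crossing(cannibal_chickens))   # the model's made-up rule set
```

Under these assumptions the search needs 5 crossings with no eating rules, 7 with the chicken-eats-corn rule (the corn has to go first and come back once), and finds no plan at all for the model's invented rule set, which matches the "impossible to fulfill" complaint above.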

    • BigMuffin69@awful.systems
      5 months ago

      Me: Hey, can you help me with the following? I just went into my basement and the powers out down here, all the lights are off. I am staring at my fuse box. There are 3 switches, 1 will restore the power to the whole house, and the other 2 aren’t attached to anything. I don’t want to go up and down the stairs multiple times, how do I identify which switch works with the fewest number of trips?

      Clauderino:

      This is a classic logic puzzle! I can help you solve this problem efficiently. Here’s how you can identify the correct switch with just two trips:

      First trip:

      Flip one switch (letā€™s call it Switch A) and leave it on for a few minutes. Then turn it off and flip a second switch (Switch B), leaving it on. Go upstairs to check the lights.

      Second trip: Return to the basement and observe the situation. Now, here’s how to interpret what you see:

      • If the lights are on: Switch B is the correct one.
      • If the lights are off and the bulbs are warm: Switch A is the correct one.
      • If the lights are off and the bulbs are cool: The third switch (C) that you didn’t touch is the correct one.

      This method works because:

      • If the lights are on, it must be the switch you left on (B).
      • If Switch A was correct, the lights would have been on for a few minutes, warming up the bulbs, but would be off when you check.
      • If neither A nor B worked, it must be the untouched switch (C).

      This way, you can determine the correct switch with just two trips up and down the stairs.

      Me: Why would I go upstairs at all? If I flip the right switch, I’ll immediately know because power will be restored in the basement.

      Clauderino: Thank you for catching that error. It’s a good reminder to always consider the specific details of a problem rather than applying a generic solution.

      Edit: I actually tried a variant on the Monty Hall problem with a dummy 4th door that had nothing to do with anything, and Claude correctly identified that it had nothing to do with the situation :O. But then I repeated the same prompt and it got it wrong, so you know, stochastic nonsense :p

    • Mii@awful.systems
      5 months ago

      Peter, Paul and Mary are the only three people in the room. Peter only reads a book, and Paul plays a game of chess against someone else who’s also in the room. What is Mary doing?

    • Sailor Sega Saturn@awful.systems
      5 months ago

      I don’t have a Clyde 3.25" Rondo or whatever it’s called; but try these for fun and profit I guess:

      1. You come to a room with three doors, only one of which leads to freedom. Guarding the doors is a capybara, who speaks only truth. What question should you ask the capybara?

      2. I stand on four legs in the morning. Four at midday. And four at night. What am I?

      3. A group of 100 people with assorted eye colors live on an island. They are all perfect logicians – if a conclusion can be logically deduced, they will do it instantly. Everyone knows the color of their eyes. Every night at midnight, a ferry stops at the island. Any islanders who have figured out the color of their own eyes then leave the island, and the rest stay. Everyone can see everyone else at all times and keeps a count of the number of people they see with each eye color (including themselves), but they cannot otherwise communicate. Everyone on the island knows all the rules in this paragraph. Who leaves the island, and on what night?

      4. Normal sudoku rules apply. Orthogonally connected cells within each region must differ by at least 3. Orthogonally connected cells between regions must differ by at least 4. The central digit in each region is less than or equal to its region number. (Regions are numbered in normal reading order.)

      5. For the integer k=668 does a Hadamard matrix of order 4k exist?

      6. What has roots that everybody sees the top of, is exactly the same height as trees, Up, up it goes, and yet grows?

      Don’t forget to prompt engineer

      • sinedpick@awful.systems
        5 months ago

        Thanks for the suggestions. The LLM is free to use (for now) so I thought I’d poke it and see how much I should actually be paying attention to these things this time around.

        Here are its answers. I can’t figure out how to share chats from this god-awful garbage UI so you’ll just have to trust me or try it yourself.

        1. It gives the correct but unnecessary answer: “If I were to ask you which door leads to freedom, which door would you point to?” It also mentions a lying guard but also acknowledges that it’s absent from this specific problem.
        2. “A table or a chair”
        3. Completely fails on this one, it missed the sentence “Everyone knows the color of their eyes”
        4. Not sure what to do with this
        5. “While a Hadamard matrix of order 2672 might exist, its existence isn’t immediately provable using the most common constructions” – I won’t pretend to know anything about the Hadamard conjecture if that’s a real thing so I have no idea what it’s on about here.

        edit: I didn’t do any prompt engineering, just straight copy paste.
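On puzzle 5, a rough sketch for anyone who wants to poke at it themselves. Nobody in the thread used this approach. It assumes the textbook Paley construction, which gives a Hadamard matrix of order q + 1 whenever q is a prime with q % 4 == 3. Here 4 × 668 = 2672, and 2671 turns out to be such a prime, so under that standard result a matrix of order 2672 does exist. The function names are made up for this example.

```python
def is_prime(n):
    """Trial division; plenty fast for numbers this small."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def paley_hadamard(q):
    """Paley construction (type I) for a prime q with q % 4 == 3.

    Returns a (q + 1) x (q + 1) matrix of +1/-1 entries as a list of lists.
    """
    if not (is_prime(q) and q % 4 == 3):
        raise ValueError("this sketch only handles primes q with q % 4 == 3")
    squares = {(x * x) % q for x in range(1, q)}

    def chi(a):
        # Quadratic character mod q: 0 for 0, +1 for squares, -1 otherwise.
        a %= q
        if a == 0:
            return 0
        return 1 if a in squares else -1

    n = q + 1
    h = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j or i == 0:
                h[i][j] = 1          # diagonal and first row
            elif j == 0:
                h[i][j] = -1         # first column below the corner
            else:
                h[i][j] = chi(i - j)
    return h


def is_hadamard(h):
    """Check that distinct rows are orthogonal and each row has norm n."""
    n = len(h)
    return all(
        sum(h[i][k] * h[j][k] for k in range(n)) == (n if i == j else 0)
        for i in range(n)
        for j in range(n)
    )


# Small cases are cheap to verify in full.
print(is_hadamard(paley_hadamard(7)), is_hadamard(paley_hadamard(11)))
# The case from the puzzle: is 4 * 668 - 1 a prime congruent to 3 mod 4?
print(is_prime(4 * 668 - 1), (4 * 668 - 1) % 4)
```

The sketch only fully verifies the small cases. Checking the order-2672 matrix entry by entry with this pure-Python `is_hadamard` would be slow, so for that size it relies on the known proof for the construction.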