• Ludicrous0251@piefed.zip
    link
    fedilink
    English
    arrow-up
    21
    arrow-down
    1
    ·
    3 hours ago

    Friendly reminder that LLMs don’t do math, they guess what number should come next, just like words.

    It can probably link the image to the words “a photo of a sandwich on a plate”, and interpret the question as “how many calories are in a sandwich” but from there it is just guessing at the syntax of an answer, but not at finding any truth.

    It knows sandwiches have calories and those tend to be 3-4 digit numbers, but also all numbers kinda look the same, so what’s to say it’s not 2, 5, or 12 digits?

    • monkeyslikebananas2@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      1
      ·
      2 hours ago

      Tool-powered agents can do math though. The issue is the fuzziness of it trying to guess carbs. It doesn’t know weight, ingredients, or anything other than a picture. These tools can be useful but not for this. Maybe one day but not yet.

      Whoever claims an AI (LLM or agents) can do that and charging their users is lying and defrauding them.