Over the past few weeks, several US banks have pulled off from lending to Oracle for expanding its AI data centres, as per a report.

  • Not_mikey@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 days ago

    Here’s the source it’s from open AI but it is peer reviewed. Here’s another source that uses it as a baseline to compare the relative scores and according to the tables in 2023 it got a 610, putting it around the 75th percentile, and that’s just for math which the open AI study showed it did about 5% worse then it’s average so ~80th percentile for a total score. Again this is for students who are usually more prepared for the SAT than the general population, so it’s still probably in the 90th percentile for the general population.

    Again for the car wash example that is not declaritive knowledge, like the pizza glue that is knowledge derived from experience and reason which I’ve said that LLMs aren’t the best at. The fact that they had to make a riddle for the AI to trip it up if anything shows how good it is. If it was as bad as you say it is then anyone could easily trip it up and get it to give a wrong answer and a study like that wouldn’t be relevant. Seriously if you think the LLM is so inaccurate, come up with your own test to stump it, it should be easy by the way you talk about them.

    • CileTheSane@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 days ago

      The fact that they had to make a riddle for the AI to trip it up

      “I want to take my car to the car wash, should I walk or drive” is not a riddle. It requests basic understanding of what is being asked.