LLMs are solving MCAT, the bar test, SAT etc like they’re nothing. At this point their performance is super human. However they’ll often trip on super simple common sense questions, they’ll struggle with creative thinking.

Is this literally proof that standard tests are not a good measure of intelligence?

  • Margot Robbie@lemmy.world
    link
    fedilink
    arrow-up
    38
    arrow-down
    1
    ·
    4 months ago

    All standardized test is how well you prepared for that particular standardized test, doesn’t matter if it is the SAT, MCAT, or Leetcode. You aren’t suppose to think on the spot for these tests, you are suppose regurgitate everything you have rehearsed for weeks and months during the test.

    And unthinking regurgitation is what LLMs do better than anything else.

    • learningduck@programming.dev
      link
      fedilink
      arrow-up
      8
      arrow-down
      1
      ·
      edit-2
      4 months ago

      I would argue that some code test questions can be solved spontaneously, but they are limited to easy to some early medium questions, or patterns that are common enough.

      I guess this is more common in non FANG companies that don’t have to filter out candidates just because of the sheer number alone.