• Kuvwert@lemm.ee
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      1 day ago

      https://ibb.co/wVNsn5H

      https://ibb.co/HpK5G5Pp

      https://ibb.co/sp1wGMFb

      https://ibb.co/4wyKhkRH

      https://ibb.co/WpBTZPRm

      https://ibb.co/0yP73j6G

      Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)

      Edit 1:

      Incidentally… I propositioned a coworker to answer the same question. This is the summarized conversation I had:

      Me: “Hey Billy, can you answer a question? in under 3 seconds answer my following question”

      Billy: “sure”

      Me: “How many As are in abracadabra 3.2.1”

      Billy: “4” (answered in less than 3 seconds)

      Me: “nope”

      I’m gonna poll the office and see how many people get it right with the same opportunity the ai had.

      Edit 2: The second coworker said “6” in about 5 seconds

      Edit 3: Third coworker said 4, in 3 seconds

      Edit 4: I asked two more people and one of them got it right… But I’m 60% sure she heard me asking the previous employee, but if she didnt we’re at 1/5

      In probably done with this game for the day.

      I’m pretty flabbergasted with the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70b is wildly faster and more accurate than humans .