• Kuvwert@lemm.ee
        1 day ago

        https://ibb.co/wVNsn5H

        https://ibb.co/HpK5G5Pp

        https://ibb.co/sp1wGMFb

        https://ibb.co/4wyKhkRH

        https://ibb.co/WpBTZPRm

        https://ibb.co/0yP73j6G

        Note that my tests were run via Groq with the R1 70B distilled Llama variant (the 2nd smartest version afaik)

        Edit 1:

        Incidentally… I asked a coworker to answer the same question. This is a summary of the conversation I had:

        Me: “Hey Billy, can you answer a question? in under 3 seconds answer my following question”

        Billy: “sure”

        Me: “How many As are in abracadabra 3.2.1”

        Billy: “4” (answered in less than 3 seconds)

        Me: “nope”

        I’m gonna poll the office and see how many people get it right with the same opportunity the ai had.

        Edit 2: The second coworker said “6” in about 5 seconds

        Edit 3: Third coworker said 4, in 3 seconds

        Edit 4: I asked two more people and one of them got it right… but I’m 60% sure she heard me asking the previous person. If she didn’t, we’re at 1/5

        I’m probably done with this game for the day.

        I’m pretty flabbergasted with the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70B is wildly faster and more accurate than humans.
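For anyone who wants to check the tally, here is a rough sketch of the ground truth and the office score in Python. The `count_letter` helper and the `answers` list are my own illustration, not anything from the model runs. The list assumes the order of the last two coworkers, and the one wrong answer that was never recorded is left as `None` rather than guessed.

```python
def count_letter(text: str, letter: str) -> int:
    """Case-insensitive count of a single letter in text."""
    return text.lower().count(letter.lower())


# Ground truth for the question asked in the office.
correct = count_letter("abracadabra", "a")

# Answers as reported in the edits above: 4, 6, 4, then one correct answer
# and one wrong answer whose value was not recorded (None, an assumption
# about ordering only, not a made-up number).
answers = [4, 6, 4, correct, None]

# Count how many coworkers matched the ground truth.
right = sum(1 for a in answers if a == correct)

print(f"correct answer: {correct}, humans right: {right}/{len(answers)}")
```

If the possibly overheard answer is thrown out, the same tally drops to 0/4.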