• jarfil@beehaw.org
    link
    fedilink
    arrow-up
    5
    ·
    edit-2
    3 months ago

    Peak intelligence, is realizing an LLM doesn’t care whether its tokens represent chunks of text, sound, images, videos, 3D models, paths, hand movements, floor planning, emojis, etc.

    The keyword is: “multimodal”.

    As for being able to correctly correlate some “chunks of MRI scan” with the word “tumor”… that’s all about the training (which I’d bet Claude is missing… did I hear “investment opportunity”? Guy isn’t wrong).