• thefartographer@lemm.ee
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    6
    ·
    7 hours ago

    r1 is lightweight and optimized for local environments on a home PC. It’s supposed to be pretty good at programming and logic and kinda awkward at conversation.

    v3 is powerful and meant to run on cloud servers. It’s supposed to make for some pretty convincing conversations.

    • Pennomi@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      7 hours ago

      R1 isn’t really runnable with a home rig. You might be able to run a distilled version of the model though!

        • Pennomi@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          1
          ·
          6 hours ago

          That likely is one of the distilled versions I’m talking about. R1 is 720 GB, and wouldn’t even fit into memory on a normal computer. Heck, even the 1.58-bit quant is 131GB, which is outside the range of a normal desktop PC.

          But I’m sure you know what version you’re running better than I do, so I’m not going to bother guessing.

      • thefartographer@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        2
        ·
        5 hours ago

        You’re absolutely right, I wasn’t trying to get that in-depth, which is why I said “lightweight and optimized,” instead of “when using a distilled version” because that raises more questions than it answers. But I probably overgeneralized by making it a blanket statement like that.