• cadekat@pawb.social
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    3
    ·
    6 hours ago

    You can get pretty far with a stack of 5090s and llama.cpp with split mode graph (or so I’ve heard, I’ve never tried), or AMD’s unified memory CPU thing.

    It’s not as good as data centre grade stuff, but it’s not nothing either.