Visions of a larger plunder

aldalire@lemmy.dbzer0.com · 10 months ago

Visions of a larger plunder

MalReynolds · 10 months ago

Akshually, while training models requires (at the moment) massive parallelization and consequently stacks of A100s, inference can be distributed pretty well (see petals for example). A pirate ‘ChatGPT’ network of people sharing consumer graphics cards could probably indeed work if the data was sourced. It bears thinking about. It really does.

wolfshadowheart@kbin.social · 10 months ago

You definitely can train models locally, I am doing so myself on a 3080 and we wouldn’t be as many seeing public ones online if that were the case! But in terms of speed you’re definitely right, it’s a slow process for us.

MalReynolds · 10 months ago

I was thinking more of training the base models, LLAMA(2), and more topically GPT4 etc. You’re doing LoRA or augmenting with a local corpus of documents, no?

wolfshadowheart@kbin.social · 10 months ago

Ah yeah my mistake I’m always mixing up language and image based AI models. Training text based models is much less feasible locally lol.

There’s no model for my art so I’m creating a checkpoint model using xformers to bypass the VRAM requirement and then from there I’ll be able to speed up variants of my process using LORA’s but that won’t be for some time, I want a good model first.

MalReynolds · 10 months ago

Fair cop, Godspeed!