The Future of Large Language Model Pre-training is Federated

Hackworth@lemmy.world · edit-2 7 months ago

The Future of Large Language Model Pre-training is Federated

Martineski@lemmy.dbzer0.com · 7 months ago

Is this how you make a sentient planet?

Petter1@lemm.ee · 7 months ago

I like that 🤩

Martineski@lemmy.dbzer0.com · 7 months ago

I wonder if this will become a big thing in FOSS ai space. It’s hard to compete with corpos when it comes to computing power.

7 months ago

Still doesnt solve the whole what data can be used for a foss model thing but distributing compute requirements is good. Idk if this still requires that each node can compute the whole model tho might be a limitation of model sizes since moat pwople wont be able to run huge models etc.

Audrey0nne@leminal.space · 7 months ago

Lot of words just to say that once the advertisers move in on a centralized platform its value is shot. A huge part of the reason I abandoned the last platform I was using and sought a federated alternative.

Hackworth@lemmy.world · 7 months ago

The papers have a ton of practical info about feasibility, implementation, etc.

General_Effort@lemmy.world · 7 months ago

As far as I know, federated learning is pretty much dead. The point would be that it allows organizations to create a joint model without sharing data. But it doesn’t look like anyone who doesn’t want to share data wants to share a model.

Hackworth@lemmy.world · 7 months ago

Until they can distribute the training load of large models to consumer graphics cards (and do something like SETI@Home) it does seem like the benefit of distributed training isn’t enough to overcome the friction.