The Future of Large Language Model Pre-training is Federated

Hackworth@lemmy.world · 7 months ago

General_Effort@lemmy.world · 7 months ago

As far as I know, federated learning is pretty much dead. The point is that it lets organizations build a joint model without sharing their data. But it looks like organizations that don’t want to share data don’t want to share a model either.
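
For anyone unfamiliar with the idea, here is a rough sketch of federated averaging (FedAvg), the textbook way to build a joint model without pooling data. It is only an illustration, not the method from the linked paper: the one-parameter model, the two "organizations", their data, and every function name are made up for the example.

```python
# Minimal federated averaging (FedAvg) sketch in pure Python.
# Everything here (model, data, names) is hypothetical and for illustration.

def local_update(w, data, lr=0.05, epochs=5):
    """Gradient descent on one client's private data for the model y = w * x."""
    for _ in range(epochs):
        # Gradient of the mean squared error with respect to w.
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w

def federated_average(client_weights, client_sizes):
    """Average client weights, weighted by how many samples each client holds."""
    total = sum(client_sizes)
    return sum(w * n for w, n in zip(client_weights, client_sizes)) / total

def train(clients, rounds=20):
    """Each round: clients train locally, the server only sees their weights."""
    global_w = 0.0
    for _ in range(rounds):
        updates = [local_update(global_w, data) for data in clients]
        global_w = federated_average(updates, [len(d) for d in clients])
    return global_w

# Two organizations, each holding private samples of y = 2 * x.
clients = [
    [(1.0, 2.0), (2.0, 4.0)],
    [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)],
]
global_w = train(clients)
print(global_w)  # should end up very close to 2.0
```

Only the weights leave each client; the raw `(x, y)` pairs never do. The catch is that the result is still one shared model that every participant has to be willing to hand over.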

Hackworth@lemmy.world · 7 months ago

Until the training load of large models can be distributed across consumer graphics cards (something like SETI@home), the benefit of distributed training doesn’t seem to be enough to overcome the friction.