The demand for privacy-preserving machine learning continues to grow, and federated learning sits at the center of that shift. Unlike traditional approaches where data is pooled into a central location, federated learning trains models right where the data is created—on devices or servers scattered across different locations. What turns this idea from a research paper into working code is a solid framework, and that's where Flower and Hugging Face come into play. Together, they help you build and train powerful models without sacrificing privacy or data locality.
Let’s break it all down, starting with the idea behind federated learning and then walking through how you can actually implement it using these tools.
In standard machine learning, all the data is sent to a central server where training takes place. It's effective, but not always feasible. Think of healthcare records, mobile usage logs, or financial data. Sending all that sensitive information to one place raises red flags on several fronts: privacy, regulation, security, and even bandwidth.
Federated learning flips that model. Instead of pushing the data to the model, it brings the model to the data. Each device or node trains a local version of the model using its own dataset. Then, only the updates (like model weights or gradients) are sent back to a central server. That server doesn’t see the raw data—only the insights gained from it. These updates are then averaged or aggregated, and the improved model is sent back out to the devices.
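Concretely, the aggregation step is usually a weighted average of the client updates, known as federated averaging. A bare-bones sketch of that idea (plain Python over per-layer NumPy arrays, ignoring everything a real framework adds) looks roughly like this:

def federated_average(client_weights, client_sizes):
    # client_weights: one list of NumPy arrays (one per layer) per client
    # client_sizes: number of local training examples each client used
    total = sum(client_sizes)
    return [
        sum(layer * (size / total) for layer, size in zip(layers, client_sizes))
        for layers in zip(*client_weights)
    ]

Clients that trained on more data pull the average more strongly toward their weights, which is the core of the FedAvg algorithm that Flower implements for you.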
It’s collaborative learning without centralization. And when done right, it protects data, reduces risk, and still delivers results.
You might already know Hugging Face for its Transformers library—pretrained models that can be fine-tuned on tasks like text classification, summarization, or question-answering. It simplifies access to powerful architectures like BERT, RoBERTa, and DistilBERT, among others.
Flower, on the other hand, is built specifically for federated learning. It provides all the moving parts: client/server setup, communication protocols, model aggregation, and more. The real beauty lies in how customizable it is. Whether you’re using PyTorch, TensorFlow, or even scikit-learn, Flower plays well with your codebase.
When you put the two together, you get a flexible federated learning pipeline that can support complex natural language tasks without having to reinvent the wheel.
Let’s walk through the steps to build a simple federated learning pipeline using Hugging Face models and Flower as the orchestration tool. We’ll use PyTorch as the backend for this example, but Flower also supports TensorFlow.
Each federated client will have its own chunk of data. For simplicity, suppose you’re doing sentiment analysis using the IMDb dataset. You’ll split it across several clients. Each client keeps its own share and doesn’t share raw data.
from datasets import load_dataset

dataset = load_dataset("imdb")

# Keep a small, shuffled subset so local training stays manageable
train_data = dataset["train"].shuffle(seed=42).select(range(2500))

# Simulate 5 clients by sharding the subset into disjoint slices
client_data = [train_data.shard(num_shards=5, index=i) for i in range(5)]
Here, we’ll pick a lightweight model to keep the training manageable. DistilBERT works well for this.
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)
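The client code below expects data that has already been tokenized. As a minimal sketch (the helper name preprocess, the 256-token limit, and client_datasets are illustrative assumptions, not part of the original walkthrough), each client's IMDb slice could be prepared like this:

def preprocess(split, tokenizer):
    # Tokenize the raw "text" column into fixed-length input IDs and attention masks
    encoded = split.map(
        lambda batch: tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256),
        batched=True,
    )
    # The model expects the target column to be called "labels"
    encoded = encoded.rename_column("label", "labels")
    encoded.set_format(type="torch", columns=["input_ids", "attention_mask", "labels"])
    return encoded

client_datasets = [preprocess(split, tokenizer) for split in client_data]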
Each client handles local training and evaluation. Flower provides a NumPyClient base class that we can extend.
import torch
import flwr as fl

class IMDbClient(fl.client.NumPyClient):
    def __init__(self, model, data):
        self.model = model
        self.data = data  # Preprocessed local data (e.g., the tokenized dataset built earlier)

    def get_parameters(self, config):
        # Serialize the model weights as a list of NumPy arrays
        return [val.cpu().numpy() for val in self.model.state_dict().values()]

    def set_parameters(self, parameters):
        # Load the aggregated weights received from the server
        state_dict = dict(zip(self.model.state_dict().keys(),
                              [torch.tensor(p) for p in parameters]))
        self.model.load_state_dict(state_dict, strict=True)

    def fit(self, parameters, config):
        self.set_parameters(parameters)
        self.model.train()
        # Local training loop here
        return self.get_parameters(config), len(self.data), {}

    def evaluate(self, parameters, config):
        self.set_parameters(parameters)
        self.model.eval()
        # Local evaluation logic here (placeholder loss and accuracy for now)
        return 0.0, len(self.data), {"accuracy": 0.9}
Each client will run its own version of this class with its own dataset slice.
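The training body inside fit is left as a placeholder above. A minimal local training loop might look like the sketch below; the optimizer, learning rate, batch size, and single local epoch are assumptions made for illustration, and dataset is assumed to be the tokenized dataset built earlier:

from torch.optim import AdamW
from torch.utils.data import DataLoader

def train_one_epoch(model, dataset, batch_size=16, device="cpu"):
    # One pass over the client's tokenized local data; the global weights
    # were already loaded via set_parameters before this is called
    model.to(device)
    model.train()
    optimizer = AdamW(model.parameters(), lr=5e-5)
    for batch in DataLoader(dataset, batch_size=batch_size, shuffle=True):
        batch = {k: v.to(device) for k, v in batch.items()}
        loss = model(**batch).loss  # the model returns a loss when "labels" are present
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

Inside fit, the client would call this helper on self.data before returning the updated weights.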
The server handles coordination. It sends out the global model, collects updates, averages them, and sends the refined model back.
fl.server.start_server(
    server_address="localhost:8080",
    config=fl.server.ServerConfig(num_rounds=3),
)
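By default Flower aggregates with a federated-averaging strategy. If you want explicit control over how many clients participate each round, you can pass a strategy yourself; the thresholds below (all five simulated clients every round) are illustrative assumptions:

# Illustrative strategy configuration for the 5-client simulation
strategy = fl.server.strategy.FedAvg(
    fraction_fit=1.0,         # sample 100% of available clients for training
    min_fit_clients=5,        # never train a round with fewer than 5 clients
    min_available_clients=5,  # wait until 5 clients have connected
)

fl.server.start_server(
    server_address="localhost:8080",
    config=fl.server.ServerConfig(num_rounds=3),
    strategy=strategy,
)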
Each client needs to connect to the server and run its own training session.
def start_client(data):
    model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)
    client = IMDbClient(model, data)
    fl.client.start_numpy_client(server_address="localhost:8080", client=client)

# This would be called for each client in parallel
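One simple way to run the clients in parallel on a single machine is to give each one its own process. This is only a local-testing sketch: it assumes the server script is already running in a separate terminal, and it uses client_datasets from the preprocessing sketch above. In a real deployment each client would simply run its own script on its own device.

import multiprocessing

if __name__ == "__main__":
    # Spawn one process per simulated client; each connects to the local server
    processes = [
        multiprocessing.Process(target=start_client, args=(client_datasets[i],))
        for i in range(5)
    ]
    for p in processes:
        p.start()
    for p in processes:
        p.join()

For larger experiments, Flower also ships a simulation mode (flwr.simulation) that manages virtual clients for you.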
Large Transformer models can be a bottleneck when working with low-resource devices. It’s often better to use compact variants like DistilBERT or MobileBERT. These offer decent performance while keeping communication and memory overhead low.
One round of training can involve sending millions of parameters back and forth. Flower offers customization hooks to reduce how frequently updates are shared. You can also use techniques like quantization or sparsification to slim down the payload.
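As a rough illustration of slimming down the payload (plain NumPy, not a Flower-specific API), a client could cast its weights to half precision before uploading them and restore full precision on arrival:

import numpy as np

# Illustrative helpers: float16 on the wire, float32 in the model
def compress(parameters):
    return [p.astype(np.float16) for p in parameters]

def decompress(parameters):
    return [p.astype(np.float32) for p in parameters]

Applying compress inside get_parameters and decompress inside set_parameters roughly halves the size of each exchange at the cost of some numerical precision.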
In some cases, you might not need to update the entire model. Freezing the backbone and only training the classification head can reduce complexity and make the process more efficient.
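For the DistilBERT model used above, freezing the encoder is a few lines; note that the .distilbert attribute is specific to that architecture, so check the attribute name if you swap in another model:

# Freeze the DistilBERT encoder; only the pre_classifier and classifier layers stay trainable
for param in model.distilbert.parameters():
    param.requires_grad = False

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {trainable:,}")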
Federated learning enhances privacy, but it’s not immune to attacks. Differential privacy and secure aggregation are advanced techniques you can integrate with Flower if security is a critical concern.
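As a purely conceptual sketch of the differential-privacy idea (clip each update, then add calibrated Gaussian noise), not Flower's built-in support, a client-side helper might look like this:

import numpy as np

def privatize(update, clip_norm=1.0, noise_multiplier=0.1):
    # Bound the update's overall L2 norm, then add Gaussian noise scaled to the clip bound
    flat = np.concatenate([u.ravel() for u in update])
    scale = min(1.0, clip_norm / (np.linalg.norm(flat) + 1e-12))
    return [
        u * scale + np.random.normal(0.0, noise_multiplier * clip_norm, u.shape)
        for u in update
    ]

Production systems should rely on vetted implementations of differential privacy and secure aggregation rather than hand-rolled noise, but the sketch shows where the protection slots into the update pipeline.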
Federated learning no longer belongs solely in research papers. With tools like Hugging Face and Flower, it’s possible to put real models into production without centralizing data. Hugging Face simplifies the modeling side, while Flower handles the coordination across nodes.
By following a modular approach, you can reuse models you’ve already trained and fine-tune them locally in ways that respect privacy. Whether you’re working with a mobile device fleet or decentralized servers, this setup scales smoothly and respects the constraints that modern data privacy regulations demand.
While there are challenges—like managing model size, communication bandwidth, and update frequency—these can be addressed with smart choices in architecture and optimization. Hugging Face and Flower don’t remove the work, but they definitely make it more practical.