MUFFLe Boosts Federated Learning Efficiency with Deduplication

Xiaobo Zhao, Daniel E. Lucani· June 15, 2026 View original

Summary

This work-in-progress introduces MUFFLe, a communication-efficient scheme for federated learning that uses generalized deduplication to compress model updates. Preliminary experiments show MUFFLe significantly reduces uplink communication costs compared to other compression methods and uncompressed FedAvg while maintaining target accuracy.

This paper introduces MUFFLe, a novel communication-efficient compression scheme designed for federated learning environments, particularly beneficial for edge devices. Federated learning often faces significant limitations due to the high uplink communication costs associated with transmitting model updates from numerous clients to a central server. MUFFLe addresses this by integrating generalized deduplication (GD) into the standard FedAvg pipeline. The core idea behind MUFFLe is to identify and deduplicate repeated patterns within the model update vectors. This approach results in a fixed-rate, variable-count compression scheme, meaning it can adaptively compress updates based on their inherent redundancy. By eliminating redundant information, the amount of data that needs to be transmitted is substantially reduced. Preliminary experimental results, conducted on an IID MNIST dataset with 20 clients, demonstrate MUFFLe's effectiveness. It achieved a target accuracy of 92.93% with a cumulative uplink communication of only 38 MB. This performance significantly outperforms other common compression techniques, such as 8-bit quantization (75 MB) and Top-k sparsification (86 MB), and drastically reduces communication compared to uncompressed FedAvg (310 MB). These findings highlight the viability of applying generalized deduplication to enhance communication efficiency in federated learning.

Why it matters

For organizations deploying federated learning, especially in resource-constrained edge environments, reducing communication overhead is crucial for scalability, speed, and cost-effectiveness. MUFFLe offers a promising solution to make federated learning more practical and efficient.

How to implement this in your domain

  1. 1Evaluate federated learning communication costs: Analyze the current uplink communication burden in your federated learning deployments.
  2. 2Consider MUFFLe integration: Explore integrating generalized deduplication techniques like MUFFLe into your existing FedAvg pipelines.
  3. 3Benchmark compression methods: Compare MUFFLe's performance against other compression strategies (e.g., quantization, sparsification) for your specific models and datasets.
  4. 4Optimize edge device deployments: Apply this communication-efficient approach to enable more robust and scalable federated learning on edge devices.

Who benefits

Edge ComputingIoTHealthcareAutomotiveTelecommunications

Key takeaways

  • Federated learning faces significant communication cost challenges.
  • MUFFLe uses generalized deduplication to compress model updates efficiently.
  • It drastically reduces uplink communication while maintaining accuracy.
  • This approach makes federated learning more practical for edge environments.

Original post by Xiaobo Zhao, Daniel E. Lucani

"arXiv:2606.14354v1 Announce Type: new Abstract: Federated learning is well suited to edge environments but is often limited by the uplink cost of transmitting model updates. This Work-in-Progress paper presents MUFFLe, a communication-efficient update compression scheme that inte…"

View on X

Originally posted by Xiaobo Zhao, Daniel E. Lucani on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses