Expectation: It's a Plug-and-Play Privacy Solution
The number one reason teams explore federated learning (FL) is for privacy. The pitch is simple and seductive: because the raw data never leaves the user's device or the organization's server, you get all the benefits of machine learning without the liability
of centralizing sensitive information. First-timers often assume FL is an architectural choice that automatically solves privacy. The surprise isn't that this is wrong, but that it's incomplete. While FL is a massive step up for privacy, the model updates sent from each device can still leak information. Researchers have shown it's possible, through advanced attacks, to reconstruct bits of the original training data from these updates. As a result, serious implementations require additional privacy-enhancing techniques like differential privacy, which adds statistical noise to mask individual contributions. This creates a new, complex trade-off between privacy and model accuracy that teams are often unprepared to navigate.
Expectation: Simpler Infrastructure and Lower Costs
The logic seems sound: if you're not moving petabytes of user data into a centralized data lake, your infrastructure costs and complexity should plummet. You're just sending small model updates back and forth, which should be cheaper and easier. The surprise is that you're trading one massive challenge (data transfer and storage) for another: managing a complex, unreliable distributed system. Communication becomes a huge bottleneck. Instead of a few stable servers in a data center, you might be dealing with millions of mobile phones with spotty connections and limited battery life. The system must be resilient to devices dropping off mid-training. The orchestration—selecting which devices participate in a training round, aggregating their updates, and pushing the new global model—is a significant engineering lift that many practitioners underestimate.
Expectation: Data Is Just Data, Right?
In traditional machine learning, data scientists spend enormous amounts of time cleaning, labeling, and balancing datasets before training. The hope with federated learning is that you can tap into a vast, real-world data stream. The surprise for many is the sheer statistical chaos of that data. It is what's known as "non-IID"—not independent and identically distributed. In plain English, every device has a unique, biased, and often tiny slice of the overall data. One user of a keyboard app might type in three different languages, while another exclusively uses emojis. A hospital in a wealthy suburb will have different patient data than one in a rural area. This heterogeneity can pull the global model in conflicting directions, making it difficult for the model to converge and perform well for everyone. It's a fundamental statistical challenge that requires new, more robust algorithms.
Expectation: Our Existing Models Will Work Fine
A common assumption is that a team can take the machine learning models they've already developed for centralized training and simply deploy them in a federated setting. After all, it's just a different way to train the same architecture. The reality is that the challenges of communication overhead and statistical heterogeneity often demand entirely new model designs. Models may need to be smaller to reduce the size of the updates being sent over the network. Furthermore, some devices may have different computational capabilities, requiring different or simpler model architectures, a problem known as model heterogeneity. Debugging is also a nightmare. When a centralized model performs poorly, you can inspect the data. In FL, you can't see the data, so diagnosing why a model is failing becomes a difficult exercise in remote troubleshooting and statistical analysis.

















