The issue of group fairness in machine learning (ML) models, where certain sub-populations or groups are favored over others, has been recognized for some time, especially as ML-equipped edge devices such as mobile phones and smartwatches have become deeply embedded in our daily lives.
While many mitigation strategies have been proposed for centralized learning, most of them are not directly applicable in federated learning (FL), where data is stored privately and separately across multiple clients. To address this, many proposals try to mitigate bias at the level of individual clients before aggregation, an approach we call locally fair training (LFT). However, the effectiveness of these approaches is not well understood.
In this work, we investigate the theoretical foundation of locally fair training by studying the relationship between global model fairness and local model fairness. Experiments on real data show that, compared with locally fair training methods, our proposed approach enhances fairness while retaining high accuracy.
In machine learning, group fairness refers to the equitable treatment of different sub-populations within the dataset. For example, in a healthcare application, ensuring that a predictive model does not favor one demographic group over another is critical. Historically, many strategies have been proposed to mitigate bias in centralized learning settings. However, these methods often do not translate well to federated learning due to the decentralized nature of data storage.
Federated learning necessitates a different approach because data remains on local devices, and only model updates are shared. This decentralized setup complicates the direct application of centralized fairness mitigation strategies. Typically, federated learning approaches aim to improve the prediction accuracy of the global model without explicitly addressing fairness, leading to potential biases in the outcomes.
Many existing solutions in federated learning attempt to mitigate bias at the local level, a method known as locally fair training. The idea is that if each local model is fair, the aggregated global model may also be fair. However, the effectiveness of LFT is not well understood.
We explored whether local fairness guarantees global fairness and under what conditions this holds true. This study provides a theoretical foundation for understanding the relationship between local and global model fairness.
1. Formulation of group-based fairness metrics:
We defined group-based and proper group-based fairness metrics. For proper group-based metrics, the global fairness value can be expressed as a function of fairness-related statistics calculated solely by local clients. This allows for the calculation of global fairness without directly accessing local datasets.
2. Analysis of local and global fairness:
The study investigates the relationship between local and global model fairness. It finds that local fairness does not necessarily imply global fairness, and vice versa (see the worked example after this list). However, for proper group-based metrics, global fairness is controlled by the local fairness values and the level of data heterogeneity, which explains the success of LFT methods in near-homogeneous client settings.
3. Introduction of Federated Globally Fair Training (FedGFT):
The study proposes a globally fair training method named FedGFT for proper group-based metrics. FedGFT directly optimizes a regularized objective function consisting of empirical prediction loss and a fairness penalty term. Numerical experiments demonstrate that FedGFT significantly reduces global model bias while retaining high prediction accuracy.
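To make the first point above concrete, here is a small numerical illustration of our own (the numbers are not taken from the paper). A single model can satisfy statistical parity, i.e., equal positive-prediction rates across groups, on every client's local data while violating it on the pooled data. Suppose client 1 holds 90 samples from group $a$ and 10 from group $b$, and the model predicts positive for 10% of each group there, while client 2 holds 10 samples from group $a$ and 90 from group $b$, with a 50% positive rate for each group. Both local gaps are zero, yet globally

$$\Pr(\hat{Y}=1 \mid A=a) = \frac{0.1 \cdot 90 + 0.5 \cdot 10}{100} = 0.14, \qquad \Pr(\hat{Y}=1 \mid A=b) = \frac{0.1 \cdot 10 + 0.5 \cdot 90}{100} = 0.46,$$

a gap of 0.32. The disparity comes entirely from how the two groups are distributed across clients, which is precisely the data heterogeneity effect described above.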
The study introduces a mathematical framework to understand group-based fairness metrics. These metrics quantify the disparity in model performance across different groups. For instance, statistical parity, equal opportunity, and well-calibration are common fairness metrics that fall into this category.
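For a binary classifier with prediction $\hat{Y}$, label $Y$, and a binary sensitive attribute $A$ (standard notation that we add here for illustration), two of these metrics take the familiar forms

$$\text{statistical parity gap} = \Pr(\hat{Y}=1 \mid A=1) - \Pr(\hat{Y}=1 \mid A=0),$$
$$\text{equal opportunity gap} = \Pr(\hat{Y}=1 \mid Y=1, A=1) - \Pr(\hat{Y}=1 \mid Y=1, A=0),$$

and a value close to zero indicates near-equal treatment of the two groups under that metric.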
In this study, we proved that for proper group-based fairness metrics, global fairness can be derived from fairness-related statistics computed locally on each client. This is pivotal, as it allows federated learning systems to provide global fairness without sharing raw data, thus preserving privacy.
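As a concrete sketch of this idea in our own notation (assuming statistical parity and a binary sensitive attribute): if client $k$ reports, for each group $a$, only its local sample count $n_{k,a}$ and the number of those samples predicted positive, $m_{k,a}$, the server can recover the global group-wise positive rates

$$\Pr(\hat{Y}=1 \mid A=a) = \frac{\sum_{k} m_{k,a}}{\sum_{k} n_{k,a}},$$

and therefore the global statistical parity gap, without any raw data leaving the clients.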
From this research, we proposed an algorithm known as Federated Globally Fair Training (FedGFT). Unlike LFT, which only focuses on local fairness, FedGFT optimizes for global fairness by incorporating a fairness regularization term in the objective function. Specifically, FedGFT aims to solve the following problem:
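Schematically, the regularized objective combines the two terms as

$$\min_{\theta} \; \frac{1}{n} \sum_{k=1}^{K} \sum_{i \in \mathcal{D}_k} \ell\bigl(h_\theta(x_i), y_i\bigr) \;+\; \lambda \, \bigl| F(\theta) \bigr|,$$

where the first term is the empirical prediction loss pooled over the $K$ clients' datasets $\mathcal{D}_k$, $F(\theta)$ is the chosen proper group-based fairness metric of the global model, and $\lambda \ge 0$ trades prediction accuracy against fairness (the notation in this display is illustrative).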
This method can handle arbitrary data heterogeneity among clients, making it more robust in real-world scenarios where data distributions vary significantly across clients. At a high level, FedGFT proceeds through the following steps:
1. Initialization: The global model parameters are initialized and distributed to local clients.
2. Local training: Each client updates the model using its local data for a few epochs. The local objective function includes a fairness regularization term.
3. Aggregation: The updated local models are sent back to the central server, which aggregates them to form a new global model.
4. Fairness adjustment: The server updates a fairness-related constant using the summary statistics from local clients.
5. Iteration: Steps 2-4 are repeated for a specified number of communication rounds.
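A minimal sketch of how these steps fit together is given below, assuming a binary sensitive attribute and statistical parity as the penalized metric. The `Client` interface, `local_update`, and `fairness_stats` are hypothetical placeholders used for illustration, not the authors' implementation.

```python
import numpy as np

def fedgft(clients, global_params, rounds=50, lam=1.0, local_epochs=1):
    """Illustrative sketch of the FedGFT loop described above (hypothetical interfaces)."""
    fairness_const = 0.0  # server-side fairness-related constant (updated in step 4)
    for _ in range(rounds):
        updates, weights, group_pos, group_tot = [], [], [], []
        for client in clients:
            # Step 2 -- local training: minimize prediction loss + lam * fairness penalty,
            # using the fairness constant broadcast by the server.
            local_params = client.local_update(
                global_params,
                lam=lam,
                fairness_const=fairness_const,
                epochs=local_epochs,
            )
            updates.append(local_params)
            weights.append(client.num_samples)
            # Clients share only summary statistics: per-group sample counts and
            # per-group counts of positive predictions. Raw data never leaves the client.
            pos, tot = client.fairness_stats(local_params)
            group_pos.append(pos)
            group_tot.append(tot)

        # Step 3 -- aggregation: sample-size-weighted average of client models (FedAvg-style).
        w = np.asarray(weights, dtype=float)
        w /= w.sum()
        global_params = sum(wi * p for wi, p in zip(w, updates))

        # Step 4 -- fairness adjustment: recompute the global statistical parity gap
        # from the pooled summary statistics.
        pos = np.sum(group_pos, axis=0)  # positives per group, shape (2,)
        tot = np.sum(group_tot, axis=0)  # samples per group, shape (2,)
        rates = pos / tot
        fairness_const = rates[1] - rates[0]  # signed disparity between the two groups
    return global_params
```

Each pass through the outer loop corresponds to one communication round (step 5), and `global_params` after the final round is the deployed global model.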
We conducted experiments on three real-world datasets: Adult, COMPAS, and CelebA. These datasets represent different application domains, including income prediction, recidivism risk assessment, and facial attribute classification. The results demonstrate that FedGFT consistently outperforms traditional LFT methods at reducing bias while maintaining high accuracy.
The findings of this study have significant implications for the deployment of federated learning systems in sensitive applications such as healthcare, finance, and criminal justice. By ensuring fairness at a global level, FedGFT can help mitigate the risk of biased outcomes that could exacerbate social inequalities.
The study also opens up several avenues for future research. For instance, we’d like to explore fairness metrics that are not covered by our definition of proper group-based metrics, such as calibration. Additionally, extending the approach to handle multiple sensitive attributes and testing it on larger datasets are potential directions for further investigation.
While establishing fairness in federated learning is a complex but crucial challenge, our research provides a robust theoretical foundation and a practical solution through the FedGFT algorithm. By focusing on global fairness and leveraging summary statistics from local clients, FedGFT offers a promising approach to mitigating bias in federated learning systems.
In summary, this work is a significant step towards more equitable and trustworthy machine learning models in decentralized settings.
Read the full research paper here, and to learn more about Cisco Research, click here.