FedBeam: Reliable Incentive Mechanisms for Federated Learning in UAV-Enabled Internet of Vehicles
Abstract
:1. Introduction
- (1)
- We propose a reliable incentive mechanism (FedBeam) based on game theory, and it can improve the performance and reliability of the model.
- (2)
- We describe the cooperation between model owners (UAVs) and FL clients (vehicles) as a two-stage Stackelberg game and prove the existence and uniqueness of NE. Among UAVs, we formulate the collaboration as a coalitional game. Based on these game theories, we investigate how to balance the benefits of FL clients and owners of FL in order to achieve the Pareto optimum of social utility.
- (3)
- The weighted-beta reputation mechanism (Wbeta) is designed as an effective measure to select reliable clients for FL to prevent some clients from making unreliable schema updates.
- (4)
- The experimental results show that compared to the baseline, the proposed incentive mechanism improves social welfare by 17.6% and test accuracy by 5.5% on simulated and real datasets, respectively.
2. Related Works
3. System Model and Problem Formulation
3.1. Basic Setting
3.2. Utility Model
3.3. Weighted-Beta Reputation Model
- Intentional attack: Under general conditions, there exist some kinds of intentional attacks, e.g., data poisoning attacks or free-ride attacks. On the one hand, data poisoning techniques like label flipping involve changing the training data’s original labels to other random labels and then computing the gradient using these poisonous data. On the other hand, a free-ride attack may not really execute local model training and just upload a random gradient. These two situations seriously damage the accuracy of FL’s final model.
- Unintentional attack: Because each UAV has only part of the data, the distribution of these data may be inconsistent with the overall distribution, which goes directly to non-IID. Although it is very common in reality, it has a negative influence on the accuracy of the final model. Here, we quantify differences in local data distributions using the widely used relative entropy (KL divergence) [28]. The difference in the distribution between the data of UAV n and the overall data can be defined as:Definition 5.The degree of non-IID is relative entropy (KL divergence):
3.4. Problem Formulation
4. Optimal Design of FedBeam
4.1. Analysis of the DPP
- The strategy sets are convex, bounded, and closed.
- The utility functions in the strategy space are quasi-concave and continuous.
4.2. Analysis of the DRP
4.3. Analysis of the CFA
- Split rule: when we have a coalition S, we can split this coalition into two smaller coalitions and , i.e., .
- Joining rule: when there is a coalition S with a player n in it, it wants to leave the current coalition S and join a new coalition , i.e., .
Algorithm 1: The coalition formation algorithm (CFA) |
4.4. Analysis of RUM
Algorithm 2: The reputation update algorithm (RUA) |
5. Experimental Results
5.1. Experiment Setting
- (1)
- Datasets:
- Simulated datasets: The noise power was −152 dBm/Hz. The bandwidth of each was 180 kHz. The transmitter model size was 0.1 Mbits. The maximum transmission power was 10 dBm. The maximum CPU frequency was 3 GHz. The fixed constant was five. The computation energy conversion coefficient ranged from 0.01 to 0.05. The cooperation cost G was one. The hovering cost was within [0.5,1.5], and the circuit cost was within [0.1,0.2]. The specific parameters can be found in Table 2.
- (2)
- Training setup:
- Participants setting: There were 2 to 20 UAVs and 5 to 50 vehicles. We configured 10% to 30% of the vehicles to exhibit free-riding and Byzantine behaviors (equally divided). Each UAV had between 2 to 10 vehicles. The reputation threshold was set to 0.5, and the reputation update parameter was 0.3. Each UAV estimated the model utility from one to two.
- Training parameters: The batch size was set to 32. The number of local epochs e was 10. The number of global training rounds E was 100. The learning rate was set to 0.01, and the SGD momentum was 0.05.
- (3)
- Baselines: to compare our mechanism with other mechanisms, we compared it with a random algorithm RA (randomly selecting contributions from each vehicle in FedBeam), FLBE (FedBeam without the reputation mechanism), and FLIM [17].
- RA: in the random algorithm, within the FedBeam mechanism, each vehicle randomly selected its contribution without choosing the optimal strategy based on the competitive situation.
- FLBE: in FLBE, the incentive method from FedBeam was still used, but without the reputation mechanism.
- FLIM: in [17], the authors proposed a contract-based incentive mechanism but did not consider cooperation among UAVs.
- (4)
- Evaluation metrics: We used three metrics to evaluate our mechanism: reputation value, social utility, and test accuracy. The results were derived from the average of multiple experiments.
- Reputation value: in our mechanism, how the reputation value changed was also a very important metric.
- Social utility: Social utility encompasses the utilities of all UAVs and all vehicles. The objective of our mechanism was to maximize social utility.
- Test accuracy: test accuracy is a crucial metric for model training in FL, representing the performance that UAVs can achieve with their trained models.
5.2. Performance of the Proposed Wbeta
5.3. Impact of Our Proposed Mechanism on Social Utility
5.4. Impact of Our Proposed Mechanism on Test Accuracy
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Li, C.; Song, M.; Luo, Y. Federated learning based on Stackelberg game in unmanned-aerial-vehicle-enabled mobile edge computing. Expert Syst. Appl. 2024, 235, 121023. [Google Scholar] [CrossRef]
- Tong, Z.; Wang, J.; Hou, X.; Chen, J.; Jiao, Z.; Liu, J. Blockchain-Based Trustworthy and Efficient Hierarchical Federated Learning for UAV-Enabled IoT Networks. IEEE Internet Things J. 2024. [Google Scholar] [CrossRef]
- Gad, G.; Farrag, A.; Aboulfotouh, A.; Bedda, K.; Fadlullah, Z.M.; Fouda, M.M. Joint self-organizing maps and knowledge distillation-based communication-efficient federated learning for resource-constrained UAV-IoT systems. IEEE Internet Things J. 2024, 11, 15504–15522. [Google Scholar] [CrossRef]
- Fu, M.; Shi, Y.; Zhou, Y. Federated learning via unmanned aerial vehicle. IEEE Trans. Wirel. Commun. 2023, 23, 2884–2900. [Google Scholar] [CrossRef]
- Zheng, J.; Xu, J.; Du, H.; Niyato, D.; Kang, J.; Nie, J.; Wang, Z. Trust Management of Tiny Federated Learning in Internet of Unmanned Aerial Vehicles. IEEE Internet Things J. 2024, 11, 21046–21060. [Google Scholar] [CrossRef]
- Ke, T.T.; Sudhir, K. Privacy rights and data security: GDPR and personal data markets. Manag. Sci. 2023, 69, 4389–4412. [Google Scholar] [CrossRef]
- Liu, Y.; Kang, Y.; Zou, T.; Pu, Y.; He, Y.; Ye, X.; Ouyang, Y.; Zhang, Y.Q.; Yang, Q. Vertical federated learning: Concepts, advances, and challenges. IEEE Trans. Knowl. Data Eng. 2024, 36, 3615–3634. [Google Scholar] [CrossRef]
- Nguyen, D.C.; Ding, M.; Pathirana, P.N.; Seneviratne, A.; Li, J.; Poor, H.V. Federated learning for internet of things: A comprehensive survey. IEEE Commun. Surv. Tutor. 2021, 23, 1622–1658. [Google Scholar] [CrossRef]
- Qu, Y.; Dai, H.; Zhuang, Y.; Chen, J.; Dong, C.; Wu, F.; Guo, S. Decentralized federated learning for UAV networks: Architecture, challenges, and opportunities. IEEE Netw. 2021, 35, 156–162. [Google Scholar] [CrossRef]
- Pham, Q.V.; Le, M.; Huynh-The, T.; Han, Z.; Hwang, W.J. Energy-efficient federated learning over UAV-enabled wireless powered communications. IEEE Trans. Veh. Technol. 2022, 71, 4977–4990. [Google Scholar] [CrossRef]
- Lu, J.; Liu, H.; Zhang, Z.; Wang, J.; Goudos, S.K.; Wan, S. Toward Fairness-Aware Time-Sensitive Asynchronous Federated Learning for Critical Energy Infrastructure. IEEE Trans. Ind. Inform. 2022, 18, 3462–3472. [Google Scholar] [CrossRef]
- Fadlullah, Z.M.; Kato, N. HCP: Heterogeneous computing platform for federated learning based collaborative content caching towards 6G networks. IEEE Trans. Emerg. Top. Comput. 2020, 10, 112–123. [Google Scholar] [CrossRef]
- Fan, X.; Chen, Y.; Liu, M.; Sun, S.; Liu, Z.; Xu, K.; Li, Z. UAV-Enabled Federated Learning in Dynamic Environments: Efficiency and Security Trade-off. IEEE Trans. Veh. Technol. 2023, 73, 6993–7006. [Google Scholar] [CrossRef]
- Jing, Y.; Qu, Y.; Dong, C.; Ren, W.; Shen, Y.; Wu, Q.; Guo, S. Exploiting UAV for air–ground integrated federated learning: A joint UAV location and resource optimization approach. IEEE Trans. Green Commun. Netw. 2023, 7, 1420–1433. [Google Scholar] [CrossRef]
- Qureshi, K.I.; Wang, L.; Xiong, X.; Lodhi, M.A. Asynchronous Federated Learning for Resource Allocation in Software Defined Internet of UAVs. IEEE Internet Things J. 2023, 11, 20899–20911. [Google Scholar] [CrossRef]
- Wang, Y.; Su, Z.; Zhang, N.; Benslimane, A. Learning in the air: Secure federated learning for UAV-assisted crowdsensing. IEEE Trans. Netw. Sci. Eng. 2020, 8, 1055–1069. [Google Scholar] [CrossRef]
- Lin, S.; Li, Y.; Han, Z.; Zhuang, B.; Ma, J.; Tianfield, H. Joint Incentive Mechanism Design and Energy-Efficient Resource Allocation for Federated Learning in UAV-Assisted Internet of Vehicles. Drones 2024, 8, 82. [Google Scholar] [CrossRef]
- Fu, F.; Wang, Y.; Li, S.; Yang, L.T.; Zhao, R.; Dai, Y.; Yang, Z.; Zhang, Z. Incentive Mechanism Against Bounded Rationality for Federated Learning-Enabled Internet of UAVs: A Prospect Theory-Based Approach. IEEE Internet Things J. 2024, 11, 20958–20969. [Google Scholar] [CrossRef]
- Le, T.H.T.; Tran, N.H.; Tun, Y.K.; Nguyen, M.N.; Pandey, S.R.; Han, Z.; Hong, C.S. An incentive mechanism for federated learning in wireless cellular networks: An auction approach. IEEE Trans. Wirel. Commun. 2021, 20, 4874–4887. [Google Scholar]
- Lim, W.Y.B.; Huang, J.; Xiong, Z.; Kang, J.; Niyato, D.; Hua, X.S.; Leung, C.; Miao, C. Towards federated learning in uav-enabled internet of vehicles: A multi-dimensional contract-matching approach. IEEE Trans. Intell. Transp. Syst. 2021, 22, 5140–5154. [Google Scholar] [CrossRef]
- Ng, J.S.; Lim, W.Y.B.; Dai, H.N.; Xiong, Z.; Huang, J.; Niyato, D.; Hua, X.S.; Leung, C.; Miao, C. Joint auction-coalition formation framework for communication-efficient federated learning in UAV-enabled internet of vehicles. IEEE Trans. Intell. Transp. Syst. 2020, 22, 2326–2344. [Google Scholar] [CrossRef]
- Wang, Y.; Su, Z.; Luan, T.H.; Li, R.; Zhang, K. Federated learning with fair incentives and robust aggregation for UAV-aided crowdsensing. IEEE Trans. Netw. Sci. Eng. 2021, 9, 3179–3196. [Google Scholar] [CrossRef]
- He, G.; Li, C.; Song, M.; Shu, Y.; Lu, C.; Luo, Y. A hierarchical federated learning incentive mechanism in UAV-assisted edge computing environment. Ad Hoc Netw. 2023, 149, 103249. [Google Scholar] [CrossRef]
- Kang, J.; Xiong, Z.; Niyato, D.; Xie, S.; Zhang, J. Incentive mechanism for reliable federated learning: A joint optimization approach to combining reputation and contract theory. IEEE Internet Things J. 2019, 6, 10700–10714. [Google Scholar] [CrossRef]
- Zhan, Y.; Li, P.; Qu, Z.; Zeng, D.; Guo, S. A Learning-Based Incentive Mechanism for Federated Learning. IEEE Internet Things J. 2020, 7, 6360–6368. [Google Scholar] [CrossRef]
- Huang, G.; Chen, X.; Ouyang, T.; Ma, Q.; Chen, L.; Zhang, J. Collaboration in participant-centric federated learning: A game-theoretical perspective. IEEE Trans. Mob. Comput. 2022, 22, 6311–6326. [Google Scholar] [CrossRef]
- Lim, W.Y.B.; Xiong, Z.; Miao, C.; Niyato, D.T.; Yang, Q.; Leung, C.; Poor, H.V. Hierarchical Incentive Mechanism Design for Federated Machine Learning in Mobile Networks. IEEE Internet Things J. 2020, 7, 9575–9588. [Google Scholar] [CrossRef]
- Su, Z.; Wang, Y.; Luan, T.H.; Zhang, N.; Li, F.; Chen, T.; Cao, H. Secure and Efficient Federated Learning for Smart Grid With Edge-Cloud Collaboration. IEEE Trans. Ind. Inform. 2021, 18, 1333–1344. [Google Scholar] [CrossRef]
- Reza, S.M.S.; Hossain, D.A. Malicious vehicle detection based on beta reputation and trust management for secure communication in smart automotive cars network. TELKOMNIKA (Telecommun. Comput. Electron. Control.) 2021, 19, 1688–1696. [Google Scholar]
- Hasan, C. Incentive Mechanism Design for Federated Learning: Hedonic Game Approach. ArXiv 2021, arXiv:2101.09673. [Google Scholar]
- Saad, W.; Han, Z.; Debbah, M.; Hjørungnes, A.; Başar, T. Coalitional Games for Distributed Collaborative Spectrum Sensing in Cognitive Radio Networks. In Proceedings of the IEEE INFOCOM 2009, Rio de Janeiro, Brazil, 19–25 April 2009; pp. 2114–2122. [Google Scholar]
- Guazzone, M.; Anglano, C.; Sereno, M. A Game-Theoretic Approach to Coalition Formation in Green Cloud Federations. In Proceedings of the 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, Chicago, IL, USA, 26–29 May 2014; pp. 618–625. [Google Scholar]
- Sun, Y.; Shao, J.; Mao, Y.; Wang, J.H.; Zhang, J. Semi-decentralized federated edge learning for fast convergence on non-IID data. In Proceedings of the 2022 IEEE Wireless Communications and Networking Conference (WCNC), Austin, TX, USA, 10–13 April 2022; pp. 1898–1903. [Google Scholar]
- LeCun, Y. The MNIST Database of Handwritten Digits. 1998. Available online: http://yann.lecun.com/exdb/mnist/ (accessed on 2 October 2024).
- Cohen, G.; Afshar, S.; Tapson, J.C.; van Schaik, A. EMNIST: Extending MNIST to handwritten letters. In Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–19 May 2017; pp. 2921–2926. [Google Scholar]
- Netzer, Y.; Wang, T.; Coates, A.; Bissacco, A.; Wu, B.; Ng, A.Y. Reading digits in natural images with unsupervised feature learning. In Proceedings of the NIPS Workshop on Deep Learning and Unsupervised Feature Learning, Granada, Spain, 12–17 December 2011; pp. 301–304. [Google Scholar]
- Ayi, M.; El-Sharkawy, M. RMNv2: Reduced Mobilenet V2 for CIFAR10. In Proceedings of the 2020 10th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 6–8 January 2020; pp. 287–292. [Google Scholar]
- Liu, Y.; Li, K.; Jin, Y.; Zhang, Y.; Qu, W. A Novel Reputation Computation Model Based on Subjective Logic for Mobile Ad Hoc Networks. In Proceedings of the 2009 Third International Conference on Network and System Security, Gold Coast, Australia, 19–21 October 2009; pp. 294–301. [Google Scholar]
Variable | Description |
---|---|
w | Model parameters |
Rewards given by UAV n | |
Set of vehicles of UAV n | |
Number of vehicles in UAV n | |
Utility function of vehicle i in UAV n | |
Data contribution of vehicle i in UAV n | |
S | Coalition of UAVs |
N | Total number of UAVs |
Conversion parameter from model performance to profits | |
Fixed constant | |
Computation energy conversion coefficient | |
Reputation of vehicle i | |
G | Cooperation cost |
Profit function of UAV n | |
Dataset held by vehicle i in UAV n | |
Strategy set of all vehicles except i in UAV n | |
Cost of vehicle i in UAV n | |
Performance of coalition S | |
Q | Total data contribution |
Data contribution of UAV | |
Reputation value of vehicle i in one task | |
Weight parameter of negative interactions | |
Cumulative reputation value of vehicle i | |
Reputation threshold | |
Decay coefficient for reputation updates | |
Maximum reward that UAV n can give | |
g | Number of good updates |
b | Number of bad updates |
Learning rate | |
L | Lipschitz constant of the loss function |
Noise level | |
K | Total number of training rounds |
Model parameters at round k | |
Optimal model parameters |
Parameters | Values |
---|---|
The number of vehicles | |
Batch size | 32 |
The number of local epochs e | 10 |
The number of global training rounds E | 100 |
Learning rate | 0.01 |
SGD momentum | 0.05 |
Hovering cost | [0.5,1.5] |
Circuit cost | [0.1,0.2] |
Noise power | |
Bandwidth of each UAV | 180 kHz |
Transmitted model size | 0.1 Mbits |
Maximum transmission power | 10 dBm |
Maximum CPU frequency | 3 GHz |
Fixed constant | 5 |
Computation energy conversion coefficient | [0.01,0.05] |
Cooperation cost G | 1 |
Reputation threshold | 0.5 |
Decay coefficient for reputation updates | 0.3 |
Model estimation parameters | [1,2] |
Vehicles | UAVs | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
2 | 4 | 6 | 8 | 10 | 12 | 14 | 16 | 18 | 20 | |
5 | 9.12 | 15.43 | 20.28 | 23.97 | 27.24 | 29.75 | 31.68 | 33.43 | 34.53 | 36.19 |
10 | 17.99 | 27.10 | 32.60 | 36.78 | 39.48 | 41.66 | 43.32 | 43.89 | 45.18 | 46.14 |
15 | 25.30 | 36.67 | 43.53 | 47.71 | 50.03 | 51.54 | 52.19 | 53.48 | 53.42 | 54.71 |
20 | 31.53 | 43.56 | 50.97 | 55.13 | 58.12 | 60.01 | 61.77 | 62.50 | 63.37 | 63.91 |
25 | 36.45 | 49.76 | 56.60 | 60.69 | 64.14 | 67.36 | 69.14 | 70.03 | 71.09 | 71.82 |
30 | 41.29 | 54.94 | 62.21 | 66.43 | 70.13 | 71.42 | 74.02 | 75.01 | 75.63 | 77.41 |
35 | 45.17 | 59.27 | 66.79 | 69.81 | 74.36 | 76.15 | 77.90 | 79.30 | 81.12 | 82.65 |
40 | 48.70 | 63.09 | 70.51 | 72.81 | 78.01 | 80.21 | 83.31 | 83.04 | 84.82 | 87.33 |
45 | 52.46 | 66.11 | 73.94 | 76.77 | 81.52 | 83.65 | 85.61 | 87.73 | 88.93 | 89.51 |
50 | 54.93 | 68.43 | 76.55 | 79.58 | 84.87 | 87.91 | 88.69 | 91.10 | 92.39 | 94.02 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hu, G.; Zhu, D.; Shen, J.; Hu, J.; Han, J.; Li, T. FedBeam: Reliable Incentive Mechanisms for Federated Learning in UAV-Enabled Internet of Vehicles. Drones 2024, 8, 567. https://doi.org/10.3390/drones8100567
Hu G, Zhu D, Shen J, Hu J, Han J, Li T. FedBeam: Reliable Incentive Mechanisms for Federated Learning in UAV-Enabled Internet of Vehicles. Drones. 2024; 8(10):567. https://doi.org/10.3390/drones8100567
Chicago/Turabian StyleHu, Gangqiang, Donglin Zhu, Jiaying Shen, Jialing Hu, Jianmin Han, and Taiyong Li. 2024. "FedBeam: Reliable Incentive Mechanisms for Federated Learning in UAV-Enabled Internet of Vehicles" Drones 8, no. 10: 567. https://doi.org/10.3390/drones8100567
APA StyleHu, G., Zhu, D., Shen, J., Hu, J., Han, J., & Li, T. (2024). FedBeam: Reliable Incentive Mechanisms for Federated Learning in UAV-Enabled Internet of Vehicles. Drones, 8(10), 567. https://doi.org/10.3390/drones8100567