TY - JOUR
T1 - A Survey on Federated Learning Systems
T2 - Vision, Hype and Reality for Data Privacy and Protection
AU - Li, Qinbin
AU - Wen, Zeyi
AU - Wu, Zhaomin
AU - Hu, Sixu
AU - Wang, Naibo
AU - Li, Yuan
AU - Liu, Xu
AU - He, Bingsheng
PY - 2023/4/1
Y1 - 2023/4/1
N2 - As data privacy increasingly becomes a critical societal concern, federated learning has been a hot research topic in enabling the collaborative training of machine learning models among different organizations under the privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a requirement in developing systems and infrastructures to ease the development of various federated learning algorithms. Similar to deep learning systems such as PyTorch and TensorFlow that boost the development of deep learning, federated learning systems (FLSs) are equivalently important, and face challenges from various aspects such as effectiveness, efficiency, and privacy. In this survey, we conduct a comprehensive review on FLSs. To understand the key design system components and guide future research, we introduce the definition of FLSs and analyze the system components. Moreover, we provide a thorough categorization for FLSs according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation. The categorization can help the design of FLSs as shown in our case studies. By systematically summarizing the existing FLSs, we present the design factors, case studies, and future research opportunities.
AB - As data privacy increasingly becomes a critical societal concern, federated learning has been a hot research topic in enabling the collaborative training of machine learning models among different organizations under the privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a requirement in developing systems and infrastructures to ease the development of various federated learning algorithms. Similar to deep learning systems such as PyTorch and TensorFlow that boost the development of deep learning, federated learning systems (FLSs) are equivalently important, and face challenges from various aspects such as effectiveness, efficiency, and privacy. In this survey, we conduct a comprehensive review on FLSs. To understand the key design system components and guide future research, we introduce the definition of FLSs and analyze the system components. Moreover, we provide a thorough categorization for FLSs according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation. The categorization can help the design of FLSs as shown in our case studies. By systematically summarizing the existing FLSs, we present the design factors, case studies, and future research opportunities.
KW - Collaborative work
KW - Computational modeling
KW - Data models
KW - Data privacy
KW - Deep learning
KW - Machine learning
KW - Servers
UR - http://www.scopus.com/inward/record.url?scp=85118642927&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2021.3124599
DO - 10.1109/TKDE.2021.3124599
M3 - Article
AN - SCOPUS:85118642927
VL - 35
SP - 3347
EP - 3366
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
SN - 1041-4347
IS - 4
ER -