Federated Learning: where machine learning and data privacy can coexist

Overview

Nowadays, a lot of data is generated but cannot be exploited due to its sensitive nature. For instance, we can think of the data collected by the cameras or GPS sensors in our mobile phones, or the performed ultrasounds and X-ray scans, or the data produced by the Internet of Things (IoT). They are all of great value to the world of Big Data and Machine Learning (ML) applications, but are also protected by privacy and, therefore, unusable by traditional methods.

Introduced in 2016 by Google, Federated Learning (FL) is a machine learning scenario born with the aim of using privacy-protected data without violating the regulations in force. This framework deals with learning a central server model in privacy-constrained scenarios, where data are stored on multiple devices (i.e. the clients). Unlike the standard machine learning setting, here the model has no direct access to the data, which never leave the clients’ devices: a fundamental requirement for any application where users’ privacy must be preserved (e.g. medical records, bank transactions).

Goals

Replicate the experiments proposed by this on the CIFAR10 dataset.
Understand the contribution made by each parameter of the federated setting by proposing an experimental study.
Understand the importance of clients’ local data distribution in the federated scenario.
Understand the contribution of the normalization layers.
Make a contribution for solving existing issues.

Options

The default values for various paramters parsed to the experiment are given in options.py. Details are given some of those parameters:

--MODEL: LeNet5, LeNet5_mod, CNNCifar, CNNNet, AllConvNet, mobilenet_v3_small, resnet18, densenet121, googlenet
--ALGORITHM: FedAvg, FedAvgM, FedSGD, FedProx, FedNova, SCAFFOLD
--DISTRIBUTION: iid, non_iid, dirichlet, multimodal
--BATCH_NORM: 0, 1
--GROUP_NORM: 0, 1
--PRETRAIN: 0, 1
--FREEZE: 0, 1 (train only the FC layers)
--NUM_EPOCHS: default 3 (local epochs of clients)
--LR: default 0.01 (learning rate of clients)
--MOMENTUM: default 0.9 (momentum of clients)
--NUM_CLIENTS: default 100
--NUM_SELECTED_CLIENTS: default 25
--ROUNDS: default 25

tm111 / cifar10_federated_learning-advanced_machine_learning Goto Github PK

cifar10_federated_learning-advanced_machine_learning's Introduction

Federated Learning: where machine learning and data privacy can coexist

Overview

Goals

Options

cifar10_federated_learning-advanced_machine_learning's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent