Author: Alexey Kravatsky (maintainer of the repo and a participant of the m1p course)
Co-researcher: Anton Plusnin (sophomore)
Consultant & Expert: Savelii Chezhegov
The problem statement will be formulated rigorously only after the draft abstract is completed.
A modification of Sign-SGD that ensures differential privacy, together with a proof of its high-probability convergence under heavy-tailed noise.
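To make the idea concrete, here is a minimal illustrative sketch of a noisy sign-based update, assuming Gaussian noise is injected into the gradient before the sign is taken. The function name, parameters, and noise mechanism are hypothetical; the algorithm developed in this project, and its privacy accounting, may differ.

```python
import numpy as np

def noisy_sign_sgd_step(w, grad, lr, noise_scale, rng):
    """One sign-based update with Gaussian noise added to the gradient.

    Adding noise before taking the sign is one common way to obtain a
    privatized sign-based update; it is shown here only as a sketch.
    """
    noisy_grad = grad + rng.normal(scale=noise_scale, size=grad.shape)
    return w - lr * np.sign(noisy_grad)

# Toy usage: minimize f(w) = ||w||^2 / 2, whose gradient is w itself.
rng = np.random.default_rng(0)
w = rng.normal(size=5)
for _ in range(200):
    w = noisy_sign_sgd_step(w, grad=w, lr=0.01, noise_scale=0.5, rng=rng)
print(w)  # entries are driven toward 0, up to noise-induced oscillation
```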
The algorithm could be integrated into systems such as ChatGPT, with the prospect of making greater use of corrupted user data, which in turn would improve the accuracy of LLMs.
MNIST database: 28x28 grayscale images of handwritten digits. CIFAR-10 dataset: 60,000 32x32 color images in 10 classes: airplanes, cars, birds, cats, deer, dogs, frogs, horses, ships, and trucks.
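For reference, a short sketch of how both datasets could be loaded, assuming the experiments use torchvision (the actual experimental pipeline is not yet published, so the tooling here is only an assumption):

```python
from torchvision import datasets, transforms

to_tensor = transforms.ToTensor()

# Training splits; download=True fetches the data on first use.
mnist_train = datasets.MNIST(root="./data", train=True, download=True, transform=to_tensor)
cifar_train = datasets.CIFAR10(root="./data", train=True, download=True, transform=to_tensor)

print(len(mnist_train), mnist_train[0][0].shape)  # 60000, torch.Size([1, 28, 28])
print(len(cifar_train), cifar_train[0][0].shape)  # 50000, torch.Size([3, 32, 32])
```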
The project must meet the following requirements. First, the accuracy and complexity of the algorithm must be on par with existing modifications. Second, a sound proof of the algorithm's correctness and, most importantly, of its privacy must be presented. Increased Byzantine resilience, faster performance, and the absence of parameters to tune are also desirable properties of the algorithm.
The main risk is the possibility of flaws in the proofs, although it can be mitigated by rigorous testing (what is false tends not to work). Another is that the algorithm may turn out to be unacceptably complex. Indeed, no one can guarantee that the algorithm we seek exists. Provided that these risks are avoided, the algorithm should be feasible by design, especially when applied to LLM tasks.
A pipeline will be presented after the experiments are completed and the demo data is published.
We will post a link to the preprint when we finalize the work.