DP-SGD implemented in Tabula #14

Open
Zehao-Zhang opened this issue Oct 29, 2024 · 2 comments

Comments

@Zehao-Zhang

Hi Zilong, do you think it would be possible to add DP-SGD to Tabula to address privacy concerns? I tried porting what you did in CTABGAN+ into the tabulatrainer. The fit process runs, but with a large training loss. The problem is that the sample method never produces any result and runs indefinitely. I would appreciate it a lot if you could share some ideas about this.
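
For readers unfamiliar with DP-SGD, here is a minimal sketch of a single update step: each example's gradient is clipped to a fixed L2 norm, the clipped gradients are summed, Gaussian noise is added, and the result is averaged before the usual SGD step. This is a plain-Python illustration and not the code from CTABGAN+ or Tabula; the function name `dp_sgd_step` and its parameters (`clip_norm`, `noise_multiplier`) are hypothetical names chosen for this example.

```python
import math
import random


def dp_sgd_step(params, per_example_grads, lr, clip_norm, noise_multiplier, rng):
    """One DP-SGD update (illustrative sketch, not the CTABGAN+ or Tabula code)."""
    batch_size = len(per_example_grads)
    summed = [0.0] * len(params)
    for grad in per_example_grads:
        # Clip: scale the gradient down if its L2 norm exceeds clip_norm.
        norm = math.sqrt(sum(g * g for g in grad))
        scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
        for i, g in enumerate(grad):
            summed[i] += g * scale
    # Add Gaussian noise with standard deviation noise_multiplier * clip_norm
    # to the summed gradient, then average over the batch.
    sigma = noise_multiplier * clip_norm
    noisy_mean = [(s + rng.gauss(0.0, sigma)) / batch_size for s in summed]
    # Ordinary gradient descent step on the noisy, clipped average.
    return [p - lr * g for p, g in zip(params, noisy_mean)]


if __name__ == "__main__":
    rng = random.Random(0)
    grads = [[3.0, 4.0], [0.1, -0.2]]
    print(dp_sgd_step([0.0, 0.0], grads, lr=0.1, clip_norm=1.0,
                      noise_multiplier=1.0, rng=rng))
```

Because the noise is scaled to the clipping norm rather than to the actual gradient size, a large noise multiplier can dominate the update, which is one plausible reason for a high training loss.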

@zhao-zilong
Owner

Hi @Zehao-Zhang, sorry for the long wait. Implementing DP in Tabula can cause several problems. In Tabula, you first generate a sentence, and that sentence is then translated into tabular data. If DP is added, the generated sentence may be invalid and fail to translate into tabular data. In a GAN, by contrast, DP only makes the generation quality worse; the model still outputs data in a valid format. So the conclusion is that you can add DP wherever SGD is used, but for Tabula it will indeed make generation fail a lot.
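
The failure mode described above can be sketched as follows. The example assumes, purely for illustration, that each generated sentence has the form `column is value, column is value`; the thread does not state the exact encoding, and the helpers `parse_row` and `sample_rows` are hypothetical and not part of the Tabula API. The point is that a sentence that does not parse yields no row, so a sampling loop that waits for enough valid rows may never finish unless the number of attempts is capped.

```python
def parse_row(sentence, columns):
    """Return a dict for a well-formed sentence, or None if it cannot be parsed.

    Assumes the hypothetical format "col is value, col is value".
    """
    row = {}
    for part in sentence.split(","):
        name, sep, value = part.strip().partition(" is ")
        # Reject missing separators, unknown or repeated columns, empty values.
        if not sep or name not in columns or name in row or not value.strip():
            return None
        row[name] = value.strip()
    # Every expected column must be present.
    return row if set(row) == set(columns) else None


def sample_rows(generate_sentence, columns, n_rows, max_attempts):
    """Collect up to n_rows valid rows, giving up after max_attempts generations."""
    rows = []
    attempts = 0
    while len(rows) < n_rows and attempts < max_attempts:
        attempts += 1
        row = parse_row(generate_sentence(), columns)
        if row is not None:
            rows.append(row)
    return rows, attempts


if __name__ == "__main__":
    cols = ["age", "income"]
    print(parse_row("age is 39, income is 50000", cols))
    # A generator that only ever emits malformed text never yields a row.
    print(sample_rows(lambda: "age 39 income", cols, n_rows=5, max_attempts=100))
```

Without the `max_attempts` cap, the loop in `sample_rows` would spin forever on a model that only emits malformed sentences, which matches the symptom of a sample call that never returns.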

@Zehao-Zhang
Author

Hi @zhao-zilong, thank you so much for the reply, it makes a lot of sense. If you don't mind, may I ask what kind of privacy protection could be integrated into your model?
