Publications

GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

Efficient Data Valuation for Weighted Nearest Neighbor Algorithms

Privacy-Preserving In-Context Learning for Large Language Models

DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

Threshold KNN-Shapley: A Linear-Time and Privacy-Friendly Approach to Data Valuation

A Randomized Approach for Tight Privacy Accounting

Data Banzhaf: A Robust Data Valuation Framework for Machine Learning

LAVA: Data Valuation without Pre-Specified Learning Algorithms

Renyi Differential Privacy of Propose-Test-Release and Applications to Private and Robust Machine Learning

Improving Cooperative Game Theory-based Data Valuation via Data Utility Learning

Concurrent Composition of Differential Privacy

DPlis: Boosting Utility of Differentially Private Deep Learning via Randomized Smoothing

RIGA: Covert and Robust White-Box Watermarking of Deep Neural Networks

Improving Robustness to Model Inversion Attacks via Mutual Information Regularization

A Principled Approach to Data Valuation for Federated Learning