Preprints

  1. Xia, X., Zhang, L., Cai, Z. (2025) Statistical inference for differentially private stochastic gradient descent, major revision, Journal of the American Statistical Association.

  2. Cai, Z., Fan, Y., Gao, L. (2025) Knockoffs inference under privacy constraints, R & R, Journal of the Royal Statistical Society Series B.

  3. Sun, J., Cai, Z., Zhong, W. (2025) Stabilized eBH: a unified stability approach to false discovery rate control, major revision, Journal of the American Statistical Association.

  4. Cai, Z., Li, S., Xia, X., Zhang, L. (2025) Private estimation and inference in high-dimensional regression with FDR control, R & R, Journal of Machine Learning Research.

  5. Wang, C., Price, K., Cai, H., Shen, W., Cai, Z., Hu, G. (2025) How much does Home Field Advantage matter in Soccer Games? A causal inference approach for English Premier League analysis


Publications: Theory and Methodology

  1. Xia, X., Zhang, L., Cai, Z. (2025) Differentially private sliced inverse regression: minimax optimality and algorithm, Journal of the American Statistical Association, accepted.

  2. Kanrar, R., Jiang, F., Cai, Z. (2025) Model-free change-point detection using AUC of a classifier, Journal of Machine Learning Research.

  3. Cai, Z., Zhang, Y., Guo, X., Zhu, L., Li, R. (2025) A nonparametric independence test via penalized mutual information, Science China Mathematics.

  4. Gao, Y., Zhang, Z., Cai, Z., Zhu, X., Zou, T., and Wang, H. (2025) Penalized sparse covariance regression with high dimensional covariates, Journal of Business & Economic Statistics.

  5. Awan, J., Cai, Z. (2025) One step to efficient synthetic data, Statistica Sinica.

  6. Xia, X., Cai, Z. (2023) Adaptive false discovery rate control with privacy guarantee, Journal of Machine Learning Research.

  7. Cai, Z., Lei, J., Roeder, K. (2023) Asymptotic distribution-free independence test for high dimension data, Journal of the American Statistical Association. Python code

  8. Du, J., Cai, Z., Roeder, K. (2022) Robust probabilistic modeling for single-cell multimodal mosaic integration and imputation via scVAEIT, Proceedings of the National Academy of Sciences. Python package

  9. Cai, Z., Li, C., Wen, J., Yang, S. (2022) Asset splitting algorithm for ultrahigh dimensional portfolio selection and its theoretical property, Journal of Econometrics.

  10. Cai, Z., Lei, J., Roeder, K. (2022) Model-free prediction test with application to genomics data, Proceedings of the National Academy of Sciences.

  11. Cai, Z., Zhang, Y., Li, R. (2022) A distribution-free conditional independence test with application to causal discovery, Journal of Machine Learning Research. R package

  12. Tong, Z., Cai, Z., Yang, S., Li, R. (2022) Model-free conditional feature screening with FDR control, Journal of the American Statistical Association.

  13. Cai, Z., Xi, D., Zhu, X., Li, R. (2022) Causal discoveries for high dimensional mixed data, Statistics in Medicine. R code

  14. Zhu, X., Cai, Z., Ma, Y. (2021) Network functional autoregression model, Journal of the American Statistical Association. R code

  15. Cai, Z., Li, R., Zhu, L. (2020) Online sufficient dimension reduction through sliced inverse regression, Journal of Machine Learning Research.


Publications: Applications

  1. Buu, A., Tong, Z., Cai, Z., Li, R., Yang, J.J., Jorenby, D.E., Piper, M.E. (2023) Subtypes of dual users of combustible and electronic cigarettes: longitudinal changes in product use and dependence symptomatology, Nicotine and Tobacco Research.

  2. Buu, A., Cai, Z., Li, R., Wong, S.W., Lin, H.C., Su, W.C., Jorenby, D.E., Piper, M.E. (2021) The association between short-term emotion dynamics and cigarette dependence: A comprehensive examination of dynamic measures, Drug and Alcohol Dependence.

  3. Buu, A., Cai, Z., Li, R., Wong, S.W., Lin, H.C., Su, W.C., Jorenby, D.E., Piper, M.E. (2021) Validating e-cigarette dependence scales based on dynamic patterns of vaping behaviors, Nicotine and Tobacco Research.