Data reconstruction from machine learning models via inverse estimation and Bayesian inference

通过逆估计和贝叶斯推断从机器学习模型中重建数据

阅读:2

Abstract

This study explores the task of data reconstruction from machine learning models via inverse estimation and Bayesian inference, with the goal of recovering the original dataset solely based on the trained model. We introduce a novel theoretical framework that investigates the factors affecting the data reconstruction quality. Specifically, we derive expressions that quantify how variations in key variables influence the divergence between true and estimated posteriors by examining the concurrent behavior of their partial derivatives with respect to independent variables. This derivative-based approach establishes theoretical correlations between the variables, demonstrating that the fidelity of the recovered data is governed by two primary factors: (1) the accuracy of the assumed prior, and (2) the accuracy of the machine learning model. Empirical results across multiple benchmark datasets and machine learning algorithms corroborate these theoretical predictions, reinforcing the validity and robustness of our theoretical framework. Practically, our data reconstruction method enables the creation of synthetic models that closely replicate the performance of the original models. This work contributes to advancing the theoretical understanding and practical techniques for data reconstruction and model introspection within the context of machine learning.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。