论文标题
PRIVEDGE:从本地到分布式的私人培训和预测
PrivEdge: From Local to Distributed Private Training and Prediction
论文作者
论文摘要
机器学习作为服务(MLAAS)操作员在云上提供模型培训和预测。 MLAAS应用程序通常依赖于集中式收集和汇总用户数据,这在处理敏感的个人数据时可能会引起严重的隐私问题。为了解决这个问题,我们提出了Privedge,这是一种保护隐私的MLAA技术,可保护为培训提供数据的用户以及使用预测服务的用户的隐私。借助Privedge,每个用户都独立使用其私人数据来本地培训一个单级重建对抗网络,该网络简洁地代表了他们的培训数据。由于将模型参数发送到清晰的服务提供商将揭示私人信息,因此秘密共享两个非碰撞的MLAAS提供商之间的参数,然后通过安全的多方计算技术提供密码私有的预测服务。我们量化了Privedge的好处,并将其绩效与最先进的集中式体系结构进行了比较,这些架构在三个基于隐私敏感的任务上:个人身份识别,作家身份和手写字母识别。实验结果表明,Privedge在保存隐私以及区分私人图像和非私人图像方面具有很高的精度和回忆。此外,我们显示了Privedge对图像压缩和偏见训练数据的鲁棒性。源代码可在https://github.com/smartcameras/privedge上找到。
Machine Learning as a Service (MLaaS) operators provide model training and prediction on the cloud. MLaaS applications often rely on centralised collection and aggregation of user data, which could lead to significant privacy concerns when dealing with sensitive personal data. To address this problem, we propose PrivEdge, a technique for privacy-preserving MLaaS that safeguards the privacy of users who provide their data for training, as well as users who use the prediction service. With PrivEdge, each user independently uses their private data to locally train a one-class reconstructive adversarial network that succinctly represents their training data. As sending the model parameters to the service provider in the clear would reveal private information, PrivEdge secret-shares the parameters among two non-colluding MLaaS providers, to then provide cryptographically private prediction services through secure multi-party computation techniques. We quantify the benefits of PrivEdge and compare its performance with state-of-the-art centralised architectures on three privacy-sensitive image-based tasks: individual identification, writer identification, and handwritten letter recognition. Experimental results show that PrivEdge has high precision and recall in preserving privacy, as well as in distinguishing between private and non-private images. Moreover, we show the robustness of PrivEdge to image compression and biased training data. The source code is available at https://github.com/smartcameras/PrivEdge.