TY - GEN
T1 - Latency-Aware Inference on Convolutional Neural Network Over Homomorphic Encryption
AU - Ishiyama, Takumi
AU - Suzuki, Takuya
AU - Yamana, Hayato
N1 - Funding Information:
This work was supported by JST CREST Grant Number JPMJCR1503, Japan.
Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Homomorphic encryption enables privacy-preserving computation in convolutional neural networks (CNNs), keeping their input and output secret from the server; however, it faces long latency because of large overhead of the encryption scheme. This paper tackles shortening the inference latency on homomorphic encryption-enabled CNNs. Since the highest inference accuracy is not always needed depending on real-world applications, finding best-fit combinations of latency and accuracy is also indispensable. We propose a combination of channel-wise packing and a structured pruning technique besides changing the active functions to shorten the inference latency while allowing accuracy degradation. Our experimental evaluation shows that we successfully tune the latency from 8.1 s to 12.9 s depending on the accuracy of 66.52% to 80.96% on the CIFAR-10 dataset.
AB - Homomorphic encryption enables privacy-preserving computation in convolutional neural networks (CNNs), keeping their input and output secret from the server; however, it faces long latency because of large overhead of the encryption scheme. This paper tackles shortening the inference latency on homomorphic encryption-enabled CNNs. Since the highest inference accuracy is not always needed depending on real-world applications, finding best-fit combinations of latency and accuracy is also indispensable. We propose a combination of channel-wise packing and a structured pruning technique besides changing the active functions to shorten the inference latency while allowing accuracy degradation. Our experimental evaluation shows that we successfully tune the latency from 8.1 s to 12.9 s depending on the accuracy of 66.52% to 80.96% on the CIFAR-10 dataset.
KW - Channel pruning
KW - Convolutional neural network
KW - Homomorphic encryption
KW - Privacy-preserving machine learning
UR - http://www.scopus.com/inward/record.url?scp=85145008511&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145008511&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-21047-1_27
DO - 10.1007/978-3-031-21047-1_27
M3 - Conference contribution
AN - SCOPUS:85145008511
SN - 9783031210464
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 324
EP - 337
BT - Information Integration and Web Intelligence - 24th International Conference, iiWAS 2022, Proceedings
A2 - Pardede, Eric
A2 - Delir Haghighi, Pari
A2 - Khalil, Ismail
A2 - Kotsis, Gabriele
PB - Springer Science and Business Media Deutschland GmbH
T2 - 24th International Conference on Information Integration and Web Intelligence, iiWAS 2022, held in conjunction with the 20th International Conference on Advances in Mobile Computing and Multimedia Intelligence, MoMM 2022
Y2 - 28 November 2022 through 30 November 2022
ER -