Watermarking of Electronic Patient Record in Parkinson Disease Affected Speech: A Robust and Secure Audio Hiding Technique for Smart e-healthcare Application

Aniruddha Kanhe; Aghila Gnanasekaran

Discontinuity, Nonlinearity, and Complexity

Dimitry Volchenkov (editor), Dumitru Baleanu (editor)

Watermarking of Electronic Patient Record in Parkinson Disease Affected Speech: A Robust and Secure Audio Hiding Technique for Smart e-healthcare Application

Discontinuity, Nonlinearity, and Complexity 10(1) (2021) 117--134 | DOI:10.5890/DNC.2021.03.008

Aniruddha Kanhe , Aghila Gnanasekaran

Department of Electronics and Communication Engineering, National Institute of Technology Puducherry India Department of Computer Science and Engineering, National Institute of Technology Puducherry India

Download Full Text PDF

Abstract

A cloud-based framework for security of electronic patient record, in telediagnosis of Parkinson's disease (PD) speech signal has been proposed in this paper. The security and authenticity of patient's personal information is achieved using proposed Discrete Cosine Transform-Singular Value Decomposition (DCT-SVD) based audio watermarking technique. The automatic classification of PD affected speech signal from healthy person's speech signal requires high computational accuracy and in this work, it is addressed by extracting various time frequency features for the Support Vector Machine (SVM) classifier. In the proposed framework the speech signal of PD suspected person is recorded by a mobile application. The personal information is securely embedded in this speech signal using proposed DCT-SVD based audio watermarking technique, before transmitting to cloud for automatic classification. To ensure the security of electronic patient record (EPR), proposed watermarking technique is tested against various signal processing attacks and the performance of classifier has been evaluated by computing classification sensitivity, specificity, accuracy and area under the curve of receiver operating characteristics (ROC). The theoretical proofs and experimental results, show that proposed framework provides telediagnosis of PD affect patients without compromising the security and privacy of patient's records. The classification accuracy of $89\%$ with high data payload of $6kbps$ is proposed leading to a smart and secure e-healthcare application.

References

[1]	Zhang, Y., Liu, H., Su, X., Jiang, P., and Wei, D. (2015), Remote mobile health monitoring system based on smart phone and browser/server structure, { Journal of healthcare engineering}, {\bf 6}, 717-738.

[2]	Archip, A., Botezatu, N., Serban, E., Herghelegiu, P.C., and Zala, A. (2016), An iot based system for remote patient monitoring, { 2016 17th International Carpathian Control Conference (ICCC)}, May, pp. 1-6.

[3]	Stach, C. and Mitschang, B. (2016), The secure data container: An approach to harmonize data sharing with information security, { 2016 17th IEEE International Conference on Mobile Data Management (MDM)}, 1, pp. 292-297.

[4] Baig, M.M. and Gholamhosseini, H. (2013), Smart health monitoring systems: An overview of design and modeling, { Journal of Medical Systems}, {\bf 37}, 9898.

[5]	Dimitriou, T. and Ioannis, K. (2008), Security issues in biomedical wireless sensor networks. { 2008 First International Symposium on Applied Sciences on Biomedical and Communication Technologies}, 1-5.

[6] Office for Civil Rights and United State Department of Health and Human Services, (2015), Medical privacy.

[7]	vander Marck, M., Kalf, J., Sturkenboom, I., Nijkrake, M., Munneke, M., and Bloem, B. (2009), Multidisciplinary care for patients with parkinsons disease. { Parkinsonism and Related Disorders}, {\bf 15}, S219-S223, proceedings of WFN XVIII World Congress on Parkinsons Disease and Related Disorders.

[8] Ho, A.K., Bradshaw, J.L., and Iansek, R. (2000), Volume perception in parkinsonian speech, { Movement Disorders}, {\bf 15}, 1125-1131.
[9] Sewall, G.K., Jiang, J., and Ford, C.N. (2006), Clinical evaluation of parkinsons-related dysphonia, { The laryngoscope}, {\bf 116}, 1740-1744.

[10]	Gamboa, J., Jimenez-Jimenez, F.J., Nieto, A., Montojo, J., Orti-Pareja, M., Molina, J.A., Garcia-Albea, E., and Cobeta, I. (1997), Acoustic voice analysis in patients with parkinsons disease treated with dopaminergic drugs, { Journal of Voice}, {\bf 11}, 314 - 320.

[11]	Huh, Y.E., Park, J., Suh, M.K., Lee, S.E., Kim, J., Jeong, Y., Kim, H.-T., and Cho, J.W. (2015), Differences in early speech patterns between parkinson variant of multiple system atrophy and parkinson`s disease, { Brain and language}, {\bf 147}, 14-20.

[12]	Tykalova, T., Rusz, J., Klempir, J., Cmejla, R., and Ruzicka, E. (2017), Distinct patterns of imprecise consonant articulation among parkinsons disease, progressive supranuclear palsy and multiple system atrophy, { Brain and Language}, {\bf 165}, 1-9.

[13]	Shahbakhi, M., Far, D.T., and Tahami, E. (2014), Speech analysis for diagnosis of parkinsons disease using genetic algorithm and support vector machine, { Journal of Biomedical Science and Engineering}, {\bf 7}, 147-156.

[14]	Tsanas, A., Little, M.A., McSharry, P.E., Spielman, J., and Ramig, L.O. (2012), Novel speech signal processing algorithms for high-accuracy classification of parkinsons disease, { IEEE Transactions on Biomedical Engineering}, {\bf 59}, 1264-1271.

[15] Zhang, Y. (2017), Can a smartphone diagnose parkinson disease? a deep neural network method and telediagnosis system implementation, { Parkinsons Disease}, {\bf 2017}.

[16]	Parah, S.A., Sheikh, J.A., Ahad, F., Loan, N.A., and Bhat, G.M. (2017), Information hiding in medical images: a robust medical image watermarking system for e-healthcare, { Multimedia Tools and Applications}, {\bf 76}, 10599-10633.

[17]	Loan, N.A., Parah, S.A., Sheikh, J.A., Akhoon, J.A., and Bhat, G.M. (2017), Hiding electronic patient record (epr) in medical images: A high capacity and computationally efficient technique for e-healthcare applications, { Journal of Biomedical Informatics}, {\bf 73}, 125-136.

[18]	Parah, S.A., Sheikh, J.A., Ahad, F., and Bhat, G. (2018), High capacity and secure electronic patient record (epr) embedding in color images for iot driven healthcare systems, { Internet of Things and Big Data Analytics Toward Next-Generation Intelligence}, pp. 409-437, Springer.

[19]	Dey, N., Ashour, A.S., Chakraborty, S., Banerjee, S., Gospodinova, E., Gospodinov, M., and Hassanien, A.E. (2017), Watermarking in biomedical signal processing, { Intelligent Techniques in Signal Processing for Multimedia Security}, pp. 345-369, Springer.

[20]	Alhussein, M. and Muhammad, G. (2015), Watermarking of parkinson disease speech in cloud-based healthcare framework, { International Journal of Distributed Sensor Networks}, {\bf 11}, 264575.

[21]	Ali, Z., Imran, M., Abdul, W., and Shoaib, M. (2018), { An Innovative Algorithm for Privacy Protection in a Voice Disorder Detection System}, pp. 228-233, Springer International Publishing.

[22] Bender, W., Gruhl, D., Morimoto, N., and Lu, A. (1996), Techniques for data hiding, { IBM Systems Journal}, {\bf 35}, 313-336.
[23] Cvejic, N. and Seppanen, T. (2002), Increasing the capacity of lsb-based audio steganography, { Multimedia Signal Processing, 2002 IEEE Workshop on}, Dec, pp. 336-338.
[24] Bhowal, K., Bhattacharyya, D., JyotiPal, A., and Kim, T.-H. (2013), A ga based audio steganography with enhanced security, { Telecommun. Syst.}, {\bf 52}, 2197-220.

[25]	Kanhe, A., Aghila, G., Kiran, C.S., Ramesh, C., Jadav, G., and Raj, M. (2015), Robust audio steganography based on advanced encryption standards in temporal domain, { Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on}, Aug, pp. 1449-1453.

[26] Erfani, Y. and Siahpoush, S. (2009), Robust audio watermarking using improved ts echo hiding, { Digital Signal Processing}, {\bf 19}, 809-814.

[27]	Korzhik, V., Morales-Luna, G., and Fedyianin, I. (2013), Audio watermarking based on echo hiding with zero error probability, { International Journal of Computer Science and Applications, Technomathematics Research Foundation}, {\bf 10}, 1-10.

[28] Fallahpour, M. and Megias, D. (2015), Audio watermarking based on fibonacci numbers, { IEEE/ACM Transactions on Audio, Speech, and Language Processing}, {\bf 23}, 1273-1282.
[29] Fallahpour, M. and Megias, D. (2014), Robust audio watermarking based on fibonacci numbers, { 2014 10th International Conference on Mobile Ad-hoc and Sensor Networks}, Dec, pp. 343-349.

[30]	Ahani, S., Ghaemmaghami, S., and Wang, Z.J. (2015), A sparse representation-based wavelet domain speech steganography method, { Audio, Speech, and Language Processing, IEEE/ACM Transactions on}, {\bf 23}, 80-91.

[31]	Shah, P., Choudhari, P., and Sivaraman, S. (2008), Adaptive wavelet packet based audio steganography using data history, { 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems}, Dec, pp. 1-5.

[32] Hu, H.-T. and Chang, J.-R. (2017), Efficient and robust frame-synchronized blind audio watermarking by featuring multilevel dwt and dct, { Cluster Computing}, {\bf 20}, 805-816.
[33] Kanhe, A. and Aghila, G. (2018), A dct-svd-based speech steganography in voiced frames. { Circuits, Systems, and Signal Processing}.
[34] Hua, G., Huang, J., Shi, Y.Q., Goh, J., and Thing, V.L. (2016), Twenty years of digital audio watermarking-a comprehensive review, { Signal Processing}, {\bf 128}, 222 - 242.
[35] Kanhe, A. and Gnanasekaran, A. (2018), A qim-based energy modulation scheme for audio watermarking robust to synchronization attack, { Arabian Journal for Science and Engineering}.

[36]	Hu, H.-T., Lin, S.-J., and Hsu, L.-Y. (2017), Effective blind speech watermarking via adaptive mean modulation and package synchronization in dwt domain, { EURASIP Journal on Audio, Speech, and Music Processing}, {\bf 2017}, 10.

[37] Kaur, A. and Dutta, M.K. (2017), An optimized high payload audio watermarking algorithm based on lu-factorization, { Multimedia Systems}.

[38]	Elshazly, A.R., Nasr, M.E., Fouad, M.M., and Abdel-Samie, F.S. (2017), High payload multi-channel dual audio watermarking algorithm based on discrete wavelet transform and singular value decomposition, { Int. J. Speech Technol.}, {\bf 20}, 951-958.

[39]	Sakar, B.E., Isenkul, M.E., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S., Apaydin, H., and Kursun, O. (2013), Collection and analysis of a parkinson speech dataset with multiple types of sound recordings, { IEEE Journal of Biomedical and Health Informatics}, {\bf 17}, 828-834.

[40]	Little, M.A., McSharry, P.E., Hunter, E.J., Spielman, J., Ramig, L.O., etal. (2009), Suitability of dysphonia measurements for telemonitoring of parkinsons disease, { IEEE transactions on biomedical engineering}, {\bf 56}, 1015-1022.

[41] Woldert-Jokisz, B. (2007), Saarbruecken voice database, { Institute of Phonetics, Saarland University}.

[42]	KBhat, V., Sengupta, I., and Das, A. (2011), A new audio watermarking scheme based on singular value decomposition and quantization, { Circuits, Systems, and Signal Processing}, {\bf 30}, 915-927.

[43] Ali, A.-H. (2014), An imperceptible and robust audio watermarking algorithm. { EURASIP Journal on Audio, Speech, and Music Processing}, {\bf 2014}, 37.
[44] Chang, C.-C. and Lin, C.-J. (2011), Libsvm: A library for support vector machines, { ACM Transactions on Intelligent Systems and Technology}, {\bf 2}, 27:1-27:27.

[45]	Polat, K. (2012), Classification of parkinsons disease using feature weighting method on the basis of fuzzy c-means clustering, { International Journal of Systems Science}, {\bf 43}, 597-609.

[46]	Chen, H.-L., Huang, C.-C., Yu, X.-G., Xu, X., Sun, X., Wang, G., and Wang, S.-J. (2013), An efficient diagnosis system for detection of parkinsons disease using fuzzy k-nearest neighbor approach, { Expert Systems with Applications}, {\bf 40}, 263 - 271.

[47] Daliri, M.R. (2013), Chi-square distance kernel of the gaits for the diagnosis of parkinsons disease, { Biomedical Signal Processing and Control}, {\bf 8}, 66 - 70.

[48]	Ephraim, Y. and Malah, D. (1985), Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, { IEEE Transactions on Acoustics, Speech, and Signal Processing}, {\bf 33}, 443-445.

[49] Boll, S. (1979), Suppression of acoustic noise in speech using spectral subtraction, { IEEE Transactions on Acoustics, Speech, and Signal Processing}, {\bf 27}, 113-120.

[50]	Scalart, P. and Filho, J.V. (1996), Speech enhancement based on a priori signal to noise estimation, { 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings}, May, vol.2, pp. 629-632 vol. 2.