Patients with severe hearing loss have the option to get a cochlear implant device to regain their hearing. Yet, the implantation process is not always optimal, which in some cases results in a shallow insertion depth or an accidental insertion into the wrong cochlear duct. As a consequence, the patients' pitch discrimination ability is suboptimal, leading to an even more decreased vowel identification, which is vital for speech recognition. This paper presents a technical approach to solve this problem: the adaptive pitch transposition module modifies the frequency content in a fashion so that the pitch is fixed to an optimal value. To determine this value, a patient-individual best pitch is determined experimentally by evaluating speech recognition at different pitches. This best pitch is subsequently called the comfort pitch. As a result of the considerations a technical implementation is presented in principle. A system comprised of pitch detection, pitch transposition and an arbitrary chosen comfort pitch is described in depth. It has been implemented prototypically in Matlab/Octave and tested with an example audio file. The system?itself is designed as a preprocessing stage preceding cochlear implant processing.
References
[1]
Struwe, K. (2017) APT: Enhanced Speech Comprehension Through Adaptive Pitch Transposition in Cochlear Implants. In: Giokas, K., Bokor, L.-Z. and Hopfgartner, F., Eds., eHealth 360?: International Summit on eHealth, Budapest, 14-16 June, 2016, Revised Selected Papers. Springer International Publishing, 224-228.
[2]
Shannon, R.V., et al. (2004) Speech Perception with Cochlear Implants. In: Cochlear Implants: Auditory Prostheses and Electric Hearing, Springer, 334-376.
[3]
Zeng, F.-G., Tang, Q. and Lu, T. (2014) Abnormal Pitch Perception Produced by Cochlear Implant Stimulation. PloS One, 9, e88662.
[4]
Laneau, J., Wouters, J. and Moonen, M. (2006) Improved Music Perception with Explicit Pitch Coding in Cochlear Implants. Audiology and Neurotology, 11, 38-52.
[5]
Francart, T., Osses, A. and Wouters, J. (2015) Speech Perception with F0mod, a Cochlear Implant Pitch Coding Strategy. International Journal of Audiology, 54, 424-432.
[6]
De Cheveigne, A. (2005) Pitch Perception Models. In: Pitch, Springer, 169-233.
[7]
Patterson, R.D., Gaudrain, E. and Walters. T.C. (2010) The Perception of Family and Register in Musical Tones. In: Music Perception, Springer, 13-50.
[8]
Mak, M.-W. and Yu, H.-B. (2014) A Study of Voice Activity Detection Techniques for NIST Speaker Recognition Evaluations. Computer Speech & Language, 28, 295-313.
[9]
Charpentier, F.J. and Stella, M.G. (1986) Diphone Synthesis Using an Overlap-Add Technique for Speech Waveforms Concatenation. Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'86, 11, 2015-2018.
[10]
Flanagan, J.L. and Golden, R.M. (1966) Phase Vocoder. Bell System Technical Journal, 45, 1493-1509.
[11]
Ellis, D.P.W. (2002) A Phase Vocoder in Matlab.
http://www.ee.columbia.edu/ln/rosa/matlab/pvoc/
[12]
Laroche, J. and Dolson. M. (1999) Improved Phase Vocoder Time-Scale Modification of Audio. IEEE Trans-actions on Speech and Audio Processing, 7, 323-332.