An extra-cochlear stimulation system has been investigated as a less invasive alternative to conventional cochlear implant; however, the system is used primarily as a speech-reading aid. The purpose of this study was to develop a speech encoding scheme for the extra-cochlear stimulation system to convey intelligible speech. A click-modulated speech sound (CMS) was created as a simulation of the extra-cochlear stimulation system. The CMS is a repetitive click with a repetition rate similar to the formant frequency transition of an original sound. Seven native Japanese speakers with normal hearing participated in the experiment. After listening to the CMS, synthesized from low familiarity Japanese words, the subjects reported their perceptions. The results showed that the rates of correctly identified vowels and consonants were significantly higher than those of the control stimulus, suggesting that the CMS can generate at least partially intelligible vowel and consonant perceptions. In all, the speech encoding scheme could be applied to the extra-cochlear stimulation system to restore speech perception.
References
[1]
Grillo, G., Kiang, N.Y. and Moxon, E.C. (1972) Physiological Considerations in Artificial Stimulation of the Inner Ear. Annals of Otology, Rhinology & Laryngology, 81, 714-730. https://doi.org/10.1177/000348947208100513
[2]
Fourcin, A.J., Rosen, B.C., Moore, B.C.J., Douek, E.E., Clarke, G.P., Dodson, H. and Bannister, L.H. (1979) External Electrical Stimulation of the Cochlea: Clinical, Psychophysical, Speech-Perceptual and Histological Findings. British Journal of Audiology, 13, 85-107. https://doi.org/10.3109/03005367909078883
[3]
Rosen, S. and Ball, V. (1986) Speech Perception with the Vienna Extra-Cochlear Single-Channel Implant: A Comparison of Two Approaches to Speech Coding. British Journal of Audiology, 20, 61-83. https://doi.org/10.3109/03005368609078999
[4]
Nie, K., Stickney, G. and Zeng, F.G. (2005) Encoding Frequency Modulation to Improve Cochlear Implant Performance in Noise. IEEE Transactions on Biomedical Engineering, 52, 64-73. https://doi.org/10.1109/TBME.2004.839799
[5]
Baskent, D. and Shannon, R.V. (2004) Frequency-Place Compression and Expansion in Cochlear Implant Listeners. The Journal of the Acoustical Society of America, 116, 3130-3140. https://doi.org/10.1121/1.1804627
[6]
Peterson, G.E. and Barney, H.L. (1952) Control Methods Used in a Study of the Vowels. The Journal of the Acoustical Society of America, 24, 175-184. https://doi.org/10.1121/1.1906875
[7]
Remez, R.E., Rubin, P.E., Pisoni, D.B. and Carrell, T.D. (1981) Speech Perception without Traditional Cues. Science, 212, 947-949. https://doi.org/10.1126/science.7233191
[8]
Remez, R.E., Fellowes, J.M., Pisoni, D.B., Goh, W.D. and Rubin, P.E. (1998) Multimodal Perceptual Organization of Speech: Evidence from Tone Analogs of Spoken Utterances. Speech Communication, 26, 65-73. https://doi.org/10.1016/S0167-6393(98)00050-8
[9]
Roberts, B., Summers, R.J. and Bailey, P.J. (2010) The Perceptual Organization of Sine-Wave Speech under Competitive Conditions. The Journal of the Acoustical Society of America, 128, 804-817. https://doi.org/10.1121/1.3445786
[10]
Feng, Y.M., Xu, L., Zhou, N., Yang, G. and Yin, S.K. (2012) Sine-Wave Speech Recognition in a Tonal Language. The Journal of the Acoustical Society of America, 131, EL133-EL138. https://doi.org/10.1121/1.3670594
[11]
Brungart, D.S., Simpson, B.D., Darwin, C.J., Arbogast, T.L. and Kidd Jr., G. (2005) Across-Ear Interference from Parametrically Degraded Synthetic Speech Signals in a Dichotic Cocktail-Party Listening Task. The Journal of the Acoustical Society of America, 117, 292-304. https://doi.org/10.1121/1.1835509
[12]
Coath, M., Sheik, S., Chicca, E., Indiveri, G., Denham, S. and Wennekers, T. (2014) A Robust Sound Perception Model Suitable for Neuromorphic Implementation. Frontiers in Neuroscience, 7, 1-10. https://doi.org/10.3389/fnins.2013.00278
[13]
Elgendi, M., Bobhate, P., Jain, S., Guo, L., Kumar, S., Rutledge, J., Coe, Y., Zemp, R., Schuurmans, D. and Adatia, I. (2015) The Unique Heart Sound Signature of Children with Pulmonary Artery Hypertension. Pulmonary Circulation, 5, 631-639. https://doi.org/10.1086/683694
[14]
Kondo, T., Amano, S., Sakamoto, S. and Suzuki, Y. (2008) Development of Familiarity-Controlled Word-Lists (FW07). IEICE Technical Report, 107, 43-48.
[15]
Tukey, J.W. (1991) The Philosophy of Multiple Comparisons. Statistical Science, 6, 100-116. https://doi.org/10.1214/ss/1177011945
[16]
Hillenbrand, J., Getty, L.A., Clark, M.J. and Wheeler, K. (1995) Acoustic Characteristics of American English Vowels. The Journal of the Acoustical Society of America, 97, 3099-3111. https://doi.org/10.1121/1.411872
[17]
Lisker, L. (1995) Is It VOT or a First-Formant Transition Detector? The Journal of the Acoustical Society of America, 57, 1547-1551. https://doi.org/10.1121/1.380602
[18]
Kluender, K.R. and Lotto, A.J. (1994) Effects of First Formant Onset Frequency on [-Voice] Judgments Result from Auditory Processes Not Specific to Humans. The Journal of the Acoustical Society of America, 95, 1044-1052. https://doi.org/10.1121/1.408466
[19]
Benki, J.R. (2001) Place of Articulation and First Formant Transition Pattern Both Affect Perception of Voicing in English. Journal of Phonetics, 29, 1-22. https://doi.org/10.1006/jpho.2000.0128
[20]
Sumby, W.H. and Pollack, I. (1954) Visual Contribution to Speech Intelligibility in Noise. The Journal of the Acoustical Society of America, 26, 212-215. https://doi.org/10.1121/1.1907309
[21]
Goh, W.D., Pisoni, D.B., Kirk, K.I. and Remez, R.E. (2001) Audio-Visual Perception of Sinewave Speech in an Adult Cochlear Implant User: A Case Study. Ear and Hearing, 22, 412-419. https://doi.org/10.1097/00003446-200110000-00005