Structural Organization and Features of Speech and Music Perception in Language Functions’ Implementation with Timbre’s Perceptive Assessment

I. G. Andreeva, M. Dymnikowa, E. A. Ogorodnikova


In cognitive science, the psychological and biological foundations of music are often considered in the context of comparison with the functions of speech and language. The article describes some aspects of the similarities and differences between music and speech in the implementation of language functions, as well as a typological study of its features according to Buhler, Jakobson, Halliday and Kiklewicz classifications. A general comparison of structural organization and main perceptual characteristics of music and speech signals is carried out additionally, with separate consideration of neurophysiological foundations of the auditory analysis of timbre characteristics of speech sources (speaker's voice) and music (instrument sounds). Some aspects of the practical application of auditory and musical training in the context of training and rehabilitation measures for sensory-cognitive dysfunctions of various genesis are explained, with the prospects of further research aimed at studying the influence of the timbre on the indices of spatial selectivity of musical perception in comparison with the characteristics of the spatial selectivity of speech hearing in the perception of complex acoustic scenes.


music and speech similarities with differences, music and speech timbre characteristics, language's function.

Full Text:



Alexander, J. A., Wong, P. C. M., Bradlow, A. R. (2005). Lexical tone perception in musicians and non-musicians. Thesis of proceedings of interspeech’, Eurospeech – 9th European Conference on Speech Communication and Technology, September 2005, Lisbon, Portugal.

Allen, E. J., Burton, P. C., Olman, Ch. A., Oxenham, A. J. (2017). Representations of pitch and timbre variation in human auditory cortex. Journal of neuroscience, February, 37 (5), p.1284-1293.

Allen, E. J., Oxenham, A. J. (2014) Symmetric interactions and interference between pitch and timbre. Journal of the acoustical society of America, March, 135 (3), p.1371-1379.

ANSI (1999). American National Psychoacoustic terminology. S3.20. ed. New York, American National Standards Institute. 67 p.

Anvari, S. H., Trainor, L. J., Woodside, J., Levy, B. A. (2002). Relations among musical skills, phonological processing, and early reading ability in preschool children. Journal of experimental child psychology, October, 83 (2), p.111-130.

Aramaki, M., Besson, M., Kronland-Martinet, R., Ystad, S. (2008). Timbre perception of sounds from impacted materials: behavioral, electrophysiological and acoustic approaches. In: Ystad S., Kronland-Martinet R., Jensen K. (eds) Computer music modeling and retrieval. Genesis of meaning in sound and music. CMMR 2008. Lecture notes in computer science, vol 5493. ed. Springer, Berlin, Heidelberg. p.1-17.

Barwick, J., Valentine, E., West, R., Wilding, J. (1989). Relations between reading and musical abilities. British journal of educational psychology, June, 59 (pt.2), p.253-257.

Bates, E., Thal, D., Finlay, B. L., Clancy B. (2003). Early language development and its neural correlates. In S. J. Segalowitz, I. Rapin (ed.). Handbook of neuropsychology, vol.8, part 2, Child neuropsychology, chapter 5, ed. Amsterdam, Elsevier Science. p.525-592.

Belin, P, Zatorre, R. J., Lafaille, P., Ahad, P., Pike, B. (2000). Voice-selective areas in human auditory cortex. Nature 403 (6767), p.309-312.

Bella, S. D., Berkowska, M., Sowinski, J. (2011). Disorders of pitch production in tone deafness. Frontiers in psychology, July, vol. 2, art. 164, 11 p.

Besson, M., Chobert, J., Marie, C. (2011). Transfer of training between music and speech: common processing, attention, and memory. Frontiers in psychology, May, vol. 2, art. 94, 11 p.

Besson, M., Schön, D. (2001). Comparison between language and music. Annals of the New York Academy of Sciences, June, 930, p.232-258.

Best, K., Bogossian, F., New, K. (2018). Language exposure of preterm infants in the neonatal unit: a systematic review. Neonatology, 114 (3), p.261-276.

Bregman, A. S. (1994). Auditory scene analysis: the perceptual organization of sound. ed. Bradford book, MIT Press. 790 p.

Bronkhorst A. W. (2015). The cocktail-party problem revisited: early processing and selection of multi-talker speech. Attention, perception and psychophysics, July, 77 (5), p.1465-1487.

Caclin, A., Brattico, E., Tervaniemi, M., Naatanen, R., Morlet, D., Giard, M. H., McAdams, S. (2006). Separate neural processing of timbre dimensions in auditory sensory memory. Journal of cognitive neuroscience, December, 18 (12), p.1959-1972.

Caclin, A., McAdams, S., Smith, B. K., Winsberg, S. (2005). Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones. Journal of the acoustical society of America, July, 118 (1), p.471-482.

Chan, A. S., Ho, Y. C., Cheung, M. C. (1998). Music training improves verbal memory. Nature, November, 396 (6707), p.128.

Chen, C. J., Miller, D. A. (2019). Pitch-synchronous analysis of human voice. Journal of voice, February 7, pii: S0892-1997(18)30448-X

Christianson, G. B., Sahani, M., Linden, J. F. (2008). The consequences of response nonlinearities for interpretation of spectrotemporal receptive fields. Journal of neuroscience, January, 28 (2), p.446-455.

Cooper, A., Wang, Y. (2010). The role of musical experience in Cantonese lexical tone perception by native speakers of Thai. Thesis of Proceedings of 5th International Conference on Speech Prosody. Chicago, IL. 4 p.

Darwin, C. J. (1997). Auditory grouping. Trends in cognitive sciences, December, 1 (9), p.327-333.

David, S. V., Mesgarani, N., Fritz, J. B., Shamma, S. A. (2009). Rapid synaptic depression explains nonlinear modulation of spectro-temporal tuning in primary auditory cortex by natural stimuli. Journal of neuroscience, March, 29 (11), p.3374-3386.

DeCasper, A. J., Fifer, W. P. (1980). Of human bonding: newborns prefer their mothers’ voices. Science, June, 208 (4448), p.1174-1176.

Dege, F., Schwarzer, G. (2011). The effect of a music program on phonological awareness in preschoolers. Frontiers in psychology, June, vol. 2, art. 124. 7 p.

Delogu, F., Lampis, G., Olivetti Belardinelli, M. (2006). Music-to-language transfer effect: may melodic ability improve learning of tonal languages by native nontonal speakers? Cognitive processing, 7 (3), p.203-207.

Delogu, F., Lampis, G., Olivetti Belardinelli, M. (2010). From melody to lexical tone: musical ability enhances specific aspects of foreign language perception. European journal of cognitive psychology, 22, p.46-61.

Doheny, L., Hurwitz, S., Insoft, R., Ringer, S., Lahav, A. (2012). Exposure to biological maternal sounds improves cardiorespiratory regulation in extremely preterm infants. Journal of maternal - fetal and neonatal medicine, September, 25 (9), p.1591-1594.

Donnadieu, S. (2007). Mental representation of the timbre of complex sounds. In J. W. Beauchamp (ed.). Analysis, synthesis, and perception of musical sounds. ed. New York, Springer. p.272-319.

Dymnikowa, M. (2018). Diagnostika muzykalnoj pamiati [Diagnosis of musical memory]. ed. Saint-Petersburg, Lema. 184 p.

Elhilali, M., Fritz, J. B., Klein, D. J., Simon, J. Z., Shamma, S. A. (2004). Dynamics of precise spike timing in primary auditory cortex. Journal of neuroscience, February, 24 (5), p.1159-1172.

Ettlinger, M., Margulis, E. H., Wong, P. C. (2011). Implicit memory in music and language. Frontiers in psychology, September, vol. 2, art. 211, 10 p.

Filipiak, E. (2015). Funkcje języka. [Functions of language] Zeszyty Naukowe Wyższej Szkoły Pedagogicznej w Bydgoszczy. Studia Pedagogiczne 27, Pedagogika Przedszkolna i Wczesnoszkolna 10, p.35-47.

Fletcher, N. H. (1934). Loudness, pitch and the timbre of musical tones and their relation to the intensity, the frequency and the overtone structure. Journal of the acoustical society of America, 6, p.59-69.

Fletcher, N. H., Rossing, T. D. (2010). The physics of musical instruments. ed. New York, Springer-Verlag. 756 p.

Gardiner, M. F., Fox, A., Knowles, F., Jeffrey, D. (1996). Learning improved by arts training. Nature, May, 381 (6580), p.284.

Gardner, H. (1999). Intelligence reframed: multiple intelligences for the 21st century. ed. New York, Basic Books. 292 p.

Greshuni, G. V. (1967). O znacenii wremennyh harakteristik w organizacji dejatelnosti sluhowoj sistemy. [On the significance of temporal characteristics in the organization of the auditory system]. In: M. N. Livanov, V. S. Rusinov (ed.). Sowremennyje problemy eketrofiziologii centralnoj nerwnoj sistemy. [Current problems of the electrophysiology of the central nervous system]. Ed. Moscow, Nauka. p.65-70.

Golubock, J. L., Janata, P. (2013). Keeping timbre in mind: working memory for complex sounds that can’t be verbalized. Journal of experimental psychology: human perception and performance, 39 (2), p.399-412.

Gottfried, T. L., Xu, Y. (2008). Effects of musical experience on Mandarin tone and vowel discrimination and imitation. Thesis of Proceedings at the Acoustics 08, Paris, France. p.6161-6165.

Hassler, M., Birbaumer, N., Feil, A. (1985). Musical talent and visual-spatial ability: a longitudinal study. Psychology of music, October, 13 (2), p.99-113.

Hetland, L. (2000). Learning to make music enhances spatial reasoning. Journal of aesthetic education, September, 34 (3/4), p.179-238.

Ho, Y. C., Cheung, M. C., Chan, A. S. (2003). Music training improves verbal but not visual memory: cross-sectional and longitudinal explorations in children. Neuropsychology, July, 17 (3), p.439-450.

Jakobson, R. (1960). Linguistics and poetics. In T. Sebeok (ed.), Style in Language, Cambridge, MA, M.I.T. Press, p.350-377.

Jakobson, R. (1995). On Language. ed. Cambridge, MA, Harvard University Press, 672 p.

Jancke, L. (2012). The relationship between music and language. Frontiers in psychology, April, vol. 3, art.123. 2 p.

Jiam, N. T., Deroche, M. L., Jiradejvong, P., Limb, C. J. (2019). A randomized controlled crossover study of the impact of online music training on pitch and timbre perception in cochlear implant users. Journal of the association for research in otolaryngology, June, 20 (3), p.247-262.

Jiang, C., Hamm, J. P., Lim, V. K., Kirk, I. J., Yang, Y. (2010). Processing melodic contour and speech intonation in congenital amusics with Mandarin Chinese. Neuropsychologia, July, 48 (9), p.2630-2639.

Kanwal, J., Ehret, G. (2006). Behaviour and neurodynamics for auditory communication. ed. Cambridge University Press. 384 p.

Kiklewicz, A. (1999). Lekcii po funcionalnoj lingvistike. [Lectures on functional linguistics] ed. Minsk, BGU, 214 p.

Koelsch, S. (2011). Toward a neural basis of music perception – a review and updated model. Frontiers in psychology, June, vol. 2, art. 110, 20 p.

Kong, Y. Y., Mullangi, A., Marozeau, J., Epstein, M. (2011). Temporal and spectral cues for musical timbre perception in electric hearing. Journal of speech, language and hearing research, June, 54 (3), p.981-994.

Konig, R., Heil, P., Budinger, E., Scheich, H. (eds.). The auditory cortex. A synthesis of human and non-human research. ed. Mahwah, NJ. Lawrence Erlbaum Associates. 512 p.

Korolewa, I. W., Orogodnikova, E. A. (2018). Ocenka progressa razwitia processow sluhorecewogo analiza u dętej s narusheniem sluha, ispolzujusih kohlearnye implanty i sluhowye aparaty. [Assessment of progress in the development of auditory-speech analysis processes in children with hearing impairment using cochlear implants and hearing aids]. Westnik psichofiziologii [Psychophysiology news], No.2, p.106-113.

Kraus, N., Skoe, E., Parbery-Clark, A., Ashley, R. (2009). Experience-induced malleability in neural encoding of pitch, timbre, and timing: implications for language and music. Annals of the New York academy of sciences, July, 1169, p.543-557.

Krumhansl, C. L., Iverson, P. (1992). Perceptual interactions between musical pitch and timbre. Journal of experimental psychology. Human perception and performance. August, 18 (3), p.739-751.

Kuhl, P. K., Stevens, E., Hayashi, A., Deguchi, T., Kiritani, S., Iverson, P. (2006). Infants show a facilitation effect for native language phonetic perception between 6 and 12 months. Developmental science, March, 9 (2), F13-F21.

Leaver, A. M., Rauschecker, J. P. (2010). Cortical representation of natural complex sounds: effects of acoustic features and auditory object category. Journal of neuroscience, June, 30 (22), p.7604-7612.

Lee, C. Y., Hung, T. H. (2008). Identification of Mandarin tones by English-speaking musicians and nonmusicians. Journal of the acoustical society of America, 124 (5), p.3235-3248.

Lee, C. Y., Lee, Y. F. (2010). Perception of musical pitch and lexical tones by Mandarin-speaking musicians. Journal of the acoustical society of America, 127 (1), p.481-490.

Levitin, D. J. (1994). Absolute memory for musical pitch: evidence from the production of learned melodies. Perception and psychophysics, 56 (4), p.414-423.

Levitin, D. J., Cook, P. R. (1996). Memory for musical tempo: additional evidence that auditory memory is absolute. Perception and psychophysics, August, 58 (6), p.927-935.

Lewis, J. W., Brefczynski, J. A., Phinney, R. E., Janik, J. J., DeYoe, E. A. (2005). Distinct cortical pathways for processing tool versus animal sounds. Journal of neuroscience, May, 25 (21), p.5148-5158.

Looi, V., Gfeller, K., Driscoll, V. (2012). Music appreciation and training for cochlear implant recipients: a review. Semin Hear. NIH-PA Author manuscript, November, 33 (4), p.307-334.

Lotto, A. J., Sullivan, S. C. (2008). Speech as a sound source. In W. A. Yost, A. N. Popper, R. R. Fay (ed.). Auditory perception of sound sources, ed. SHAR (Springer handbook of auditory research), vol. 29. Springer, Boston, MA, p.281-305.

Loui, P., Kroog, K., Zuk, J., Winner, E., Schlaug, G. (2011). Relating pitch awareness to phonemic awareness in children: implications for tone-deaf-ness and dyslexia. Frontiers in psychology, May, vol. 2, art. 111, 5 p.

Marozeau, J., de Cheveigne, A. (2007). The effect of fundamental frequency on the brightness dimension of timbre. Journal of the acoustical society of America, January, 121 (1), p.383-387.

McDermott, J. H., Keebler, M. V., Micheyl, Ch., Oxenham, A. J. (2010). Musical intervals and relative pitch: frequency resolution, not interval resolution, is special. Journal of the acoustical society of America, October, 128 (4), p.1943-1951.

McKeown, D., Wellsted, D. (2009). Auditory memory for timbre. Journal of experimental psychology: human perception and performance, 35 (3), p.855-875.

McMullen, E., Saffran, J. R. (2004). Music and language: a developmental comparison. Music perception: an interdisciplinary journal, spring, 21 (3), p.289-311.

Meyer, M., Baumann, S., Jancke, L., 2006. Electrical brain imaging reveals spatio-temporal dynamics of timbre perception in humans. Neuroimage, 32, 1510-1523.

Milovanov, R., Tervaniemi, M. (2011). The interplay between musical and linguistic aptitudes: a review. Frontiers in psychology, November, vol. 2, art. 321, 7 p.

Nan, Y., Sun, Y., Peretz, I. (2010). Congenital amusia in speakers of a tone language: association with lexical tone agnosia. Brain, 133 (9), p.2635-2642.

Overy, K. (2003). Dyslexia and music. From timing deficits to musical intervention. Annals of the New York academy of sciences, November, 999, p.497-505.

Pantev, C., Roberts, L. E., Schulz, M., Engelien, A., Ross, B. (2001). Timbre-specific enhancement of auditory cortical representations in musicians. Neuroreport, January, 12 (1), p.169-174.

Patel, A. D. (2003). Language, music, syntax and the brain. Nature neuroscience, 6 (7), p.674-681.

Patel, A. D. (2010). Music, language and the brain. ed. New York, Oxford University Press. 526 p.

Patel, A. D. (2011). Why would musical training benefit the neural encoding of speech? The OPERA hypothesis. Frontiers in psychology, June, vol.2, art.142, 14 p.

Patel, A. D. (2012). Language, music, and the brain: a resource sharing framework. In: P. Rebuschat, M. Rohrmeier, J. Hawkins, I. Cross (ed.). Language and music as cognitive systems. ed. New York, Oxford University Press. p.204-223.

Patel, A. D. (2012). The OPERA hypothesis: assumptions and clarifications. Annals of the New York Academy of Sciences, 1252, p.124-128.

Patil, K., Pressnitzer, D., Shamma, Sh., Elhilali, M. (2012). Music in our ears: the biological bases of musical timbre perception. PLoS Computational Biology, November, 8 (11), art. e1002759, 16 p.

Perani, D., Saccuman, M. C., Scifo, P., Spada, D., Andreolli, G., Rovelli, R., Baldoli, C., Koelsch, S. (2010). Functional specializations for music processing in the human newborn brain. Proceedings of the national academy of sciences of the United States of America, March, 107 (10), p.4758-4763.

Peretz, I. (2012). Music, language and modularity in action. In: P. Rebuschat, M. Rohrmeier, J. Hawkins, I. Cross (ed.). Language and music as cognitive systems. ed. New York, Oxford University Press. p.254-268.

Peretz, I., Coltheart, M. (2003). Modularity of musical processing. Nature neuroscience, 6, p.688-691.

45. Peretz, I., Zatorre, R. J. (2003). The cognitive neuroscience of music. ed. New York, Oxford University Press. 452 p.

Peretz, I., Zatorre, R, J. (2005). Brain organization for music processing. Annual review of psychology, 56, p.89-114.

Rand, K., Lahav, A. (2014). Maternal sounds elicit lower heart rate in preterm newborns in the first month of life.

Ro, M. H., Behne, D., Wang, Y. (2006). The effects of musical experience on linguistic pitch perception: a comparison of Norwegian professional singers and instrumentalists. Journal of the acoustical society of America, 120, abstract 3168.

Sadagopan, S., Wang, X. (2009). Nonlinear spectrotemporal interactions underlying selectivity for complex sounds in auditory cortex. Journal of neuroscience, September, 29 (36), p.11192-11202.

Schellenberg, E. G. (2004). Music lessons enhance IQ. Psych science, 15, August, 15 (8), p.511-514.

Schon, D., and Francois, C. (2011). Musical expertise and statistical learning of musical and linguistic structures. Frontiers in psychology, July, vol. 2, art. 167, 9 p.

Schreiner, C. E., Mendelson, J., Raggio, M. W., Brosch, M., Krueger, K. (1997). Temporal processing in cat primary auditory cortex. Acta oto-laryngologica. Supplementum, 532, p.54-60.

Schreiner, C. E., Sutter, M. L. (1992). Topography of excitatory bandwidth in cat primary auditory cortex: single-neuron versus multiple-neuron recordings. Journal of neurophysiology, November, 68 (5), p.1487-1502.

Schulze, K., Tillmann, B. (2013). Working memory for pitch, timbre, and words. Memory, April, 21 (3), p.377-395.

Shahin, A. J. (2011). Neurophysiological influence of musical training on speech perception. Frontiers in psycholology, June, vol. 2, art. 126, 10 p.

Shahin, A., Roberts, L. E., Pantev, C., Trainor, L. J., Ross, B. (2005). Modulation of P2 auditory-evoked responses by the spectral complexity of musical sounds. Neuroreport, November, 16 (16), p.1781-1785.

Shahin, A., Roberts, L. E., Trainor, L. J. (2004). Enhancement of auditory cortical development by musical experience in children. Neuroreport, August, 15 (12), p.1917-1921.

Standley, J. M., Huges, J. E. (1997). Evaluation of an early intervention music curriculum for enhancing prereading / writing skills. Music therapy perspectives, November, 15 (2), p.79-88.

Stevens, C. J. (2012). Music perception and cognition: a review of recent cross-cultural research. Topics in cognitive science, October, 4 (4), p.653-667.

Takeuchi, A. H., Hulse, S. H. (1993). Absolute pitch. Psychological bulletin, March, 113 (2), p.345-361.

Tallal, P., Gaab, N. (2006). Dynamic auditory processing, musical experience and language development. Trends in neuroscience, July, 29 (7), p.382-390.

Theunissen, F. E., Sen, K., Doupe, A. J. (2000). Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. Journal of neuroscience, March, 20 (6), p.2315-2331.

Tillmann, B., Burnham, D., Nguyen, S., Grimault, N., Gosselin, N., Peretz, I. (2011). Congenital amusia (or tone-deaf-ness) interferes with pitch processing in tone languages. Frontiers in psychology, June, vol.2, art.120. 15 p.

Tillmann, B., Rusconi, E., Traube, C., Butterworth, B., Umilta, C., Peretz, I. (2011). Fine-grained pitch processing of music and speech in congenital amusia. Journal of the acoustical society of America, 130 (6), p.4089-4096.

Town, S. M., Bizley, J. K. (2013). Neural and behavioral investigations into timbre perception. Frontiers in systems neuroscience, November, vol.7, art. 88. 14 p.

Trainor, L. J., Shahin, A., Roberts, L. E. (2003). Effects of musical training on the auditory cortex in children. Annals of the New York academy of sciences, November, 999, p.506-513.

Tuznik, P., Augustynowicz, P., Francuz, P. (2018). Electrophysiological correlates of timbre imagery and perception. International journal of psychophysiology, July, vol.129, p.9-17.

Uppenkamp, S., Johnsrude, I. S., Norris, D., Marslen-Wilson, W., Patterson, R. D. (2006). Locating the initial stages of speech-sound processing in human temporal cortex. Neuroimage, July, 31 (3), p.1284-1296.

Vanzella, P., Schellenberg, E. G. (2010). Absolute pitch: effects of timbre on note-naming ability. PloSOne, November, 5 (11), e15449. 7 p

Versnel, H., Kowalski, N., Shamma, Sh. A. (1995). Ripple analysis in the ferret primary auditorycortex. III. Topographic and columnar distribution of ripple response parameters. Auditory neuroscience, 1, p.271-286.

Warren, J. D., Jennings, A. R., Griffiths, T. D. (2005). Analysis of the spectral envelope of sounds by the human brain. Neuroimage, February, 24 (4), p.1052-1057.

Warrier, C. M., Zatorre, R. J. (2002). Influence of tonal context and timbral variation on perception of pitch. Perception and psychophysics, February, 64 (2), p.198-207.

Wolfe, J. (2002). Speech and music, acoustics and coding, and what music might be ‘for’. In: C. Stevens, D. Burnham, G. McPherson, E. Schubert, J. Reniwick (eds.). Proceedings of the 7th International Conference on Music Perception and Cognition, Sydney. ed. Adelaide, Causal Productions. p.10-13.

Wong, P. C. M., Skoe, E., Russo, N. M., Dees, T., Kraus, N. (2007). Musical experience shapes human brainstem encoding of linguistic pitch patterns. Nature neuroscience, 10 (4), p.420-422.

Zatorre, R. J., Belin, P. (2001). Spectral and temporal processing in human auditory cortex. Cerebral cortex, October, 11 (10), p.946-953.

Zhao, T. Ch., Kuhl, P. K. (2016). Effects of enriched auditory experience on infants’ speech perception during the first year of life. Prospects, June, 46 (2), p.235-247.



  • There are currently no refbacks.

Copyright (c) 2019 Asian Journal of Humanities and Social Studies

Creative Commons License
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.