Publications

Organized by Research Areas

Simulation of Speech Cognition

Learnability of English diphthongs: One dynamic target vs. two static targets

Xu, A.*, Niekerk, D. R. v., Gerazov, B., Krug, P. K., Prom-on, S., Birkholz, P., & Xu, Y. Speech Communication 170, 103225 2025
DOI: 10.1016/j.specom.2025.103225

Artificial vocal learning guided by speech recognition: What it may tell us about how children learn to speak

Xu, A., Niekerk, D. R. v., Gerazov, B., Krug, P. K., Birkholz, P., Prom-on, S., Halliday, L., & Xu, Y.* Journal of Phonetics 105, 101338 2024
DOI: 10.1016/j.wocn.2024.101338

In Pursuit for the Best Error Metric for Optimisation of Articulatory Vowel Synthesis

Gerazov, B., Krug, P.K., Niekerk, D. R. v., Xu, A., Birkholz, P., Xu, Y. 26th International Conference on Speech and Computer (SPECOM-2024), Belgrade, Serbia 2024
DOI: 10.1007/978-3-031-78014-1_17

Artificial Vocal Learning guided by Phoneme Recognition and Visual Information

Krug, P. K.*, Birkholz, P., Gerazov, B., Niekerk, D.v., Xu, A., Xu, Y. IEEE Transactions on Audio, Speech and Language Processing 31, 1734-1744 2023
DOI: 10.1109/TASLP.2023.3264454

Simulating vocal learning of spoken language: Beyond imitation

Niekerk, D. R. v.*, Xu, A., Gerazov, B., Krug, P. K., Birkholz, P., Halliday, L., Prom-on, S., & Xu, Y. Speech Communication 147, 51-62 2023
DOI: 10.1016/j.specom.2023.01.003

Computational models for articulatory learning of English diphthongs: One dynamic target vs. two static targets

Xu, A., Gerazov, B., van Niekerk, D., Krug, P., Prom-on, S., Birkholz, P., & Xu, Y. International Congress of Phonetic Sciences ICPhS 2023 2023

Self-Supervised Solution to the Control Problem of Articulatory Synthesis

Krug, P.K., Birkholz, P., Gerazov, B., van Niekerk, D.R., Xu, A., Xu, Y. Proc. INTERSPEECH 2023, 4329-4333 2023
DOI: 10.21437/Interspeech.2023-2173

Evoc-Learn — High quality simulation of early vocal learning

Xu, Y., Xu, A., van Niekerk, D. R., Gerazov, B., Birkholz, P., Krug, P. K., Prom-on, S., Halliday, L. F. Proc. Interspeech 2022, 3665-3666 2022

Exploration strategies for articulatory synthesis of complex syllable onsets

Niekerk, D. R. v., Xu, A., Gerazov, B., Krug, P. K., Birkholz, P. & Xu, Y. Proc. Interspeech 2022, 635-639 2022
DOI: 10.21437/Interspeech.2022-10689

Articulatory synthesis for data augmentation in phoneme recognition

Krug, P. K., Birkholz, P., Gerazov, B., van Niekerk, D. R., Xu, A., Xu, Y. Proc. Interspeech 2022, 1228-1232 2022
DOI: 10.21437/Interspeech.2022-10874

Efficient exploration of articulatory dimensions

Krug, P. K., Birkholz, P., Gerazov, B., van Niekerk, D. R., Xu, A., Xu, Y. Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2022 (TUDPress, Dresden), pp.51-58 2022

Modelling microprosodic effects can lead to an audible improvement in articulatory synthesis

Krug, P. K.*, Gerazov, B., Niekerk, D. R. v., Xu, A., Xu, Y., & Birkholz, P. The Journal of the Acoustical Society of America 150(2), 1209-1217 2021
DOI: 10.1121/10.0005876

Model-Based Exploration of Linking Between Vowel Articulatory Space and Acoustic Space

Xu, A., Niekerk, D.v., Gerazov, B., Krug, P.K., Prom-on, S., Birkholz, P., Xu, Y. Proc. Interspeech 2021, 3191-3195 2021
DOI: 10.21437/Interspeech.2021-1422

Finding intelligible consonant-vowel sounds using high-quality articulatory synthesis

Niekerk, D. R. v., Xu, A., Gerazov, B., Krug, P. K., Birkholz, P. & Xu, Y. Proc. Interspeech 2020, 4457-4461 2020
DOI: 10.21437/Interspeech.2020-2545

Coarticulation as synchronized dimension-specific sequential target approximation: An articulatory synthesis simulation

Xu, A., Birkholz, P. & Xu, Y. International Congress of Phonetic Sciences ICPhS 2019 2019

Neural Mechanisms of Speech

Regulation of sensorimotor serial learning in speech production by motor compensation rather than sensory error

Lu, Y., Tang, X., Xiao, Z., Xu, A., Chen, J., Tian, X. eLife 14:RP108357 2025
DOI: 10.7554/eLife.108357

Speech Prosody

When focus shapes the flow: prosodic restructuring in Mandarin complex nominals

Xu, A. & Hsu, Y. Proc. Interspeech 2025 2025
DOI: 10.21437/Interspeech.2025-1121

When Focus Overrides Form: Prosodic Rephrasing in Mandarin complex nominals

Hsu, Y., & Xu, A. The 31st Architectures and Mechanisms for Language Processing conference (AMLaP) 2025 2025

Morphosyntax and prosody interaction: A perceptual study of Chinese classifiers

Cao, S., Hsu, Y., & Xu, A. The 12th International Conference of the European Association of Chinese Linguistics (EACL-12), Roma Tre University 2024

Analysis and computational modeling of Emirati Arabic intonation - A preliminary study

Alzaidi, M.*, Szreder, M., Xu, A. & Xu, Y. Journal of Phonetics 98, 101236 2023
DOI: 10.1016/j.wocn.2023.101236

Post-focus compression in Brahvi and Balochi

Syed, N. A.*, Shah, A. W., Xu, A., & Xu, Y. Phonetica 79(2), 189-218 2022
DOI: 10.1515/phon-2022-2020

Perceiving focus: A study on Cantonese and Mandarin speakers' processing Mandarin prosody

Hsu, Y., Xu, A., & Chen, Y. 34th North American Conference on Chinese Linguistics (NACCL-34), Bloomington, Indiana 2022

Consonantal F0 perturbation in American English involves multiple mechanisms

Xu, Y. & Xu, A.* The Journal of the Acoustical Society of America, 149(4), 2877-2895 2021
DOI: 10.1121/10.0004239

Wh-indeterminates and Prosody in Hong Kong Cantonese

Hsu, Y., & Xu, A. Proc. 10th International Conference on Speech Prosody 2020, 376-380 2020
DOI: 10.21437/SpeechProsody.2020-77

Prosodic encoding of focus in Hijazi Arabic

Alzaidi, M.*, Xu, Y. & Xu, A. Speech Communication, 106, 127-149 2019
DOI: 10.1016/j.specom.2018.12.006

Sentence Prosody and Wh-indeterminates in Taiwan Mandarin

Hsu, Y., & Xu, A. Proc. Interspeech 2019, 3950-3954 2019
DOI: 10.21437/Interspeech.2019-2545

Focus Acoustics and Prosodic Organization in Hong Kong Cantonese and Taiwan Mandarin

Hsu, Y., & Xu, A. International Congress of Phonetic Sciences ICPhS 2019 2019

Sentence Final Particles and Wh-indeterminates in Beijing Mandarin

Hsu, Y., Chan, K-W., Lo, T-S., Wang, X. & Xu, A. Hanyang International Symposium on Phonetics and Cognitive Sciences of Language (HISPhonCog 2019) 2019

Focus Prosody in Cantonese and Teochew Noun Phrases

Hsu, Y., Xu, A. & Hang, N. Proc. 9th International Conference on Speech Prosody 2018, 961-965 2018
DOI: 10.21437/SpeechProsody.2018-194

Focus Acoustics in Mandarin Nominals

Hsu, Y., & Xu, A. Proc. Interspeech 2017, 3231-3235 2017
DOI: 10.21437/Interspeech.2017-1167

Vocal Emotion & Attractiveness

Perception of vocal attractiveness by Mandarin native listeners

Xu, A., & Lee, A. Proc. 9th International Conference on Speech Prosody 2018, 344-348 2018
DOI: 10.21437/SpeechProsody.2018-70

Universal vs. language-specific aspects in human vocal attractiveness: An investigation towards Japanese native listeners' perceptual pattern

Xu, A., Leung, S. & Lee, A. Proceedings of Meetings on Acoustics, 29, 060001 2016
DOI: 10.1121/2.0000392

Universal vs. language-specific aspects in human vocal attractiveness: An investigation towards Japanese native listeners' perceptual pattern

Xu, A., & Leung, S. The Journal of the Acoustical Society of America, 140, 3401 2016
DOI: 10.1121/1.4970911

Perceived vocal attractiveness by gay listeners in Hong Kong

Leung, S., & Xu, A. The Journal of the Acoustical Society of America, 140, 3401 2016
DOI: 10.1121/1.4970916
