Comparative Analysis of Prosodic Characteristics of Spontaneous and Synthesized Speech (Based on Kazakh and English Ted Talks Video Materials)

G. Kussepova; R. Kondybaeva; K. Chingissova

doi:10.26577/EJPh202520047

Authors

G. Kussepova, L.N. Gumilyov Eurasian National University, Kazakhstan, Astana, https://orcid.org/0000-0001-9556-8763
R. Kondybaeva, Al-Farabi Kazakh National University, Almaty, Kazakhstan, https://orcid.org/0000-0002-1208-8949
K. Chingissova, Al-Farabi Kazakh National University, Almaty, Kazakhstan, https://orcid.org/0009-0008-9490-1622

Abstract

The study presents an instrumental comparative analysis of the prosodic characteristics of spontaneous speech (drawn from TED Talks) and synthesized speech in Kazakh and English. The paper reviews existing approaches to prosody research and reports an acoustic analysis of key prosodic parameters (pitch frequency, intensity, and tempo) for both speech types. For the comparison, a corpus was compiled containing 10 speech excerpts from TED Talks in each of Kazakh and English; the excerpts were transcribed, and the transcripts were converted into audio files using modern speech synthesis systems. The acoustic analysis was carried out in PRAAT and in the authors' proprietary software, ProAG-2025 (protected document No. 58731, dated May 27, 2025). The working hypothesis is that spontaneous speech shows greater variability in prosodic features, while synthesized speech differs from natural speech in its acoustic and prosodic features. The instrumental results confirm that synthesized speech, despite its structural conformity, retains a set of parameters that allow it to be reliably distinguished from spontaneous speech: increased uniformity of amplitude and frequency contours, the absence of stochastic variation, and a simplified rhythmic-pause pattern. The findings are of practical value for improving speech synthesis algorithms, increasing the naturalness of synthesized speech, and optimizing the communicative effectiveness of media applications.

Keywords: spontaneous speech, synthesized speech, prosody, acoustic parameters, tonality, pitch frequency.

Author Biographies

G. Kussepova, L.N. Gumilyov Eurasian National University, Kazakhstan, Astana

Kussepova Gulzat Tungushbayevna – PhD, L.N. Gumilyov Eurasian National University (Kazakhstan, Astana, e-mail: kussepova_gt_2@enu.kz);

R. Kondybaeva, Al-Farabi Kazakh National University, Almaty, Kazakhstan

Kondybaeva Raushan Zhumakerimovna – PhD, Al-Farabi Kazakh National University (Kazakhstan, Almaty, e-mail: kondybaeva.raushan85@gmail.com);

K. Chingissova, Al-Farabi Kazakh National University, Almaty, Kazakhstan

Chingissova Kuralay Adilzhanovna – PhD student, Al-Farabi Kazakh National University (Kazakhstan, Almaty, e-mail: kuralay.cha@mail.ru).
