Speech Quality Assessment Using Audio Features

Title	Speech Quality Assessment Using Audio Features
Publication Type	Journal Article
Year of Publication	2020
Authors	Devi, M. N. Renuka, and S. Gowri
Journal	Journal of University of Shanghai for Science and Technology
Volume	22
Pagination	2116-2125
Date Published	2020
ISBN Number	1007-6735 (ISSN)
Keywords	Computer Science and Engineering, Scopus
Abstract	This paper discusses the design of features that aid in the classification of the quality of speech of a speaker.The data used in this study pertains to TED Talks. Since most TED speakers are high achievers and expert orators,we have a rich source of audio cues that define speech that is appealing to a large audience. The features used to categorize the speech quality can be the basis of analyzing the speech quality of novice speakers. Such a system can be used to draw a novice speaker‟s attention to specific areas of improvement, such an increase in amplitude or maintaining vocal consistency 22 and facilitate directed effort towards improving the quality of one‟s speech.We use a speakerclassification technique designed and developed in house including Short Term Energy (STE), Zero Crossing Rate (ZCR), Mean power, Pitch, Magnitude and standard deviation. Finally, we use an unsupervised classifying method called "Hierarchical clustering technique" to group speakers into 6 categories.
URL	https://jusst.org/wp-content/uploads/2020/10/Speech-Quality-Assessment-Using-Audio-Features-2.pdf

You are here