You are here

Speech Quality Assessment Using Audio Features

TitleSpeech Quality Assessment Using Audio Features
Publication TypeJournal Article
Year of Publication2020
AuthorsDevi, M. N. Renuka, and S. Gowri
JournalJournal of University of Shanghai for Science and Technology
Volume22
Pagination2116-2125
Date Published2020
ISBN Number1007-6735 (ISSN)
KeywordsComputer Science and Engineering, Scopus
Abstract

This paper discusses the design of features that aid in the classification of the quality of speech of a speaker.The data used in this study pertains to TED Talks. Since most TED speakers are high achievers and expert orators,we have a rich source of audio cues that define speech that is appealing to a large audience. The features used to categorize the speech quality can be the basis of analyzing the speech quality of novice speakers. Such a system can be used to draw a novice speaker‟s attention to specific areas of improvement, such an increase in amplitude or maintaining vocal consistency 22 and facilitate directed effort towards improving the quality of one‟s speech.We use a speakerclassification technique designed and developed in house including Short Term Energy (STE), Zero Crossing Rate (ZCR), Mean power, Pitch, Magnitude and standard deviation. Finally, we use an unsupervised classifying method called "Hierarchical clustering technique" to group speakers into 6 categories.

URLhttps://jusst.org/wp-content/uploads/2020/10/Speech-Quality-Assessment-Using-Audio-Features-2.pdf