Show simple item record

dc.contributor.advisorKëpuska, Veton Z.
dc.contributor.authorBurgos, Wilson
dc.date.accessioned2015-01-09T18:33:59Z
dc.date.available2015-01-09T18:33:59Z
dc.date.issued2014-11
dc.identifier.urihttp://hdl.handle.net/11141/458
dc.descriptionThesis (M.S.) - Florida Institute of Technology, 2014.en_US
dc.description.abstractThe feature analysis component of an Automated Speaker Recognition (ASR) system plays a crucial role in the overall performance of the system. There are many feature extraction techniques available, but ultimately we want to maximize the performance of these systems. From this point of view, the algorithms developed to compute feature components are analyzed. Current state-of-the-art ASR systems perform quite well in a controlled environment where the speech signal is noise free. The objective of this thesis investigates the results that can be obtained when you combine Mel-Frequency Cepstral Coefficients (MFCC) and Gammatone Frequency Cepstral Coefficients (GFCC) as feature components for the front-end processing of an ASR. The MFCC and GFCC feature components combined are suggested to improve the reliability of a speaker recognition system. The MFCC are typically the “de facto” standard for speaker recognition systems because of their high accuracy and low complexity; however they are not very robust at the presence of additive noise. The GFCC features in recent studies have shown very good robustness against noise and acoustic change. The main idea is to integrate MFCC & GFCC features to improve the overall ASR system performance in low signal to noise ratio (SNR) conditions. The experiment are conducted on the Texas Instruments and Massachusetts Institute of Technology (TIMIT) and the English Language Speech Database for Speaker Recognition (ELSDR) databases, were the test utterances are mixed with noises at various SNR levels to simulate the channel change. The results provide an empirical comparison of the MFCC-GFCC combined features and the individual counterparts.en_US
dc.language.isoen_USen_US
dc.rightsCopyright held by authoren_US
dc.titleGAMMATONE AND MFCC FEATURES IN SPEAKER RECOGNITIONen_US
dc.typeThesisen_US
thesis.degree.nameMaster of Science in Computer Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.disciplineComputer Engineeringen_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.grantorFlorida Institute of Technologyen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record