| Content Provider | IEEE Xplore Digital Library |
|---|---|
| Author | Hermansky, H. |
| Copyright Year | 1999 |
| Description | Author affiliation: Graduate Inst. of Sci. & Technol., Portland, OR, USA (Hermansky, H.) |
| Abstract | Summary form only given, as follows. Firstly, the basic principles of automatic speech recognition (ASR) are reviewed. The acoustic analysis module is examined in greater detail, and the distinctions between its two main blocks, pattern classification and feature extraction, are discussed. The early history of speech feature extraction is covered, including early attempts by Newton and Helmholtz to characterize information-bearing components of vowels, and Scripture's analysis of phonographic voice recordings. The concepts of short-term analysis and spectrograms are introduced together with the linear model of speech production. Reasons for spectral envelope estimation in ASR, as well as basic techniques for its estimation such as homomorphic analysis and linear predictive analysis, are introduced. The cepstrum as an approximation to the Karhunen-Loève transformation, and cepstral lifters as a means of modifying properties of simple Euclidean cepstral distances, are also introduced. Inconsistencies of simple envelope estimation techniques with human speech perception are mentioned. Reasons for auditory-like feature extraction and some currently dominant auditory-like techniques, such as Mel cepstral analysis and perceptual linear prediction (PLP), are described. The concept and basic properties of the modulation spectrum of speech are explained, and its historical use in predicting the intelligibility of speech in auditoria is mentioned. Dynamic features (delta, double-delta) are discussed, with a special focus on their interpretation as FIR filters applied to the modulation spectrum of speech. RASTA filtering is introduced as an extension of the FIR filtering done in dynamic feature estimation, and reasons for its robustness to changes in communication environments are explained. Interesting consistencies of RASTA processing with temporal properties of human hearing, such as forward masking, are also mentioned. The need for data-driven feature extraction is discussed, and techniques for the design of discriminant spectral bases and of discriminant RASTA filters are described, with recent results of their application in automatic speech recognition and in speaker recognition. The concept of multi-band recognition of speech is introduced, and its inherent robustness in the presence of colored noise is discussed. The concept is further generalized into sub-stream-based recognition, and some techniques for merging information sub-streams are described. Finally, recently introduced speech recognition from temporal patterns of spectral energies is described, and its inherent advantages in recognizing speech in adverse environments are discussed. |
| File Size | 80558 |
| File Format | |
| ISBN | 1864354518 |
| DOI | 10.1109/ISSPA.1999.818095 |
| Language | English |
| Publisher | Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Publisher Date | 1999-08-22 |
| Publisher Place | Australia |
| Access Restriction | Subscribed |
| Rights Holder | Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Subject Keyword | History; Cepstral analysis; Finite impulse response filter; Automatic speech recognition; Feature extraction; Speech analysis; Speech recognition; Humans; Filtering; Pattern analysis |
| Content Type | Text |
| Resource Type | Article |