Loading...
Please wait, while we are loading the content...
Similar Documents
Construction And Evaluation Of A Robust Multifeature Speech/music Discriminator (1997)
| Content Provider | CiteSeerX |
|---|---|
| Author | Scheirer, Eric Slaney, Malcolm |
| Description | We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on systemperformanceand the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound. 1. OVERVIEW The problem of distinguishing speech signals from music signals has become increasingly important as automatic speech recognition (ASR) systems are applied to more and more "real-world" multimedia domains. If we wish to build systems that perform ASR on soundtrack data, for example, it is important to be able to distinguish which segments... |
| File Format | |
| Language | English |
| Publisher Date | 1997-01-01 |
| Access Restriction | Open |
| Subject Keyword | Automatic Speech Recognition Real-time Computer System Digital Audio Input Wide Range Robust Multifeature Speech Music Discriminator Several Multidimensional Classification Framework Cross-validated Training Test Setup Extensive Data Real-world Multimedia Domain Music Signal Classifier Classifies Speech Signal Frame-by-frame Basis Distinct Property Soundtrack Data |
| Content Type | Text |
| Resource Type | Article |