Compact Bilinear Deep Features For Environmental Sound Recognition
| Content Provider | Semantic Scholar |
|---|---|
| Author | Demir, Fatih; Sengur, Abdulkadir; Amiriparian, Shahin; Cummins, Nicholas; Schuller, Björn W. |
| Copyright Year | 2018 |
| Abstract | Environmental sound recognition (ESR) has a wide range of civilian and military applications. Existing ESR methods generally tackle this problem by employing various signal processing and machine learning methods. Herein, we propose an ESR paradigm based on feature extraction from pre-trained deep convolutional neural networks (CNNs), followed by the derivation of higher-order statistics through compact bilinear pooling and normalisation. In particular, we consider two deep ImageNet architectures for deep feature extraction, and the Random Maclaurin (RM) method to produce the compact bilinear features. A support vector machine (SVM) with homogeneous mapping is used in the classification stage. Two publicly available environmental sound datasets, namely ESC-50 and ESC-10, are used to verify the efficacy of the approach. We compare the proposed method with various previous state-of-the-art methods. The presented results indicate the suitability of the higher-order statistics of Deep Spectrum representations for ESR classification tasks. |
| Starting Page | 1 |
| Ending Page | 5 |
| Page Count | 5 |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | https://www.informatik.uni-augsburg.de/lehrstuehle/eihw/pdfs/Sengur18-CBD.pdf |
| Alternate Webpage(s) | https://doi.org/10.1109/idap.2018.8620779 |
| Journal | 2018 International Conference on Artificial Intelligence and Data Processing (IDAP) |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |
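
The compact bilinear pooling step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common second-order Random Maclaurin formulation, in which two random ±1 projections of each local CNN feature are multiplied element-wise and summed over locations, and it assumes signed square-root followed by L2 scaling for the normalisation, which the record does not specify. The function names, dimensions, and seed are hypothetical, and the SVM stage is omitted.

```python
import numpy as np


def random_maclaurin_projections(in_dim, out_dim, seed=0):
    """Draw two fixed random sign matrices (entries are +1 or -1)."""
    rng = np.random.default_rng(seed)
    w1 = rng.choice([-1.0, 1.0], size=(out_dim, in_dim))
    w2 = rng.choice([-1.0, 1.0], size=(out_dim, in_dim))
    return w1, w2


def compact_bilinear_rm(features, w1, w2):
    """Pool local features of shape (locations, channels) into one vector.

    Assumed recipe: element-wise product of two random projections,
    sum pooling over locations, signed square root, then L2 scaling.
    """
    out_dim = w1.shape[0]
    # Random Maclaurin feature map for the second-order polynomial kernel.
    projected = (features @ w1.T) * (features @ w2.T) / np.sqrt(out_dim)
    # Sum pooling over all spatial (or time-frequency) locations.
    pooled = projected.sum(axis=0)
    # Assumed normalisation: signed square root followed by L2 scaling.
    pooled = np.sign(pooled) * np.sqrt(np.abs(pooled))
    norm = np.linalg.norm(pooled)
    return pooled / norm if norm > 0 else pooled


if __name__ == "__main__":
    # Hypothetical sizes: 49 locations, 512 channels, 2048 output dimensions.
    w1, w2 = random_maclaurin_projections(in_dim=512, out_dim=2048)
    local_features = np.random.default_rng(1).standard_normal((49, 512))
    descriptor = compact_bilinear_rm(local_features, w1, w2)
    print(descriptor.shape)
```

In this sketch the projection matrices are drawn once and reused for every recording, so that descriptors from different recordings remain comparable when passed to a classifier.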