Khonglah et al., 2016
| Publication | Publication Date | Title |
|---|---|---|
| EP2793223B1 (en) | | Ranking representative segments in media data |
| US6570991B1 (en) | | Multi-feature speech/music discrimination system |
| Tian et al. | | Spoofing detection from a feature representation perspective |
| Khonglah et al. | | Speech/music classification using speech-specific features |
| Esmaili et al. | | Content based audio classification and retrieval using joint time-frequency analysis |
| Alexandre-Cortizo et al. | | Application of Fisher linear discriminant analysis to speech/music classification |
| Lampropoulos et al. | | Evaluation of MPEG-7 descriptors for speech emotional recognition |
| Bach et al. | | Robust speech detection in real acoustic backgrounds with perceptually motivated features |
| Thambi et al. | | Random forest algorithm for improving the performance of speech/non-speech detection |
| Ahrendt et al. | | Decision time horizon for music genre classification using short time features |
| Nilufar et al. | | Spectrogram based features selection using multiple kernel learning for speech/music discrimination |
| Khonglah et al. | | Low frequency region of vocal tract information for speech/music classification |
| Izumitani et al. | | A background music detection method based on robust feature extraction |
| Mohammed et al. | | Overlapped music segmentation using a new effective feature and random forests |
| Dziubinski et al. | | Estimation of musical sound separation algorithm effectiveness employing neural networks |
| Martin et al. | | Cepstral modulation ratio regression (CMRARE) parameters for audio signal analysis and classification |
| Alexandre et al. | | Application of Fisher linear discriminant analysis to speech/music classification |
| Li et al. | | YOLOPitch: A time-frequency dual-branch YOLO model for pitch estimation |
| Kumar et al. | | Hilbert spectrum based features for speech/music classification |
| Rahman et al. | | Automatic gender identification system for Bengali speech |
| JPH01255000A (en) | | Apparatus and method for selectively adding noise to template to be used in voice recognition system |
| Ramírez et al. | | Stem audio mixing as a content-based transformation of audio features |
| Patsis et al. | | A speech/music/silence/garbage/classifier for searching and indexing broadcast news material |
| Kos et al. | | Online speech/music segmentation based on the variance mean of filter bank energy |
| von Zeddelmann | | A feature-based approach to noise robust speech detection |