N. Adams, D. Marquez, and G. Wakefield, Iterative deepening for melody alignment and retrieval, 2005.

J. Ajmera, I. A. McCowan, and H. Bourlard, Robust HMM-based speech/music segmentation, Acoustics, Speech, and Signal Processing (ICASSP) IEEE International Conference on, p.297, 2002.

H. Akaike, Information theory and an extension of the maximum likelihood principle, Selected Papers of Hirotugu Akaike, pp.199-213, 1998.

A. I. Al-shoshan, Speech and music classification and separation : a review, 2006.

J. Alon, V. Athitsos, Q. Yuan, and S. Sclaroff, A unified framework for gesture recognition and spatiotemporal gesture segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.9, pp.1685-1699, 2009.

F. J. Anscombe, Graphs in statistical analysis, The American Statistician, pp.17-21, 1973.

A. Bagnall and G. Janacek, Clustering Time Series with Clipped Data, Machine Learning, pp.151-178, 2005.
DOI : 10.1007/s10994-005-5825-6

A. Bagnall, E. Keogh, S. Lonardi, and G. Janacek, A Bit Level Representation for Time Series Data Mining with Shape Based Similarity, Data Mining and Knowledge Discovery, vol.13, issue.1, pp.11-40, 2006.
DOI : 10.1007/s10618-005-0028-0

C. Bahlmann, B. Haasdonk, and H. Burkhardt, Online handwriting recognition with support vector machines - a kernel approach, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition, pp.49-54, 2002.
DOI : 10.1109/IWFHR.2002.1030883

L. E. Baum and J. A. Eagon, An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology, Bulletin of the American Mathematical Society, vol.73, issue.3, pp.360-363, 1967.
DOI : 10.1090/S0002-9904-1967-11751-8

R. E. Bellman, Adaptive control processes : a guided tour, 1961.
DOI : 10.1515/9781400874668

J. A. Bilmes, What HMMs Can Do, IEICE Transactions on Information and Systems, vol.89, issue.3, pp.869-891, 2006.
DOI : 10.1093/ietisy/e89-d.3.869

C. M. Bishop, Pattern recognition and machine learning, 2006.

T. Bocklet, A. Maier, J. G. Bauer, F. Burkhardt, and E. Nöth, Age and gender recognition for telephone applications based on GMM supervectors and support vector machines, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1605-1608, 2008.
DOI : 10.1109/ICASSP.2008.4517932

B. Bogert, M. Healy, and J. Tukey, The quefrency alanysis of time series for echoes : Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking, Symposium on Time Series Analysis, pp.209-243, 1963.

B. E. Boser, I. M. Guyon, and V. N. Vapnik, A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory , COLT '92, pp.144-152, 1992.
DOI : 10.1145/130385.130401

N. Bouguila, D. Ziou, and J. Vaillancourt, Unsupervised Learning of a Finite Mixture Model Based on the Dirichlet Distribution and Its Application, IEEE Transactions on Image Processing, vol.13, issue.11, pp.1533-1543, 2004.
DOI : 10.1109/TIP.2004.834664

G. E. Box, G. M. Jenkins, and G. C. Reinsel, Time series analysis : forecasting and control, 2011.
DOI : 10.1002/9781118619193

L. Breiman, J. Friedman, C. J. Stone, and R. A. Olshen, Classification and regression trees, 1984.

C. J. Burges and B. Schölkopf, Improving the accuracy and speed of support vector machines, Advances in Neural Information Processing Systems 9, pp.375-381, 1997.

J. Burred and G. Peeters, An adaptive system for music classification and tagging, 3rd Int. Workshop on Learning Semantics of Audio Signals, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01106472

J. J. Burred and A. Lerch, Hierarchical automatic audio signal classification, J. Audio Eng. Soc, vol.52, issue.7/8, pp.724-739, 2004.

M. Carey, E. Parris, and H. Lloyd-Thomas, A comparison of features for speech, music discrimination, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.149-152, 1999.
DOI : 10.1109/ICASSP.1999.758084

N. Casagrande, D. Eck, and B. Kégl, Geometry in sound : A speech/music audio classifier inspired by an image classifier, International Computer Music Conference, 2005.

G. Castellano, A. M. Fanelli, and M. Pelillo, An iterative pruning algorithm for feedforward neural networks, IEEE Transactions on Neural Networks, vol.8, issue.3, pp.519-531, 1997.
DOI : 10.1109/72.572092

N. Castro and P. J. Azevedo, Multiresolution Motif Discovery in Time Series, SDM, pp.665-676, 2010.
DOI : 10.1137/1.9781611972801.73

G. C. Cawley and N. L. Talbot, Preventing over-fitting during model selection via Bayesian regularisation of the hyper-parameters, The Journal of Machine Learning Research, vol.8, pp.841-861, 2007.

G. C. Cawley and N. L. Talbot, On over-fitting in model selection and subsequent selection bias in performance evaluation, The Journal of Machine Learning Research, vol.11, pp.2079-2107, 2010.

N. A. Chadwick, D. A. McMeekin, and T. Tan, Classifying eye and head movement artifacts in EEG signals, 5th IEEE International Conference on Digital Ecosystems and Technologies (IEEE DEST 2011), pp.285-291, 2011.
DOI : 10.1109/DEST.2011.5936640

C. Chang and C. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.1-27, 2011.
DOI : 10.1145/1961189.1961199

G. Choy, D. Hermann, R. L. Brennan, T. Schneider, H. Sheikhzadeh et al., Subband-based acoustic shock limiting algorithm on a low-resource DSP system, 2003.

S. Chu, S. Narayanan, and C. J. Kuo, Content analysis for acoustic environment classification in mobile robots, AAAI Fall Symposium, Aurally Informed Performance : Integrating Machine Listening and Auditory Presentation in Robotic Systems, pp.16-21, 2006.

C. Cotton, D. Ellis, and A. Loui, Soundtrack classification by transient events, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.473-476, 2011.
DOI : 10.1109/ICASSP.2011.5946443

T. M. Cover and J. M. Van Campenhout, On the possible orderings in the measurement selection problem, IEEE Transactions on Systems, Man and Cybernetics, vol.7, issue.9, pp.657-661, 1977.

T. H. Dat, K. Takeda, and F. Itakura, Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement, IEICE Transactions on Information and Systems, vol.89, issue.3, pp.1040-1049, 2006.
DOI : 10.1093/ietisy/e89-d.3.1040

S. Davis and P. Mermelstein, Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences, Acoustics, Speech and Signal Processing IEEE Transactions on, vol.28, issue.4, pp.357-366, 1980.
DOI : 10.1016/B978-0-08-051584-7.50010-3

C. S. Daw, C. E. Finney, and E. R. Tracy, A review of symbolic analysis of experimental data, Review of Scientific Instruments, vol.74, issue.2, pp.915-930, 2003.
DOI : 10.1063/1.1531823

A. de Cheveigné and H. Kawahara, YIN, a fundamental frequency estimator for speech and music, The Journal of the Acoustical Society of America, vol.111, issue.4, pp.1917-1930, 2002.
DOI : 10.1121/1.1458024

A. Dempster, N. Laird, and D. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society. Series B (Methodological), pp.1-38, 1977.

H. Ding, G. Trajcevski, P. Scheuermann, X. Wang, and E. Keogh, Querying and mining of time series data, Proceedings of the VLDB Endowment, pp.1542-1552, 2008.
DOI : 10.14778/1454159.1454226

R. O. Duda, P. E. Hart, and D. G. Stork, Pattern classification, 2001.

K. El-Maleh, M. Klein, G. Petrucci, and P. Kabal, Speech/music discrimination for multimedia applications, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.2445-2448, 2000.
DOI : 10.1109/ICASSP.2000.859336

A. J. Eronen, V. T. Peltonen, J. T. Tuomi, A. P. Klapuri, S. Fagerlund et al., Audio-based context recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.14, issue.1, pp.321-329, 2006.

P. Esling and C. Agon, Time-series data mining, ACM Computing Surveys, vol.45, issue.1, p.12, 2012.
DOI : 10.1145/2379776.2379788

S. Essid, Classification automatique des signaux audio-fréquences : reconnaissance des instruments de musique, 2005.

C. Faloutsos, M. Ranganathan, and Y. Manolopoulos, Fast subsequence matching in time-series databases, 1994.

J. Faure, A. Guérin, and C. Marro, Method and device for detecting acoustic shocks, WO Patent 2012/001261, 2012.

J. Flocon-cholet, J. Faure, A. Guérin, and P. Scalart, An investigation of temporal feature integration for a low-latency classification with application to speech/music/mix classification, Audio Engineering Society Convention 137, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01100261

J. Flocon-cholet, J. Faure, A. Guérin, and P. Scalart, A robust howling detection algorithm based on a statistical approach, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), pp.65-69, 2014.
DOI : 10.1109/IWAENC.2014.6953339

URL : https://hal.archives-ouvertes.fr/hal-01100273

J. Foote and S. Uchihashi, The beat spectrum: a new approach to rhythm analysis, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001., p.224, 2001.
DOI : 10.1109/ICME.2001.1237863

Z. Fu, J. Wang, and L. Xie, Noise robust features for speech/music discrimination in realtime telecommunication, ICME, pp.574-577, 2009.

K. Fukunaga, Introduction to statistical pattern recognition, 2013.

S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J. Bonastre et al., The ESTER Phase II evaluation campaign for the rich transcription of French broadcast news, Interspeech, pp.1149-1152, 2005.

R. Gaudel, Paramètres d'ordre et sélection de modèles en apprentissage : caractérisation des modèles et sélection d'attributs, 2010.

P. Geurts, Pattern Extraction for Time Series Classification, Principles of Data Mining and Knowledge Discovery, pp.115-127, 2001.
DOI : 10.1007/3-540-44794-6_10

A. A. Goshtasby, Similarity and Dissimilarity Measures, Image registration, pp.7-66, 2012.
DOI : 10.1007/978-1-4471-2458-0_2

C. Goudar, P. Rabha, M. Deshpande, and A. Rao, SMVLite: Reduced Complexity Selectable Mode Vocoder, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1660117

S. C. Greer and A. Dejaco, Standardization of the selectable mode vocoder, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.953-956, 2001.
DOI : 10.1109/ICASSP.2001.941074

S. Gudmundsson, T. P. Runarsson, and S. Sigurdsson, Support vector machines and dynamic time warping for time series, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), pp.2772-2776, 2008.
DOI : 10.1109/IJCNN.2008.4634188

I. Guyon and A. Elisseeff, An introduction to variable and feature selection, The Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, Gene selection for cancer classification using support vector machines, Machine Learning, vol.46, issue.1-3, pp.389-422, 2002.

H. Harb, L. Chen, and J. Auloge, Mixture of experts for audio classification: an application to male female classification and musical genre recognition, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763), pp.1351-1354, 2004.
DOI : 10.1109/ICME.2004.1394479

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning, 2009.

D. Haussler, Convolution kernels on discrete structures, 1999.

R. P. Hellman, Perceived magnitude of two-tone-noise complexes: Loudness, annoyance, and noisiness, The Journal of the Acoustical Society of America, vol.77, issue.4, pp.1497-1504, 1985.
DOI : 10.1121/1.392044

S. Helmer, A. Poulovassilis, and F. Xhafa, Reasoning in event-based distributed systems, 2011.
DOI : 10.1007/978-3-642-19724-6

M. Hossan, S. Memon, and M. Gregory, A novel approach for MFCC feature extraction, 2010 4th International Conference on Signal Processing and Communication Systems, pp.1-5, 2010.
DOI : 10.1109/ICSPCS.2010.5709752

H. Hotelling, Analysis of a complex of statistical variables into principal components., Journal of Educational Psychology, vol.24, issue.6, p.417, 1933.
DOI : 10.1037/h0071325

J. Huang, Z. Liu, and Y. Wang, Joint scene classification and segmentation based on hidden Markov model, IEEE Transactions on Multimedia, vol.7, issue.3, pp.538-550, 2005.

Y. Huang and P. S. Yu, Adaptive query processing for time-series data, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '99, pp.282-286, 1999.
DOI : 10.1145/312129.318357

B. Hugueney, Représentations symboliques adaptatives de séries temporelles : principes et algorithmes de construction, p.76, 2006.

F. Itakura and S. Saito, Statistical method for estimation of speech spectral density and formant frequencies, Electronics & Communications in Japan, vol.53, issue.1, p.36, 1970.

A. Jain and D. Zongker, Feature selection : Evaluation, application, and small sample performance, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.2, pp.153-158, 1997.

A. K. Jain and B. Chandrasekaran, Dimensionality and sample size considerations in pattern recognition practice, Handbook of statistics, pp.835-855, 1982.
DOI : 10.1016/S0169-7161(82)02042-2

T. Jebara, R. Kondor, and A. Howard, Probability product kernels, The Journal of Machine Learning Research, vol.5, pp.819-844, 2004.

M. Jelinek, R. Salami, S. Ahmadi, B. Bessette, P. Gournay et al., Advances in source-controlled variable bit rate wideband speech coding, Special Workshop in MAUI (SWIM) : Lectures by masters in speech processing, 2004.

Y. Ji, C. Wu, P. Liu, J. Wang, and K. R. Coombes, Applications of beta-mixture models in bioinformatics, Bioinformatics, vol.21, issue.9, pp.2118-2122, 2005.
DOI : 10.1093/bioinformatics/bti318

D. Jiang, L. Lu, H. Zhang, J. Tao, and L. Cai, Music type classification by spectral contrast feature, Multimedia and Expo, 2002. ICME'02. Proceedings. 2002 IEEE International Conference on, pp.113-116, 2002.

C. Joder, S. Essid, and G. Richard, Temporal integration for audio classification with application to musical instrument classification, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.1, pp.174-186, 2009.

M. W. Kadous and C. Sammut, Classification of Multivariate Time Series and Structured Data Using Constructive Induction, Machine Learning, vol.29, issue.4, pp.179-216, 2005.
DOI : 10.1007/s10994-005-5826-5

O. Kalinli, S. Sundaram, and S. Narayanan, Saliency-driven unstructured acoustic scene classification using latent perceptual indexing, 2009 IEEE International Workshop on Multimedia Signal Processing, pp.1-6, 2009.
DOI : 10.1109/MMSP.2009.5293267

B. Kedem, Spectral analysis and discrimination by zero-crossings, Proceedings of the IEEE, pp.1477-1493, 1986.
DOI : 10.1109/PROC.1986.13663

E. Keogh, K. Chakrabarti, M. Pazzani, and S. Mehrotra, Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases, Knowledge and Information Systems, vol.3, issue.3, pp.263-286, 2001.
DOI : 10.1007/PL00011669

E. Keogh, K. Chakrabarti, M. Pazzani, and S. Mehrotra, Locally adaptive dimensionality reduction for indexing large time series databases, ACM SIGMOD Record, vol.30, issue.2, pp.151-162, 2001.
DOI : 10.1145/376284.375680

E. Keogh, S. Chu, D. Hart, and M. Pazzani, Segmenting time series : A survey and novel approach, Data mining in time series databases, pp.1-22, 2004.

E. Keogh and S. Kasetty, On the need for time series data mining benchmarks, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '02, pp.349-371, 2003.
DOI : 10.1145/775047.775062

E. Keogh and J. Lin, Clustering of time-series subsequences is meaningless: implications for previous and future research, Knowledge and Information Systems, vol.14, issue.2, pp.154-177, 2005.
DOI : 10.1109/TKDE.2002.1019212

E. Keogh, S. Lonardi, and C. A. Ratanamahatana, Towards parameter-free data mining, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '04, pp.206-215, 2004.
DOI : 10.1145/1014052.1014077

E. Keogh and C. A. Ratanamahatana, Exact indexing of dynamic time warping, Knowledge and Information Systems, vol.26, issue.3, pp.358-386, 2005.
DOI : 10.1109/TASSP.1978.1163149

E. Keogh, L. Wei, X. Xi, M. Vlachos, S. Lee et al., Supporting exact indexing of arbitrarily rotated shapes and periodic time series under Euclidean and warping distance measures, The VLDB Journal - The International Journal on Very Large Data Bases, pp.611-630, 2009.
DOI : 10.1007/s00778-008-0111-4

W. Kienzle, G. Bakır, M. Franz, and B. Schölkopf, Efficient Approximations for Support Vector Machines in Object Detection, Pattern Recognition, pp.54-61, 2004.
DOI : 10.1007/978-3-540-28649-3_7

D. Kimber and L. Wilcox, Acoustic segmentation for audio browsers, Computing Science and Statistics, pp.295-304, 1997.

O. Le Blouch and P. Collen, Méthode de segmentation parole non-parole, Rencontres Jeunes Chercheurs Parole, 2005.

K. Lee and D. Ellis, Audio-based semantic concept classification for consumer video, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.6, pp.1406-1416, 2010.

H. Lei and V. Govindaraju, Speeding up multi-class SVM evaluation by PCA and feature selection, Feature Selection for Data Mining, p.72, 2005.

D. Li, I. Sethi, N. Dimitrova, and T. McGee, Classification of general audio data for content-based retrieval, Pattern Recognition Letters, vol.22, issue.5, pp.533-544, 2001.
DOI : 10.1016/S0167-8655(00)00119-7

W. Liao, J. Wen, and J. Kuo, Streaming audio classification in smart home environments, Pattern Recognition (ACPR), 2011 First Asian Conference on, pp.593-597, 2011.

C. Lim and J. Chang, Adaptive Kernel Function of SVM for Improving Speech/Music Classification of 3GPP2 SMV, ETRI Journal, vol.33, issue.6, pp.871-879, 2011.
DOI : 10.4218/etrij.11.0110.0780

M. Lim and S. Chi, Acoustic shock protection device and method thereof, US Patent 8,954,322, 2015.

J. Lin, E. Keogh, S. Lonardi, and B. Chiu, A symbolic representation of time series, with implications for streaming algorithms, Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery , DMKD '03, pp.2-11, 2003.
DOI : 10.1145/882082.882086

J. Lin, E. Keogh, S. Lonardi, J. P. Lankford, and D. M. Nystrom, Visually mining and monitoring massive time series, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '04, pp.460-469, 2004.
DOI : 10.1145/1014052.1014104

J. Lin, E. Keogh, L. Wei, and S. Lonardi, Experiencing SAX: a novel symbolic representation of time series, Data Mining and Knowledge Discovery, vol.5, issue.2, pp.107-144, 2007.
DOI : 10.1007/s10618-007-0064-z

J. Lin, M. Vlachos, E. Keogh, D. Gunopulos, J. Liu et al., A MPAA-Based Iterative Clustering Algorithm Augmented by Nearest Neighbors Search for Time-Series Data Streams, Advances in Knowledge Discovery and Data Mining, pp.333-342, 2005.
DOI : 10.1007/11430919_40

J. Lin, S. Williamson, K. Borne, and D. Debarr, Pattern recognition in time series, Advances in Machine Learning and Data Mining for Astronomy, pp.617-645, 2012.

T. Lin, N. Kaminski, and Z. Bar-Joseph, Alignment and classification of time series gene expression in clinical studies, Bioinformatics, vol.24, issue.13, pp.i147-i155, 2008.
DOI : 10.1093/bioinformatics/btn152

B. Lkhagva, Y. Suzuki, and K. Kawagoe, New Time Series Data Representation ESAX for Financial Applications, 22nd International Conference on Data Engineering Workshops (ICDEW'06), p.115, 2006.
DOI : 10.1109/ICDEW.2006.99

J. Lin, E. Keogh, S. Lonardi, and P. Patel, Finding motifs in time series, Proc. of the 2nd Workshop on Temporal Data Mining, pp.53-68, 2002.

L. Lu, A digital realization of audio dynamic range control, Signal Processing Proceedings, 1998. ICSP'98. 1998 Fourth International Conference on, pp.1424-1427, 1998.

L. Lu, H. Jiang, and H. Zhang, A robust audio classification and segmentation method, Proceedings of the ninth ACM international conference on Multimedia , MULTIMEDIA '01, pp.203-211, 2001.
DOI : 10.1145/500141.500173

L. Ma, D. Smith, and B. Milner, Environmental Noise Classification for Context-Aware Applications, Database and Expert Systems Applications, pp.360-370, 2003.
DOI : 10.1007/978-3-540-45227-0_36

Z. Ma and A. Leijon, Modelling speech line spectral frequencies with Dirichlet mixture models, 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, pp.2370-2373, 2010.

Z. Ma and A. Leijon, Bayesian estimation of beta mixture models with variational inference, pp.2160-2173, 2011.

V. Malenovsky, T. Vaillancourt, W. Zhe, K. Choo, and V. Atti, Two-stage speech/music classifier with decision smoothing and sharpening in the EVS codec, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5718-5722, 2015.
DOI : 10.1109/ICASSP.2015.7179067

S. Malinowski, T. Guyet, R. Quiniou, and R. Tavenard, 1d-SAX: A Novel Symbolic Representation for Time Series, Advances in Intelligent Data Analysis XII, pp.273-284, 2013.
DOI : 10.1007/978-3-642-41398-8_24

URL : https://hal.archives-ouvertes.fr/halshs-00912512

R. G. Malkin and A. Waibel, Classifying User Environment for Mobile Applications using Linear Autoencoding of Ambient Audio, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., p.509, 2005.
DOI : 10.1109/ICASSP.2005.1416352

S. Mallat, A wavelet tour of signal processing, 1999.

B. Mathieu, S. Essid, T. Fillon, J. Prado, and G. Richard, Yaafe, an easy to use and efficient audio feature extraction software, ISMIR, pp.441-446, 2010.

D. McFerran and D. Baguley, Acoustic shock, The Journal of Laryngology & Otology, vol.121, issue.04, pp.301-305, 2007.
DOI : 10.1017/S0022215107006111

M. McKinney and J. Breebaart, Features for audio and music classification, Proc. ISMIR, pp.151-158, 2003.

G. McLachlan, Discriminant analysis and statistical pattern recognition, 2004.
DOI : 10.1002/0471725293

V. Megalooikonomou, G. Li, and Q. Wang, A dimensionality reduction technique for efficient similarity analysis of time series databases, Proceedings of the Thirteenth ACM conference on Information and knowledge management , CIKM '04, pp.160-161, 2004.
DOI : 10.1145/1031171.1031203

A. Meng, Temporal feature integration for music organisation, 2006.

A. Meng, P. Ahrendt, and J. Larsen, Improving Music Genre Classification By Short-Time Feature Integration, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., p.497, 2005.
DOI : 10.1109/ICASSP.2005.1416349

A. Meng, P. Ahrendt, J. Larsen, and L. K. Hansen, Temporal feature integration for music genre classification, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.5, pp.1654-1664, 2007.

S. Meunier, G. Rabau, and E. Friot, Annoyance and loudness of pure tones in noise : application to active control of fan noise, CFA/DAGA, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00088519

I. Mierswa, Non-convex and multi-objective optimization in data mining, 2009.

I. Mierswa and K. Morik, Automatic Feature Extraction for Classifying Audio Data, Machine Learning, vol.9, issue.1/2, pp.127-149, 2005.
DOI : 10.1007/s10994-005-5824-7

J. C. Milhinch, Acoustic shock injury : Real or imaginary, America Audiology Network, 2002.

K. Minami, A. Akutsu, H. Hamada, and Y. Tonomura, Video handling with music and speech detection, IEEE Multimedia, vol.5, issue.3, pp.17-25, 1998.
DOI : 10.1109/93.713301

F. Mörchen and A. Ultsch, Optimizing time series discretization for knowledge discovery, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , KDD '05, pp.660-665, 2005.
DOI : 10.1145/1081870.1081953

O. Mubarak, E. Ambikairajah, and J. Epps, Novel Features for Effective Speech and Music Discrimination, 2006 IEEE International Conference on Engineering of Intelligent Systems, pp.1-5, 2006.
DOI : 10.1109/ICEIS.2006.1703190

A. Mueen and E. Keogh, Online discovery and maintenance of time series motifs, Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '10, pp.1089-1098, 2010.
DOI : 10.1145/1835804.1835941

A. Mueen, E. J. Keogh, Q. Zhu, S. Cash, and M. B. Westover, Exact Discovery of Time Series Motifs, SDM, pp.473-484, 2009.
DOI : 10.1137/1.9781611972795.41

M. Müller, Analysis and retrieval techniques for motion and music data, 2009.

N. Nitanda, M. Haseyama, and H. Kitajima, Accurate audio-segment classification using feature extraction matrix, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., p.261, 2005.
DOI : 10.1109/ICASSP.2005.1415696

H. Shimodaira, K. Noma, M. Nakai, and S. Sagayama, Dynamic time-alignment kernel in support vector machine, Advances in neural information processing systems, p.921, 2002.

M. P. Norton and D. G. Karczub, Fundamentals of noise and vibration analysis for engineers, 2003.

S. Ntalampiras and N. Fakotakis, Modeling the Temporal Evolution of Acoustic Parameters for Speech Emotion Recognition, IEEE Transactions on Affective Computing, vol.3, issue.1, pp.116-125, 2012.
DOI : 10.1109/T-AFFC.2011.31

S. Ntalampiras, I. Potamitis, and N. Fakotakis, Exploiting Temporal Feature Integration for Generalized Sound Recognition, EURASIP Journal on Advances in Signal Processing, vol.6, issue.1, p.807162, 2009.
DOI : 10.1145/382043.382316

K. Paliwal, On the use of line spectral frequency parameters for speech recognition, Digital Signal Processing, vol.2, issue.2, pp.80-87, 1992.
DOI : 10.1016/1051-2004(92)90028-W

C. Panagiotakis and G. Tziritas, A speech/music discriminator based on RMS and zero-crossings, IEEE Transactions on Multimedia, vol.7, issue.1, pp.155-166, 2005.

The European Parliament and the Council, Directive 2003/10/EC on the minimum health and safety requirements regarding the exposure of workers to the risks arising from physical agents (noise), Official Journal of the European Union, vol.42, pp.38-44, 2003.

O. J. Pedersen, P. Lyregaard, and T. Poulsen, The round robin test on evaluation of loudness level of impulsive noise, 1977.

G. Peeters, A large set of audio features for sound description (similarity and classification) in the CUIDADO project, 2004.

G. Peeters, A generic system for audio indexing : Application to speech/music segmentation and music genre recognition, Proc. DAFX, pp.205-212, 2007.

G. Peeters and X. Rodet, Hierarchical Gaussian tree with inertia ratio maximization for the classification of large musical instruments databases, Proc. of the 6th Int. Conf. on Digital Audio Effects, 2003.

V. Peltonen, J. Tuomi, A. Klapuri, J. Huopaniemi, and T. Sorsa, Computational auditory scene recognition, Acoustics, Speech, and Signal Processing (ICASSP) IEEE International Conference on, pp.1941-1944, 2002.

N. D. Pham, Q. L. Le, and T. K. Dang, Two Novel Adaptive Symbolic Representations for Similarity Search in Time Series Databases, 2010 12th International Asia-Pacific Web Conference, pp.181-187, 2010.
DOI : 10.1109/APWeb.2010.23

J. Pinquier, J. Rouas, and R. André-Obrecht, Robust speech/music classification in audio documents, Entropy, vol.1, issue.2, p.3, 2002.

J. Platt, Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, Advances in large margin classifiers, pp.61-74, 1999.

R. Plomp and M. Bouman, Relation between Hearing Threshold and Duration for Tone Pulses, The Journal of the Acoustical Society of America, vol.31, issue.6, pp.749-758, 1959.
DOI : 10.1121/1.1907781

L. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

T. Rakthanmanon, B. Campana, A. Mueen, G. Batista, B. Westover et al., Addressing Big Data Time Series, ACM Transactions on Knowledge Discovery from Data, vol.7, issue.3, p.10, 2013.
DOI : 10.1145/2513092.2500489

M. Ramona, Classification automatique de flux radiophoniques par Machines à Vecteurs de Support, 2010.
URL : https://hal.archives-ouvertes.fr/pastel-00529331

M. Ramona and G. Richard, Segmentation parole/musique par machines à vecteurs de support, p.142, 2008.

M. Ramona, G. Richard, and B. David, Vocal detection in music with support vector machines, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1885-1888, 2008.
DOI : 10.1109/ICASSP.2008.4518002

M. Ramona, G. Richard, and B. David, Multiclass Feature Selection With Kernel Gram-Matrix-Based Criteria, IEEE Transactions on Neural Networks and Learning Systems, vol.23, issue.10, pp.1611-1623, 2012.
DOI : 10.1109/TNNLS.2012.2201748

C. Ratanamahatana, E. Keogh, A. J. Bagnall, and S. Lonardi, A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering, Advances in knowledge discovery and data mining, pp.771-777, 2005.
DOI : 10.1007/11430919_90

C. A. Ratanamahatana and E. Keogh, Making Time-series Classification More Accurate Using Learned Constraints, 2004.
DOI : 10.1137/1.9781611972740.2

S. Ravindran and D. V. Anderson, Audio Classification And Scene Recognition and for Hearing Aids, 2005 IEEE International Symposium on Circuits and Systems, pp.860-863, 2005.
DOI : 10.1109/ISCAS.2005.1464724

U. Rebbapragada, P. Protopapas, C. E. Brodley, and C. Alcock, Finding anomalous periodic time series, Machine Learning, vol.4, issue.4, pp.281-313, 2009.
DOI : 10.1007/s10994-008-5093-3

L. Regnier, Localization, characterization and recognition of singing voices, 2012.
URL : https://hal.archives-ouvertes.fr/tel-00687475

R. Rifkin and A. Klautau, In defense of one-vs-all classification, The Journal of Machine Learning Research, vol.5, pp.101-141, 2004.

J. Saunders, Real-time discrimination of broadcast speech/music, ICASSP-96, Acoustics, Speech, and Signal Processing IEEE International Conference on, pp.993-996, 1996.

N. Scaringella and G. Zoia, On the modeling of time information for automatic genre recognition systems in audio signals, Proceedings of the ISMIR 2005 6th International Conference on Music Information Retrieval, pp.12-15, 2005.

E. Scheirer and M. Slaney, Construction and evaluation of a robust multifeature speech/music discriminator, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1331-1334, 1997.
DOI : 10.1109/ICASSP.1997.596192

B. Schilit, N. Adams, and R. Want, Context-aware computing applications, Mobile Computing Systems and Applications, 1994. WMCSA 1994. First Workshop on, pp.85-90, 1994.

B. Schölkopf, A. Smola, and K. Müller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998.
DOI : 10.1007/BF02281970

B. Schölkopf and A. J. Smola, Learning with kernels : support vector machines, regularization, optimization, and beyond (Adaptive computation and machine learning), 2001.

G. Schwarz, Estimating the dimension of a model, The Annals of Statistics, pp.461-464, 1978.

R. V. Sharan and T. J. Moir, Comparison of multiclass SVM classification techniques in an audio surveillance application under mismatched conditions, 2014 19th International Conference on Digital Signal Processing, pp.83-88, 2014.
DOI : 10.1109/ICDSP.2014.6900805

G. Sharma, F. Jurie, and P. Pérez, Learning Non-linear SVM in Input Space for Image Classification, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00977304

J. Shawe-Taylor and A. Meng, An investigation of feature models for music genre classification using the support vector classifier, 2005.

M. Shi and A. Bermak, An efficient digital VLSI implementation of Gaussian mixture models-based classifier, Very Large Scale Integration (VLSI) Systems, IEEE Transactions on, vol.14, issue.9, pp.962-974, 2006.

J. Shieh and E. Keogh, iSAX: indexing and mining terabyte sized time series, Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp.623-631, 2008.

K. G. Shin and P. Ramanathan, Real-time computing: a new discipline of computer science and engineering, Proceedings of the IEEE, pp.6-24, 1994.
DOI : 10.1109/5.259423

J. Shlens, A tutorial on principal component analysis. arXiv preprint, 2014.

T. Soltani, D. Hermann, E. Cornu, H. Sheikhzadeh, and R. L. Brennan, An acoustic shock limiting algorithm using time and frequency domain speech features, 2004.

J. A. Stankovic, Real-time computing, Byte, pp.155-162, 1992.

J. A. Stankovic, M. Spuri, K. Ramamritham, and G. C. Buttazzo, Introduction, Deadline Scheduling for Real-Time Systems, pp.1-11, 1998.
DOI : 10.1007/978-1-4615-5535-3_1

G. Tzanetakis and P. Cook, Musical genre classification of audio signals, Speech and Audio Processing, IEEE Transactions on, vol.10, issue.5, pp.293-302, 2002.

T. van Waterschoot and M. Moonen, Comparative evaluation of howling detection criteria in notch-filter-based howling suppression, Preprints AES 126th Convention, 2009.

T. van Waterschoot and M. Moonen, Comparative evaluation of howling detection criteria in notch-filter-based howling suppression, J. Audio Eng. Soc, vol.58, issue.11, pp.923-940, 2010.

T. van Waterschoot and M. Moonen, Fifty Years of Acoustic Feedback Control: State of the Art and Future Challenges, Proc. IEEE, pp.288-327, 2011.
DOI : 10.1109/JPROC.2010.2090998

V. Vapnik, Statistical learning theory, 1998.

V. Vapnik, The nature of statistical learning theory, 2013.

A. J. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. Information Theory, IEEE Transactions on, vol.13, issue.2, pp.260-269, 1967.

M. Vlachos, G. Kollios, and D. Gunopulos, Discovering similar multidimensional trajectories, Proceedings 18th International Conference on Data Engineering, pp.673-684, 2002.
DOI : 10.1109/ICDE.2002.994784

D. Wang and G. Brown, Computational auditory scene analysis : Principles, algorithms, and applications, 2006.
DOI : 10.1109/9780470043387

J. Wang, Q. Wu, H. Deng, and Q. Yan, Real-time speech/music classification with a hierarchical oblique decision tree, Acoustics, Speech and Signal Processing IEEE International Conference on, pp.2033-2036, 2008.

W. Wang, W. Gao, and D. Ying, A fast and robust speech/music discrimination approach, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, pp.1325-1329, 2003.
DOI : 10.1109/ICICS.2003.1292679

C. Watkins, Advances in kernel methods, chapter dynamic alignment kernels, 2000.

G. M. Weiss, Mining with rarity, ACM SIGKDD Explorations Newsletter, vol.6, issue.1, pp.7-19, 2004.
DOI : 10.1145/1007730.1007734

K. West and S. Cox, Features and classifiers for the automatic classification of musical audio signals, ISMIR, 2004.

K. West and S. Cox, Finding an optimal segmentation for audio genre classification, pp.680-685, 2005.

M. Westcott, Acoustic shock injury (ASI), Acta Oto-Laryngologica, vol.125, issue.2, pp.54-58, 2006.
DOI : 10.1080/03655230600895531

L. Xie, Z. Fu, W. Feng, and Y. Luo, Pitch-density-based features and an SVM binary tree approach for multi-class audio classification in broadcast news, Multimedia Systems, pp.101-112, 2011.

B. Yi and C. Faloutsos, Fast time sequence indexing for arbitrary Lp norms, 2000.

W. Zalewski, F. Silva, H. D. Lee, A. G. Maletzke, and F. C. Wu, Time Series Discretization Based on the Approximation of the Local Slope Information, Advances in Artificial Intelligence – IBERAMIA 2012, pp.91-100, 2012.
DOI : 10.1007/978-3-642-34654-5_10

T. Zhang and C. Kuo, Hierarchical classification of audio data for archiving and retrieving, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3001-3004, 1999.
DOI : 10.1109/ICASSP.1999.757472

T. Zhang and C. Kuo, Video content parsing based on combined audio and visual information, Photonics East '99, International Society for Optics and Photonics, pp.78-89, 1999.

T. Zhang and C. Kuo, Audio content analysis for online audiovisual data segmentation and classification. Speech and Audio Processing, IEEE Transactions on, vol.9, issue.4, pp.441-457, 2001.

H. Zhou, A. Sadka, and R. Jiang, Feature extraction for speech and music discrimination, Content-Based Multimedia Indexing CBMI 2008. International Workshop on, pp.170-173, 2008.

A. Zils and F. Pachet, Extracting automatically the perceived intensity of music titles, Proceedings of the 6th COST-G6 Conference on Digital Audio Effects (DAFX03), 2003.