]. G. Tzanetakis-2005 and . Tzanetakis, Audio-based gender identification using bootstrapping, PACRIM. 2005 IEEE Pacific Rim Conference on Communications, Computers and signal Processing, 2005., p.432433, 2005.
DOI : 10.1109/PACRIM.2005.1517318

. Li, Content-based music similarity search and emotion detection, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.705-708, 2004.

]. A. Wang, An industrial strength audio search algorithm, Proceedings of the International Conference on Music Information Retrieval, pp.7-13, 2003.

]. A. Wang, The Shazam music recognition service, Communications of the ACM, vol.49, issue.8, pp.44-48, 2006.
DOI : 10.1145/1145287.1145312

]. W. Hess, Algorithms and Devices for Pitch Determination of Speech Signals, Phonetica, vol.39, issue.4-5, 1983.
DOI : 10.1159/000261664

M. A. Bartsch and G. H. Wakefield, To catch a chorus: using chromabased representations for audio thumbnailing A chorus-section detecting method for musical audio signals, Proceedings of the IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.15-18, 2001.

. Zhu, Precise pitch profile feature extraction from musical audio for key detection, IEEE Transactions on Multimedia, vol.8, issue.3, pp.575-584, 2006.

]. S. Pfeiffer, <title>Importance of perceptive adaptation of sound features in audio content processing</title>, Storage and Retrieval for Image and Video Databases VII, pp.328-337, 1999.
DOI : 10.1117/12.333852

]. S. Pfeiffer, Pause concepts for audio segmentation at different semantic levels, Proceedings of the ninth ACM international conference on Multimedia , MULTIMEDIA '01, pp.187-193, 2001.
DOI : 10.1145/500141.500171

. Zhang, Content-Based Audio Classifcation and Retrieval for Audiovisual Data Parsing, 2001.

. Lloyd, Least squares quantization in PCM, IEEE Transactions on Information Theory, vol.28, issue.2, pp.129-137, 1982.
DOI : 10.1109/TIT.1982.1056489

. Wold, Content-based classification, search, and retrieval of audio, IEEE Multimedia, vol.3, issue.3, p.2736, 1996.

. Ramalingam, Gaussian Mixture Modeling Using Short Time Fourier Transform Features for Audio Fingerprinting, 2005 IEEE International Conference on Multimedia and Expo, 2005.
DOI : 10.1109/ICME.2005.1521629

. Meng, Affective Expression Classification by a Multi-Stage Approach Based on Hidden Markov Models, Proceedings of 1st International Audio/Visual Emotion Challenge and Workshop (AVEC 2011), pp.378-387, 2011.