@. T. Durand, N. Thome, M. Cord, and S. Avila, Image classification using object detectors, 2013 IEEE International Conference on Image Processing, 2013.
DOI : 10.1109/ICIP.2013.6738894

URL : https://hal.archives-ouvertes.fr/hal-01078079

@. S. Avila, N. Thome, M. Cord, E. Valle, A. De et al., BossaNova at ImageCLEF 2012 Flickr Photo Annotation task, Working Notes of the Conference and Labs of the Evaluation Forum (CLEF), 2012.

@. S. Avila, N. Thome, M. Cord, E. Valle, A. De et al., BOSSA: Extended bow formalism for image classification, 2011 18th IEEE International Conference on Image Processing, pp.2966-2969, 2011.
DOI : 10.1109/ICIP.2011.6116268

URL : https://hal.archives-ouvertes.fr/hal-00625533

@. A. Lopes, S. Avila, A. Peixoto, R. Oliveira, A. De et al., A bag-of-features approach based on hue-SIFT descriptor for nude detection, 17th European Signal Processing Conference (EUSIPCO), pp.1552-1556, 2009.

B. Conferences, @. S. Avila, N. Thome, M. Cord, E. Valle et al., Extended bag-ofwords formalism for image classification, 26th Conference on Graphics, Patterns , and Images (SIBGRAPI) ? Workshop of Theses and Dissertations (WTD), 2013.

@. A. Lopes, S. Avila, A. Peixoto, R. Oliveira, M. Coelho et al., Nude Detection in Video Using Bag-of-Visual-Features, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing, 2009.
DOI : 10.1109/SIBGRAPI.2009.32

@. E. Valle, S. Avila, F. Souza, M. Coelho, A. De et al., Content-based filtering for video sharing social networks, Brazilian Symposium on Information and Computer System Security (SBSeg), 2012.

A. Abdullah, R. C. Veltkamp, and M. A. Wiering, Spatial pyramids and two-layer stacking SVM classifiers for image categorization: A comparative study, 2009 International Joint Conference on Neural Networks, pp.1130-1137, 2009.
DOI : 10.1109/IJCNN.2009.5178743

M. B. Ahmad and T. Choi, Local threshold and Boolean function based edge detection, IEEE Transactions on Consumer Electronics, vol.45, issue.3, pp.674-679, 1999.
DOI : 10.1109/30.793567

M. A. Aizerman, E. A. Braverman, and L. Rozonoer, Theoretical foundations of the potential function method in pattern recognition learning, Automation and Remote Control, pp.821-837, 1964.

E. Alpaydin, Introduction to Machine Learning (Adaptive Computation and Machine Learning ), p.35, 2010.

M. Ankerst, G. Kastenmüller, H. Kriegel, and T. Seidl, 3D Shape Histograms for Similarity Search and Classification in Spatial Databases, International Symposium on Advances in Spatial Databases, pp.207-226, 1999.
DOI : 10.1007/3-540-48482-5_14

S. Avila, N. Thome, M. Cord, E. Valle, D. A. Araújo et al., BOSSA: Extended bow formalism for image classification, 2011 18th IEEE International Conference on Image Processing, pp.2909-2912, 2011.
DOI : 10.1109/ICIP.2011.6116268

URL : https://hal.archives-ouvertes.fr/hal-00625533

S. Avila, N. Thome, M. Cord, E. Valle, D. A. Araújo et al., BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task, Working Notes of the Conference and Labs of the Evaluation Forum (CLEF, pp.38-80, 2012.

S. Avila, N. Thome, M. Cord, E. Valle, D. A. Araújo et al., Pooling in image representation: The visual codeword point of view, Computer Vision and Image Understanding, vol.117, issue.5, pp.453-465, 2013.
DOI : 10.1016/j.cviu.2012.09.007

URL : https://hal.archives-ouvertes.fr/hal-01172709

H. Azizpour and I. Laptev, Object Detection Using Strongly-Supervised Deformable Part Models, European conference on Computer Vision (ECCV), pp.836-849, 2012.
DOI : 10.1007/978-3-642-33718-5_60

URL : https://hal.archives-ouvertes.fr/hal-01063338

R. Baeza-yates and B. Ribeiro-neto, Modern Information Retrieval, p.19, 1999.

M. Basu, Gaussian-based edge-detection methods-a survey, IEEE International Conference on Systems, Man, and Cybernetics, pp.252-260, 2002.
DOI : 10.1109/TSMCC.2002.804448

H. Bay, A. Ess, T. Tuytelaars, V. Gool, and L. , Speeded-Up Robust Features (SURF), Computer Vision and Image Understanding, vol.110, issue.3, pp.346-359, 2008.
DOI : 10.1016/j.cviu.2007.09.014

H. Bay, T. Tuytelaars, and L. V. Gool, SURF: Speeded up robust features, European Conference on Computer Vision (ECCV), pp.404-417, 2006.
DOI : 10.1007/11744023_32

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.679.3046

P. R. Beaudet, Rotationally invariant image operators, International Joint Conference on Pattern Recognition, pp.579-583, 1978.

R. Behmo, P. Marcombes, A. Dalalyan, and V. Prinet, Towards Optimal Naive Bayes Nearest Neighbor, European Conference on Computer Vision (ECCV), pp.171-184, 2010.
DOI : 10.1007/978-3-642-15561-1_13

URL : https://hal.archives-ouvertes.fr/hal-00654399

R. E. Bellman, Adaptive control processes -A guided tour, p.90, 1961.

S. Belongie, J. Malik, and J. Puzicha, Shape matching and object recognition using shape contexts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.4, pp.509-522, 2002.
DOI : 10.1109/34.993558

Y. Bengio, Learning Deep Architectures for AI, Machine Learning, pp.1-127, 2009.
DOI : 10.1561/2200000006

Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, Greedy Layer-Wise Training of Deep Networks, Advances in Neural Information Processing Systems (NIPS), pp.153-160, 2007.

J. Benois-pineau, F. Precioso, and M. Cord, Visual indexing and retrieval, p.11, 2012.
DOI : 10.1007/978-1-4614-3588-4

URL : https://hal.archives-ouvertes.fr/hal-00695914

I. Biederman, Recognition-by-components: A theory of human image understanding., Psychological Review, vol.94, issue.2, pp.115-147, 1987.
DOI : 10.1037/0033-295X.94.2.115

I. Biederman, An Invitation to Cognitive Science: Visual Cognition, Visual Object Recognition, vol.2, issue.3, pp.121-165, 1995.

A. Binder, W. Samek, M. Kloft, C. Müller, K. Müller et al., The joint submission of the tu berlin and fraunhofer first (tubfi) to the ImageCLEF 2011 photo annotation task, Cross-Language Evaluation Forum (CLEF Notebook Papers, pp.95-119, 2011.

O. Boiman, E. Shechtman, and M. Irani, In defense of Nearest-Neighbor based image classification, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.37, 2008.
DOI : 10.1109/CVPR.2008.4587598

A. Bordes, New Algorithms for Large-Scale Support Vector Machines, p.33, 2010.
URL : https://hal.archives-ouvertes.fr/tel-00464007

A. Bordes, L. Bottou, and P. Gallinari, SGD-QN: Careful quasi-newton stochastic gradient descent, Journal of Machine Learning Research, vol.10, pp.1737-1754, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00750911

A. Bosch, A. Zisserman, and X. Muñoz, Image Classification using Random Forests and Ferns, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4409066

A. Bosch, A. Zisserman, and X. M. , Scene Classification Via pLSA, European Conference on Computer Vision (ECCV), pp.517-530, 2006.
DOI : 10.1007/11744085_40

L. Bottou, Stochastic gradient descent examples on toy problems, p.33, 2007.

L. Bottou and O. Bousquet, The tradeoffs of large scale learning, Advances in Neural Information Processing Systems (NIPS), pp.161-168, 2008.

H. Bouirouga, S. E. Fkihi, A. Jilbab, and D. Aboutajdine, Skin detection in pornographic videos using threshold technique, Journal of Theoretical and Applied Information Technology, vol.35, issue.1, pp.7-19, 2012.

Y. Boureau, F. Bach, Y. Lecun, and J. Ponce, Learning mid-level features for recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2559-2566, 2010.
DOI : 10.1109/CVPR.2010.5539963

Y. Boureau, L. Roux, N. Bach, F. Ponce, J. Lecun et al., Ask the locals: Multi-way local pooling for image recognition, 2011 International Conference on Computer Vision, pp.2651-2658, 2011.
DOI : 10.1109/ICCV.2011.6126555

URL : https://hal.archives-ouvertes.fr/hal-00646816

Y. Boureau, J. Ponce, and Y. Lecun, A theoretical analysis of feature pooling in visual recognition, International Conference on Machine Learning (ICML), pp.111-118, 2010.

L. Breiman, Bagging predictors, Machine Learning, pp.123-140, 1996.
DOI : 10.1007/BF00058655

L. Breiman, Random forests, Machine Learning, pp.5-32, 2001.

G. Brown, J. L. Wyatt, R. Harris, and X. Yao, Diversity creation methods: a survey and categorisation, Information Fusion, vol.6, issue.1, pp.5-20, 2005.
DOI : 10.1016/j.inffus.2004.04.004

G. J. Burghouts and J. Geusebroek, Performance evaluation of local colour invariants, Computer Vision and Image Understanding, vol.113, issue.1, pp.48-62, 2009.
DOI : 10.1016/j.cviu.2008.07.003

M. C. Burl, M. Weber, and P. Perona, A probabilistic approach to object recognition using local photometry and global geometry, European Conference on Computer Vision (ECCV), pp.628-641, 1998.
DOI : 10.1007/BFb0054769

G. Carneiro and D. Lowe, Sparse Flexible Models of Local Features, European Conference on Computer Vision (ECCV), pp.29-43, 2006.
DOI : 10.1007/11744078_3

C. Carson, S. Belongie, H. Greenspan, M. , and J. , Blobworld: image segmentation using expectation-maximization and its application to image querying, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.8, pp.24102-1038, 2002.
DOI : 10.1109/TPAMI.2002.1023800

C. Chang and C. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.1-27, 2011.
DOI : 10.1145/1961189.1961199

K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, pp.70-91, 2011.
DOI : 10.5244/C.25.76

R. Collobert and J. Weston, A unified architecture for natural language processing, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.160-167, 2008.
DOI : 10.1145/1390156.1390177

M. Cord and P. Cunningham, Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval. Cognitive Technologies, p.63, 2008.
DOI : 10.1007/978-3-540-75171-7

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, pp.273-297, 1995.
DOI : 10.1007/BF00994018

L. D. Costa, C. Jr, and R. M. , Shape Analysis and Classification: Theory and Practice, p.13, 2000.

N. Cristianini and J. Shawe-taylor, An introduction to support Vector Machines: and other kernel-based learning methods, p.32, 2000.
DOI : 10.1017/CBO9780511801389

G. Csurka, C. Bray, C. Dance, F. , and L. , Visual categorization with bags of keypoints, Workshop on Statistical Learning in Computer Vision, ECCV, pp.1-22, 2004.

P. Cunningham, Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval, chapter Dimension Reduction. Cognitive Technologies, p.28, 2008.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

R. Datta, D. Joshi, J. Li, W. , and J. , Image retrieval, ACM Computing Surveys, vol.40, issue.2, 2008.
DOI : 10.1145/1348246.1348248

D. Bimbo and A. , Visual information retrieval, 1999.

T. Deselaers, L. Pimenidis, and H. Ney, Bag-of-visual-words models for adult image classification and filtering, 2008 19th International Conference on Pattern Recognition, pp.1-4, 2008.
DOI : 10.1109/ICPR.2008.4761366

M. Douze, H. Jégou, H. Sandhawalia, L. Amsaleg, and C. Schmid, Evaluation of GIST descriptors for web-scale image search, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, pp.1-8, 2009.
DOI : 10.1145/1646396.1646421

URL : https://hal.archives-ouvertes.fr/inria-00394212

R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, p.38, 2001.

T. Endeshaw, J. Garcia, and A. Jakobsson, Fast classification of indecent video by low complexity repetitive motion detection, IEEE Applied Imagery Pattern Recognition Workshop, pp.1-7, 2008.

C. Faloutsos, R. Barber, M. Flickner, J. Hafner, W. Niblack et al., Efficient and effective Querying by Image Content, Journal of Intelligent Information Systems, vol.2, issue.6, pp.3-4231, 1994.
DOI : 10.1007/BF00962238

J. Farquhar, S. Szedmak, H. Meng, and J. Shawe-taylor, Improving bag-of-keypoints image categorisation, p.39, 2005.

L. Fei-fei, R. Fergus, and P. Perona, Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories, CVPR Workshop on Generative Model Based Vision, p.58, 2004.
DOI : 10.1016/j.cviu.2005.09.012

L. Fei-fei, R. Fergus, and P. Perona, Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories, Computer Vision and Image Understanding, vol.106, issue.1, pp.59-70, 2007.
DOI : 10.1016/j.cviu.2005.09.012

L. Fei-fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.524-531, 2005.
DOI : 10.1109/CVPR.2005.16

P. F. Felzenszwalb, R. B. Girshick, D. Mcallester, and D. And-ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1627-1645, 2010.
DOI : 10.1109/TPAMI.2009.167

P. F. Felzenszwalb and D. P. Huttenlocher, Pictorial Structures for Object Recognition, International Journal of Computer Vision, vol.61, issue.1, pp.55-79, 2005.
DOI : 10.1023/B:VISI.0000042934.15159.49

P. F. Felzenszwalb, D. A. Mcallester, and D. And-ramanan, A discriminatively trained, multiscale, deformable part model, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.42, 2008.
DOI : 10.1109/CVPR.2008.4587597

J. Feng, B. Ni, Q. Tian, Y. , and S. , Geometric ? p -norm feature pooling for image classification, Computer Vision and Pattern Recognition (CVPR), pp.2609-2704, 2011.

R. Fergus, Visual Object Category Recognition, p.42, 2005.

R. Fergus, P. Perona, and A. Zisserman, Object class recognition by unsupervised scale-invariant learning, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.264-271, 2003.
DOI : 10.1109/CVPR.2003.1211479

R. Fergus, P. Perona, and A. Zisserman, A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.380-397, 2005.
DOI : 10.1109/CVPR.2005.47

V. Ferrari, M. Marín-jiménez, and A. Zisserman, Progressive search space reduction for human pose estimation, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.43, 2008.
DOI : 10.1109/CVPR.2008.4587468

M. A. Fischler and R. A. Elschlager, The Representation and Matching of Pictorial Structures, IEEE Transactions on Computers, vol.22, issue.1, pp.67-92, 1973.
DOI : 10.1109/T-C.1973.223602

M. Fleck, D. A. Forsyth, and C. Bregler, Finding naked people, European Conference on Computer Vision (ECCV), pp.593-602, 1996.
DOI : 10.1007/3-540-61123-1_173

D. A. Forsyth and M. M. Fleck, Identifying nude pictures, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96, pp.103-108, 1996.
DOI : 10.1109/ACV.1996.572010

D. A. Forsyth and M. M. Fleck, Body plans, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.678-683, 1997.
DOI : 10.1109/CVPR.1997.609399

D. A. Forsyth and M. M. Fleck, Automatic detection of human nudes, International Journal of Computer Vision, vol.32, issue.1, pp.63-77, 1999.
DOI : 10.1023/A:1008145029462

J. Fournier, M. Cord, and S. Philipp-foliguet, RETIN: A Content-Based Image Indexing and Retrieval System, Pattern Analysis & Applications, vol.4, issue.2-3, pp.153-173, 2001.
DOI : 10.1007/PL00014576

Y. Freund and R. E. Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, European Conference on Computational Learning Theory, pp.23-37, 1995.
DOI : 10.1006/jcss.1997.1504

J. Friedman, T. Hastie, and R. Tibshirani, Additive logistic regression: a statistical view of boosting, p.35, 2000.

A. Frome, D. Huber, R. Kolluri, and T. Bülow, Recognizing Objects in Range Data Using Regional Point Descriptors, European Conference on Computer Vision (ECCV), pp.224-237, 2004.
DOI : 10.1007/978-3-540-24672-5_18

K. Fukushima and S. Miyake, Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position, Pattern Recognition, vol.15, issue.6, pp.455-469, 1982.
DOI : 10.1016/0031-3203(82)90024-3

B. Fulkerson, A. Vedaldi, and S. Soatto, Localizing Objects with Smart Dictionaries, European Conference on Computer Vision (ECCV), pp.179-192, 2008.
DOI : 10.1007/978-3-540-88682-2_15

S. Gao, I. W. Tsang, L. Chia, and P. Zhao, Local features are not lonely – Laplacian sparse coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3555-3561, 2010.
DOI : 10.1109/CVPR.2010.5539943

P. Gehler and S. Nowozin, On feature combination for multiclass object classification, 2009 IEEE 12th International Conference on Computer Vision, pp.221-228, 2009.
DOI : 10.1109/ICCV.2009.5459169

H. Goh, N. Thome, M. Cord, and J. Lim, Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines, European conference on Computer Vision (ECCV), pp.298-311, 2012.
DOI : 10.1007/978-3-642-33715-4_22

URL : https://hal.archives-ouvertes.fr/hal-00816428

G. Golub and C. Van-loan, Matrix Computations, p.29, 1996.

P. Gosselin, M. Cord, and S. Philipp-foliguet, Combining visual dictionary, kernel-based similarity and learning strategy for image category retrieval, Computer Vision and Image Understanding, vol.110, issue.3, pp.403-417, 2008.
DOI : 10.1016/j.cviu.2007.09.018

URL : https://hal.archives-ouvertes.fr/hal-00520290

K. Grauman and T. Darrell, The pyramid match kernel: discriminative classification with sets of image features, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1458-1465, 2005.
DOI : 10.1109/ICCV.2005.239

K. Grauman and B. Leibe, Visual Object Recognition, Synthesis Lectures on Artificial Intelligence and Machine Learning, vol.5, issue.2, p.44, 2011.
DOI : 10.2200/S00332ED1V01Y201103AIM011

M. Guillaumin, J. Verbeek, and C. Schmid, Multimodal semi-supervised learning for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.902-909, 2010.
DOI : 10.1109/CVPR.2010.5540120

URL : https://hal.archives-ouvertes.fr/inria-00548640

I. Guyon, B. E. Boser, and V. Vapnik, Automatic capacity tuning of very large vc-dimension classifiers, Advances in Neural Information Processing Systems (NIPS), pp.147-155, 1993.

S. Han and N. Vasconcelos, Biologically plausible saliency mechanisms improve feedforward object recognition, Vision Research, vol.50, issue.22, pp.2295-2307, 2010.
DOI : 10.1016/j.visres.2010.05.034

URL : http://doi.org/10.1016/j.visres.2010.05.034

R. M. Haralick, K. Shanmugam, and I. Dinstein, Textural Features for Image Classification, IEEE Transactions on Systems, Man, and Cybernetics, vol.3, issue.6, pp.610-621, 1973.
DOI : 10.1109/TSMC.1973.4309314

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, pp.147-151, 1988.
DOI : 10.5244/C.2.23

G. E. Hinton, Training Products of Experts by Minimizing Contrastive Divergence, Neural Computation, vol.22, issue.8, pp.1771-1800, 2002.
DOI : 10.1162/089976600300015385

G. E. Hinton and R. R. Salakhutdinov, Reducing the Dimensionality of Data with Neural Networks, Science, vol.313, issue.5786, pp.313504-507, 2006.
DOI : 10.1126/science.1127647

T. K. Ho, Random decision forest, International Conference on Document Analysis and Recognition (ICDAR), pp.278-282, 1995.

A. Holub and P. Perona, A Discriminative Framework for Modelling Object Classes, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.664-671, 2005.
DOI : 10.1109/CVPR.2005.25

A. D. Holub, M. Welling, and P. Perona, Combining generative models and Fisher kernels for object recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.136-143, 2005.
DOI : 10.1109/ICCV.2005.56

H. Hotelling, Analysis of a complex of statistical variables into principal components., Journal of Educational Psychology, vol.24, issue.6, pp.417-441, 1933.
DOI : 10.1037/h0071325

C. Hsieh, K. Chang, C. Lin, S. S. Keerthi, and S. Sundararajan, A dual coordinate descent method for large-scale linear SVM, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.408-415, 2008.
DOI : 10.1145/1390156.1390208

M. Hu, Visual pattern recognition by moment invariants, IRE Transactions on Information Theory, vol.8, issue.2, pp.179-187, 1962.

W. Hu, H. Zuo, O. Wu, Y. Chen, Z. Zhang et al., Recognition of adult images, videos, and web page bags, ACM Transactions on Multimedia Computing, Communications, and Applications, vol.7, issue.1, pp.7-8, 2011.
DOI : 10.1145/2037676.2037685

F. Huang and Y. Lecun, Large-scale learning with svm and convolutional nets for generic object categorization, Computer Vision and Pattern Recognition Conference (CVPR), p.40, 2006.

J. Huang, R. Kumar, S. Mitra, M. Zhu, W. Zabih et al., Spatial color indexing and applications, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), pp.245-268, 1999.
DOI : 10.1109/ICCV.1998.710779

D. Hubel and T. Wiesel, Receptive fields of single neurones in the cat's striate cortex, The Journal of Physiology, vol.148, issue.3, pp.574-591, 1959.
DOI : 10.1113/jphysiol.1959.sp006308

D. H. Hubel and T. N. Wiesel, Receptive fields and functional architecture of monkey striate cortex, The Journal of Physiology, vol.195, issue.1, pp.215-243, 1968.
DOI : 10.1113/jphysiol.1968.sp008455

M. Huiskes and M. Lew, The MIR flickr retrieval evaluation, Proceeding of the 1st ACM international conference on Multimedia information retrieval, MIR '08, pp.39-43, 2008.
DOI : 10.1145/1460096.1460104

M. J. Huiskes, B. Thomee, and M. S. Lew, New trends and ideas in visual concept detection, Proceedings of the international conference on Multimedia information retrieval, MIR '10, pp.527-536, 2010.
DOI : 10.1145/1743384.1743475

T. Jaakkola and D. Haussler, Exploiting generative models in discriminative classifiers, Advances in Neural Information Processing Systems, pp.487-493, 1998.

R. Jain, The Art of Computer Systems Performance Analysis: techniques for experimental design, measurement, simulation, and modeling, p.87, 1991.

C. Jansohn, A. Ulges, and T. M. Breuel, Detecting pornographic video content by combining image features with motion information, Proceedings of the seventeen ACM international conference on Multimedia, MM '09, pp.601-604, 2009.
DOI : 10.1145/1631272.1631366

K. Jarrett, K. Kavukcuoglu, M. Ranzato, and Y. Lecun, What is the best multi-stage architecture for object recognition?, 2009 IEEE 12th International Conference on Computer Vision, pp.2146-2153, 2009.
DOI : 10.1109/ICCV.2009.5459469

H. Jégou and O. Chum, Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening, European Conference on Computer Vision (ECCV, p.31, 2012.

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3304-3311, 2010.
DOI : 10.1109/CVPR.2010.5540039

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, pp.1704-1716, 2012.
DOI : 10.1109/TPAMI.2011.235

Y. Jia, C. Huang, D. , and T. , Beyond spatial pyramids: Receptive field learning for pooled image features, Computer Vision and Pattern Recognition (CVPR), pp.3370-3377, 2012.

Z. Jiang, Z. Lin, D. , and L. S. , Learning a discriminative dictionary for sparse coding via label consistent K-SVD, CVPR 2011, pp.1697-1704, 2011.
DOI : 10.1109/CVPR.2011.5995354

T. Joachims, Training linear SVMs in linear time, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '06, pp.217-226, 2006.
DOI : 10.1145/1150402.1150429

I. Jolliffe, Principal Component Analysis, p.28, 2002.
DOI : 10.1007/978-1-4757-1904-8

M. J. Jones and J. M. Rehg, Statistical color models with application to skin detection, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149), pp.81-96, 2002.
DOI : 10.1109/CVPR.1999.786951

F. Jurie and B. Triggs, Creating efficient codebooks for visual recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.604-610, 2005.
DOI : 10.1109/ICCV.2005.66

URL : https://hal.archives-ouvertes.fr/inria-00548511

K. Kavukcuoglu, M. Ranzato, and Y. Lecun, Fast inference in sparse coding algorithms with applications to object recognition, p.41, 2008.

K. Kavukcuoglu, P. Sermanet, Y. Boureau, K. Gregor, M. Mathieu et al., Learning convolutional feature hierachies for visual recognition, Advances in Neural Information Processing Systems (NIPS), p.40, 2010.

Y. Ke and R. Sukthankar, Pca-sift: a more distinctive representation for local image descriptors, Computer Vision and Pattern Recognition (CVPR), pp.506-513, 2004.

J. Kim and K. Grauman, Boundary preserving dense local regions, Computer Vision and Pattern Recognition (CVPR), pp.1553-1560, 2011.

T. Kohonen, Self-organized formation of topologically correct feature maps, Neurocomputing: foundations of research, pp.509-521, 1988.
DOI : 10.1007/BF00337288

P. Koniusz and K. Mikolajczyk, Spatial Coordinate Coding to reduce histogram representations, Dominant Angle and Colour Pyramid Match, 2011 18th IEEE International Conference on Image Processing, pp.661-664, 2011.
DOI : 10.1109/ICIP.2011.6116639

P. Koniusz, F. Yan, and K. Mikolajczyk, Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection, Special Issue on Visual Concept Detection, pp.479-492, 2013.
DOI : 10.1016/j.cviu.2012.10.010

J. Krapac, Image Representations for Ranking and Classification, p.77, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00650998

J. Krapac, J. Verbeeky, J. , and F. , Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, pp.1487-1494, 2011.
DOI : 10.1109/ICCV.2011.6126406

URL : https://hal.archives-ouvertes.fr/inria-00612277

A. Krizhevsky, I. Sutskever, and G. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems (NIPS), pp.1106-1114, 2012.

M. P. Kumar, A. Zisserman, T. , and P. H. , Efficient discriminative learning of parts-based models, 2009 IEEE 12th International Conference on Computer Vision, pp.552-559, 2009.
DOI : 10.1109/ICCV.2009.5459192

L. I. Kuncheva and C. J. Whitaker, Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy, Machine Learning, pp.181-207, 2003.

I. Laptev, On Space-Time Interest Points, International Journal of Computer Vision, vol.17, issue.8, pp.107-123, 2005.
DOI : 10.1007/s11263-005-1838-7

D. Larlus and F. Jurie, Latent mixture vocabularies for object categorization and segmentation, Image and Vision Computing, vol.27, issue.5, pp.523-534, 2009.
DOI : 10.1016/j.imavis.2008.04.022

URL : https://hal.archives-ouvertes.fr/inria-00548649

H. Larochelle, D. Erhan, A. Courville, J. Bergstra, and Y. Bengio, An empirical evaluation of deep architectures on problems with many factors of variation, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.473-480, 2007.
DOI : 10.1145/1273496.1273556

S. Lazebnik and M. Raginsky, Supervised Learning of Quantizer Codebooks by Information Loss Minimization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.7, pp.311294-1309, 2009.
DOI : 10.1109/TPAMI.2008.138

S. Lazebnik, C. Schmid, and J. Ponce, A sparse texture representation using local affine regions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.8, pp.271265-1278, 2005.
DOI : 10.1109/TPAMI.2005.151

URL : https://hal.archives-ouvertes.fr/inria-00548530

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

D. Le and S. Satoh, Nii, japan at ImageCLEF 2011 photo annotation task, Cross- Language Evaluation Forum (CLEF Notebook Papers, p.98, 2011.

A. Lechervy, P. Gosselin, and F. Precioso, Boosting kernel combination for multi-class image categorization, 2012 19th IEEE International Conference on Image Processing, p.60, 2012.
DOI : 10.1109/ICIP.2012.6467254

URL : https://hal.archives-ouvertes.fr/hal-00753156

Y. Lecun, B. Boser, J. S. Denker, R. E. Howard, W. Habbard et al., Advances in neural information processing systems (nips) In Handwritten digit recognition with a back-propagation network, pp.396-404, 1990.

Y. Lecun, F. J. Huang, and L. Bottou, Learning methods for generic object recognition with invariance to pose and lighting, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., pp.97-104, 2004.
DOI : 10.1109/CVPR.2004.1315150

Y. Lecun, K. Kavukvuoglu, and C. Farabet, Convolutional networks and applications in vision, Proceedings of 2010 IEEE International Symposium on Circuits and Systems, p.40, 2010.
DOI : 10.1109/ISCAS.2010.5537907

H. Lee, E. Chaitanya, and A. Ng, Sparse deep belief net model for visual area V2, Advances in Neural Information Processing Systems (NIPS), pp.873-880, 2008.

H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng, Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.609-616, 2009.
DOI : 10.1145/1553374.1553453

J. Lee, Y. Kuo, P. Chung, C. , and E. , Naked image detection based on adaptive and extensible skin color model, Pattern Recognition, vol.40, issue.8, pp.402261-2270, 2007.
DOI : 10.1016/j.patcog.2006.11.016

S. Lee, W. Shim, K. , and S. , Hierarchical system for objectionable video detection, IEEE Transactions on Consumer Electronics, vol.55, issue.2, pp.677-684, 2009.
DOI : 10.1109/TCE.2009.5174439

B. Leibe, A. Leonardis, and B. Schiele, An Implicit Shape Model for Combined Object Categorization and Segmentation, ECCV Workshop on Statistical Learning in Computer Vision, p.44, 2004.
DOI : 10.1007/11957959_26

B. Leibe, A. Leonardis, and B. Schiele, Robust Object Detection with Interleaved Categorization and Segmentation, International Journal of Computer Vision, vol.73, issue.2, pp.259-289, 2008.
DOI : 10.1007/s11263-007-0095-3

B. Leibe and B. Schiele, Interleaved Object Categorization and Segmentation, Procedings of the British Machine Vision Conference 2003, pp.759-768, 2003.
DOI : 10.5244/C.17.78

C. Leistner, A. Saffari, J. Santner, and H. Bischof, Semi-Supervised Random Forests, 2009 IEEE 12th International Conference on Computer Vision, pp.506-513, 2009.
DOI : 10.1109/ICCV.2009.5459198

V. Lepetit and P. Fua, Keypoint recognition using randomized trees, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.9, pp.1465-1479, 2006.
DOI : 10.1109/TPAMI.2006.188

M. Lew, N. Sebe, C. Djeraba, and R. Jain, Content-based multimedia information retrieval, ACM Transactions on Multimedia Computing, Communications, and Applications, 2006.
DOI : 10.1145/1126004.1126005

B. Li, R. Xiao, Z. Li, R. Cai, B. Lu et al., Rank-SIFT: Learning to rank repeatable local interest points, CVPR 2011, pp.1737-1744, 2011.
DOI : 10.1109/CVPR.2011.5995461

J. Li and N. M. Allinson, A comprehensive review of current local features for computer vision, Neurocomputing, vol.71, issue.10-12, pp.10-121771, 2008.
DOI : 10.1016/j.neucom.2007.11.032

Y. Linde, A. Buzo, and R. M. Gray, An Algorithm for Vector Quantizer Design, IEEE Transactions on Communications, vol.28, issue.1, pp.84-95, 1980.
DOI : 10.1109/TCOM.1980.1094577

T. Lindeberg, Feature detection with automatic scale selection, International Journal of Computer Vision, vol.30, issue.2, pp.79-116, 1998.
DOI : 10.1023/A:1008045108935

J. Liu and M. Shah, Scene Modeling Using Co-Clustering, 2007 IEEE 11th International Conference on Computer Vision, pp.1-7, 2007.
DOI : 10.1109/ICCV.2007.4408866

J. Liu, Y. Yang, and M. Shah, Learning semantic visual vocabularies using diffusion distance, Computer Vision and Pattern Recognition (CVPR), pp.461-468, 2009.

L. Liu, L. Wang, and X. Liu, In defense of soft-assignment coding, International Conference on Computer Vision (ICCV), pp.2486-2493, 2011.

N. Liu, E. Dellandrea, L. Chen, A. Trus, C. Zhu et al., LIRIS-Imagine at ImageCLEF 2012 Photo Annotation task, Working Notes of the 2012 Conference and Labs of the Evaluation Forum, p.120, 2012.

N. Liu, E. Dellandrea, C. Zhu, C. Bichot, C. et al., A selective weighted late fusion for visual concept recognition, ECCV 2012 Workshop on Information fusion in Computer Vision for Concept Recognition. 103, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01353059

Y. Liu, X. Wang, Y. Zhang, and S. Tang, Fusing Audio-Words with Visual Features for Pornographic Video Detection, 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications, pp.1488-1493, 2011.
DOI : 10.1109/TrustCom.2011.205

A. Lopes, S. Avila, A. Peixoto, R. Oliveira, M. Coelho et al., Nude Detection in Video Using Bag-of-Visual-Features, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing, pp.224-231, 2009.
DOI : 10.1109/SIBGRAPI.2009.32

A. Lopes, S. Avila, A. Peixoto, R. Oliveira, D. A. Araújo et al., A bag-of-features approach based on hue-sift descriptor for nude detection, European Signal Processing Conference (EUSIPCO), pp.1552-1556, 2009.

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

D. Lowe, Local naive bayes nearest neighbor for image classification, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3650-3656, 2012.

D. G. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999.
DOI : 10.1109/ICCV.1999.790410

L. Lu, K. Toyama, and G. D. Hager, A Two Level Approach for Scene Recognition, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.688-695, 2005.
DOI : 10.1109/CVPR.2005.51

W. Y. Ma and B. S. Manjunath, NeTra: A toolbox for navigating large image databases, Multimedia Systems, vol.7, issue.3, pp.184-198, 1999.
DOI : 10.1007/s005300050121

J. Mairal, F. Bach, J. Ponce, and G. Sapiro, Online learning for matrix factorization and sparse coding, Journal of Machine Learning Research, vol.11, pp.19-60, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00408716

J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman, Supervised dictionary learning, Advances in Neural Information Processing Systems (NIPS), pp.1033-1040, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00322431

B. S. Manjunath and W. Y. Ma, Texture features for browsing and retrieval of image data, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.18, issue.8, pp.837-842, 1996.
DOI : 10.1109/34.531803

E. Mbanya, S. Gerke, C. Hentschel, and P. Ndjiki-nya, Sample selection, category specific features and reasoning, Cross-Language Evaluation Forum (CLEF Notebook Papers, p.98, 2011.

K. Mikolajczy and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, pp.63-86, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2

K. Mikolajczyk and C. Schmid, An Affine Invariant Interest Point Detector, European Conference Computer Vision (ECCV, p.17, 2002.
DOI : 10.1007/3-540-47969-4_9

URL : https://hal.archives-ouvertes.fr/inria-00548252

K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.10, pp.1615-1630, 2005.
DOI : 10.1109/TPAMI.2005.188

URL : https://hal.archives-ouvertes.fr/inria-00548227

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, pp.43-72, 2005.
DOI : 10.1007/s11263-005-3848-x

URL : https://hal.archives-ouvertes.fr/inria-00548528

H. Müller, P. Clough, T. Deselaers, and B. Caputo, ImageCLEF: Experimental Evaluation in Visual Information Retrieval, p.50, 2010.
DOI : 10.1007/978-3-642-15181-1

F. Mokhtarian, Silhouette-based isolated object recognition through curvature scale space, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.17, issue.5, pp.539-544, 1995.
DOI : 10.1109/34.391387

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.455.6853

F. Moosmann, E. Nowak, J. , and F. , Randomized Clustering Forests for Image Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.9, pp.1632-1646, 2008.
DOI : 10.1109/TPAMI.2007.70822

URL : https://hal.archives-ouvertes.fr/inria-00548666

F. Moosmann, B. Triggs, J. , and F. , Fast discriminative visual codebooks using randomized clustering forests, Advances in Neural Information Processing Systems (NIPS), pp.985-992, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00203734

J. Mutch and D. G. Lowe, Object Class Recognition and Localization Using Sparse Features with Limited Receptive Fields, International Journal of Computer Vision, vol.5, issue.7, pp.45-57, 2008.
DOI : 10.1007/s11263-007-0118-0

E. Nadernejad, S. Sharifzadeh, and H. Hassanpour, Edge detection techniques: Evaluations and comparisons, Applied Mathematical Sciences, vol.2, issue.31, pp.1507-1520, 2008.

C. Nebauer, Evaluation of convolutional neural networks for visual recognition, IEEE Transactions on Neural Networks, vol.9, issue.4, pp.685-695, 1998.
DOI : 10.1109/72.701181

R. Negrel, D. Picard, P. H. , and G. , Using spatial pyramids with compacted vlat for image categorization, International Conference on Pattern Recognition (ICPR, p.30, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00753158

M. Nilsback and A. Zisserman, A Visual Vocabulary for Flower Classification, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), p.68, 2006.
DOI : 10.1109/CVPR.2006.42

D. Nister and H. Stewenius, Scalable Recognition with a Vocabulary Tree, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2161-2168, 2006.
DOI : 10.1109/CVPR.2006.264

E. Nowak, F. Jurie, and B. Triggs, Sampling Strategies for Bag-of-Features Image Classification, European Conference on Computer Vision (ECCV), p.38, 2006.
DOI : 10.1007/11744085_38

URL : https://hal.archives-ouvertes.fr/hal-00203752

S. Nowak, K. Nagel, and J. Liebetrau, The CLEF 2011 Photo Annotation and Concept-based Retrieval Tasks, Cross-Language Evaluation Forum (CLEF Notebook Papers, pp.51-58, 2011.

A. Oliva and A. Torralba, Modeling the shape of the scene: A holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

G. L. Oliveira, E. R. Nascimento, A. W. Vieira, and M. F. Campos, Sparse Spatial Coding: A novel approach for efficient and accurate object recognition, 2012 IEEE International Conference on Robotics and Automation, pp.2592-2598, 2012.
DOI : 10.1109/ICRA.2012.6224785

A. Opelt, M. Fussenegger, A. Pinz, and P. Auer, Weak Hypotheses and Boosting for Generic Object Detection and Recognition, European Conference on Computer Vision (ECCV), pp.71-84, 2004.
DOI : 10.1007/978-3-540-24671-8_6

P. Ott and M. Everingham, Shared parts for deformable part-based models, CVPR 2011, pp.1513-1520, 2011.
DOI : 10.1109/CVPR.2011.5995357

D. Parikh, C. L. Zitnick, C. , and T. , Unsupervised learning of hierarchical spatial structures in images, 2009 IEEE Conference on Computer Vision and Pattern Recognition, p.38, 2009.
DOI : 10.1109/CVPR.2009.5206549

S. Parizi, J. Oberlin, and P. Felzenszwalb, Reconfigurable models for scene recognition, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.2775-2782, 2012.
DOI : 10.1109/CVPR.2012.6248001

G. Pass and R. Zabih, Histogram refinement for content-based image retrieval, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96, p.13, 1996.
DOI : 10.1109/ACV.1996.572008

O. A. Penatti, E. Valle, D. S. Torres, and R. , Encoding Spatial Arrangement of Visual Words, Conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications (CIARP), pp.240-247, 2011.
DOI : 10.1023/B:VISI.0000027790.02288.f2

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, p.80, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, C. Dance, G. Csurka, and M. Bressan, Adapted Vocabularies for Generic Visual Categorization, European Conference on Computer Vision (ECCV), pp.464-475, 2006.
DOI : 10.1007/11744085_36

F. Perronnin, Y. Liu, J. Sánchez, and H. Poirier, Large-scale image retrieval with compressed Fisher vectors, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3384-3391, 2010.
DOI : 10.1109/CVPR.2010.5540009

F. Perronnin, J. Sánchez, and Y. Liu, Large-scale image categorization with explicit data embedding, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2297-2304, 2010.
DOI : 10.1109/CVPR.2010.5539914

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, European Conference on Computer Vision (ECCV), pp.143-156, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.22, 2008.
DOI : 10.1109/CVPR.2008.4587635

D. Picard and P. Gosselin, Improving image similarity with vectors of locally aggregated tensors, 2011 18th IEEE International Conference on Image Processing, pp.669-672, 2011.
DOI : 10.1109/ICIP.2011.6116641

URL : https://hal.archives-ouvertes.fr/hal-00591993

D. Picard, N. Thome, and M. Cord, An efficient system for combining complementary kernels in complex visual categorization tasks, 2010 IEEE International Conference on Image Processing, pp.3877-3880, 2010.
DOI : 10.1109/ICIP.2010.5651051

URL : https://hal.archives-ouvertes.fr/hal-00656365

D. Picard, N. Thome, and M. Cord, Learning geometric combinations of gaussian kernels with alternating quasi-newton algorithm, European Symposium on Artificial Neural Networks (ESANN), p.98, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00705374

J. C. Platt, Fast training of support vector machines using sequential minimal optimization, Advances in kernel methods, pp.185-208, 1999.

J. Prewitt, Picture processing and Psychopictorics, chapter Object Enhancement and Extraction, p.14, 1970.

P. Quelhas, F. Monay, J. Odobez, D. Gatica-perez, and T. Tuytelaars, A Thousand Words in a Scene, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.9, pp.291575-1589, 2007.
DOI : 10.1109/TPAMI.2007.1155

D. Ramanan, D. A. Forsyth, and A. Zisserman, Tracking People by Learning Their Appearance, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.1, pp.65-81, 2007.
DOI : 10.1109/TPAMI.2007.250600

D. Ramanan and C. Sminchisescu, Training Deformable Models for Localization, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.206-213, 2006.
DOI : 10.1109/CVPR.2006.315

M. Ranzato, Y. Boureau, and Y. Lecun, Sparse feature learning for deep belief networks, Advances in Neural Information Processing Systems (NIPS), p.41, 2007.

M. Ranzato, F. Huang, Y. Boureau, and Y. Lecun, Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, p.40, 2007.
DOI : 10.1109/CVPR.2007.383157

N. Rea, G. Lacey, C. Lambe, and R. Dahyot, Multimodal periodicity analysis for illicit content detection in videos, 3rd European Conference on Visual Media Production (CVMP 2006). Part of the 2nd Multimedia Conference 2006, pp.106-114, 2006.
DOI : 10.1049/cp:20061978

C. Ries and R. Lienhart, A survey on visual adult image recognition, Multimedia Tools and Applications, pp.1-28, 2012.
DOI : 10.1007/s11042-012-1132-y

M. Riesenhuber and T. Poggio, Hierarchical models of object recognition in cortex, Nature Neuroscience, vol.2, pp.1019-1025, 1999.

A. Rocha, D. C. Hauagge, J. Wainer, and S. Goldenstein, Automatic Produce Classification from Images Using Color, Texture and Appearance Cues, 2008 XXI Brazilian Symposium on Computer Graphics and Image Processing, pp.3-10, 2008.
DOI : 10.1109/SIBGRAPI.2008.9

L. Rokach, Ensemble-based classifiers, Artificial Intelligence Review, vol.13, issue.4, pp.1-39, 2010.
DOI : 10.1007/s10462-009-9124-7

H. A. Rowley, Y. Jing, and S. Baluja, Large scale image-based adult-content filtering, International Conference on Computer Vision Theory and Applications (VISAPP), pp.290-296, 2006.

O. Russakovsky, Y. Lin, K. Yu, and L. Fei-fei, Object-Centric Spatial Pooling for Image Classification, European Conference on Computer Vision (ECCV), pp.1-15, 2012.
DOI : 10.1007/978-3-642-33709-3_1

B. Safadi, Indexation sémantique des images et des vidéos par apprentissage actif, p.31, 2012.

A. Saffari, H. Grabner, and H. Bischof, SERBoost: Semi-supervised Boosting with Expectation Regularization, European Conference on Computer Vision, pp.588-601, 2008.
DOI : 10.1007/978-3-540-88690-7_44

J. Sánchez, F. Perronnin, and T. E. , Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, pp.33-38, 2012.
DOI : 10.1016/j.patrec.2012.07.019

R. E. Schapire, The strength of weak learnability, Machine Learning, pp.197-227, 1990.

R. E. Schapire, Y. Freund, P. Bartlett, L. , and W. S. , Boosting the margin: a new explanation for the effectiveness of voting methods, The Annals of Statistics, vol.26, issue.5, pp.1651-1686, 1998.
DOI : 10.1214/aos/1024691352

C. Schmid, R. Mohr, and C. Bauckhage, Evaluation of interest point detectors, International Journal of Computer Vision, vol.37, issue.2, pp.151-172, 2000.
DOI : 10.1023/A:1008199403446

URL : https://hal.archives-ouvertes.fr/inria-00548302

B. Schölkopf, A. Smola, and K. Müller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998.
DOI : 10.1007/BF02281970

B. Scholkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization , Optimization, and Beyond, p.32, 2001.

N. Sebe, I. Cohen, A. Garg, and T. Huang, Machine Learning in Computer Vision, p.63, 2005.

T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio, Robust Object Recognition with Cortex-Like Mechanisms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.3, pp.29411-426, 2007.
DOI : 10.1109/TPAMI.2007.56

S. Shalev-shwartz, Y. Singer, and N. Srebro, Pegasos, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.807-814, 2007.
DOI : 10.1145/1273496.1273598

J. Shawe-taylor and N. Cristianini, Kernel Methods for Pattern Analysis, p.79, 2004.
DOI : 10.1017/CBO9780511809682

M. B. Short, L. Black, A. H. Smith, C. T. Wetterneck, W. et al., A Review of Internet Pornography Use Research: Methodology and Content from the Past 10 Years, Cyberpsychology, Behavior, and Social Networking, vol.15, issue.1, pp.13-23, 2012.
DOI : 10.1089/cyber.2010.0477

J. Shotton, M. Johnson, and R. Cipolla, Semantic texton forests for image categorization and segmentation, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.36, 2008.
DOI : 10.1109/CVPR.2008.4587503

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.91-93, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

P. D. Smolensky, J. L. Mcclelland, and C. Research-group, Information processing in dynamical systems: foundations of harmony theory, Parallel distributed processing: explorations in the microstructure of cognition, pp.194-281, 1986.

I. E. Sobel, Camera models and machine perception, p.14, 1970.

C. Steel, The mask-sift cascading classifier for pornography detection, World Congress on Internet Security (WorldCIS), pp.139-142, 2012.

R. O. Stehling, M. A. Nascimento, and A. X. Falcão, A compact and efficient image retrieval approach based on border/interior pixel classification, Proceedings of the eleventh international conference on Information and knowledge management , CIKM '02, pp.102-109, 2002.
DOI : 10.1145/584792.584812

M. Stricker and M. Orengo, Similarity of color images, Storage and Retrieval for Image and Video Databases, pp.381-392, 1995.

Y. Su and F. Jurie, Semantic contexts and fisher vectors for the ImageCLEF 2011 photo annotation task, Cross-Language Evaluation Forum (CLEF Notebook Papers, pp.95-98, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00808656

M. J. Swain and D. H. Ballard, Color indexing, International Journal of Computer Vision, vol.31, issue.1, pp.11-32, 1991.
DOI : 10.1007/BF00130487

H. Tamura, S. Mori, Y. , and T. , Textural Features Corresponding to Visual Perception, IEEE Transactions on Systems, Man, and Cybernetics, vol.8, issue.6, pp.460-473, 1978.
DOI : 10.1109/TSMC.1978.4309999

G. Taylor, R. Fergus, Y. Lecun, and C. Bregler, Convolutional Learning of Spatio-temporal Features, European Conference on Computer Vision (ECCV), p.40, 2010.
DOI : 10.1007/978-3-642-15567-3_11

C. Thériault, N. Thome, and M. Cord, HMAX-S: Deep scale representation for biologically inspired image categorization, 2011 18th IEEE International Conference on Image Processing, pp.1261-1264, 2011.
DOI : 10.1109/ICIP.2011.6115663

C. Thériault, N. Thome, and M. Cord, Extended Coding and Pooling in the HMAX Model, IEEE Transactions on Image Processing, vol.22, issue.2, p.40, 2012.
DOI : 10.1109/TIP.2012.2222900

B. Thomee and A. Popescu, Overview of the ImageCLEF 2012 Flickr Photo Annotation and Retrieval Task, CLEF (Online Working Notes, pp.58-83, 2012.

S. Thorpe, D. Fize, and C. Marlot, Speed of processing in the human visual system, Nature, vol.381, issue.6582, pp.520-522, 1996.
DOI : 10.1038/381520a0

E. Tola, V. Lepetit, and P. Fua, DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.5, pp.815-830, 2010.
DOI : 10.1109/TPAMI.2009.77

X. Tong, L. Duan, C. Xu, Q. Tian, L. Hanqing et al., Periodicity detection of local motion, IEEE International Conference on Multimedia and Expo (ICME), pp.650-653, 2005.

A. Torralba, A. Fergus, and Y. Weiss, Small codes and large image databases for recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.14, 2008.
DOI : 10.1109/CVPR.2008.4587633

A. Torralba, K. P. Murphy, W. T. Freeman, R. , and M. A. , Context-based vision system for place and object recognition, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238354

A. Trémeau, S. Tominaga, and K. N. Plataniotis, Color in Image and Video Processing: Most Recent Trends and Future Research Directions, EURASIP Journal on Image and Video Processing, vol.23, issue.7, pp.1-26, 2008.
DOI : 10.1889/1.2036350

C. Tsai, Training support vector machines based on stacked generalization for image classification, Neurocomputing, vol.64, pp.497-503, 2005.
DOI : 10.1016/j.neucom.2004.08.005

K. Tsuda, T. Kin, and K. Asai, Marginalized kernels for biological sequences, International Conference on Intelligent Systems for Molecular Biology (ISMB), pp.268-275, 2002.
DOI : 10.1093/bioinformatics/18.suppl_1.S268

M. Tuceryan and A. K. Jain, Handbook of Pattern Recognition and Computer Vision, chapter Texture Analysis, p.13, 2000.

T. Tuytelaars, Dense interest points, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2281-2288, 2010.
DOI : 10.1109/CVPR.2010.5539911

T. Tuytelaars, M. Fritz, K. Saenko, D. , and T. , The NBNN kernel, 2011 International Conference on Computer Vision, p.38, 2011.
DOI : 10.1109/ICCV.2011.6126449

T. Tuytelaars and K. Mikolajczyk, Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision, pp.177-280, 2008.

A. Ulges, C. Schulze, D. Borth, and A. Stahl, Pornography detection in video benefits (a lot) from a multi-modal approach, Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis, AMVA '12, pp.21-26, 2012.
DOI : 10.1145/2390214.2390222

A. Ulges and A. Stahl, Automatic detection of child pornography using color visual words, 2011 IEEE International Conference on Multimedia and Expo, pp.1-6, 2011.
DOI : 10.1109/ICME.2011.6011977

M. M. Ullah and I. Laptev, Actlets: A novel local representation for human action recognition in video, 2012 19th IEEE International Conference on Image Processing, pp.777-780, 2012.
DOI : 10.1109/ICIP.2012.6466975

URL : https://hal.archives-ouvertes.fr/hal-01063332

Y. Ushiku, H. Muraoka, S. Inaba, T. Fujisawa, K. Yasumoto et al., ISI at ImageCLEF 2012: Scalable System for Image Annotation, Working Notes of the 2012 Conference and Labs of the Evaluation Forum, p.104, 2012.

E. Valle, S. Avila, L. Da, A. Jr, F. De-souza et al., Contentbased filtering for video sharing social networks, Brazilian Symposium on Information and Computer System Security, p.109, 2012.

K. Van-de-sande, T. Gevers, and C. Snoek, Evaluating Color Descriptors for Object and Scene Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1582-1596, 2010.
DOI : 10.1109/TPAMI.2009.154

K. E. Van-de-sande and C. G. Snoek, The university of amsterdam's concept detection system at ImageCLEF 2011, Cross-Language Evaluation Forum (CLEF Notebook Papers, p.98, 2011.

J. Van-de-weijer and C. Schmid, Coloring Local Feature Extraction, European Conference on Computer Vision (ECCV), pp.334-348, 2006.
DOI : 10.1002/col.10049

URL : https://hal.archives-ouvertes.fr/inria-00548576

J. Van-gemert, C. Veenman, A. Smeulders, and J. Geusebroek, Visual Word Ambiguity, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.7, pp.1271-1283, 2010.
DOI : 10.1109/TPAMI.2009.132

J. C. Van-gemert, J. Geusebroek, C. J. Veenman, and A. W. Smeulders, Kernel Codebooks for Scene Categorization, European Conference on Computer Vision (ECCV), pp.696-709, 2008.
DOI : 10.1007/978-3-540-88690-7_52

J. C. Van-gemert, J. Geusebroek, C. J. Veenman, C. G. Snoek, and A. W. Smeulders, Robust Scene Categorization by Learning Image Statistics in Context, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06), p.39, 2006.
DOI : 10.1109/CVPRW.2006.177

V. N. Vapnik, Statistical Learning Theory, p.32, 1998.

V. N. Vapnik and A. Lerner, Pattern recognition using generalized portrait method. Automation and Remote Control, pp.774-780, 1963.

A. Vedaldi and B. Fulkerson, VLFeat -An open and portable library of computer vision algorithms, ACM International Conference on Multimedia, p.84, 2010.

A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman, Multiple kernels for object detection, 2009 IEEE 12th International Conference on Computer Vision, pp.606-613, 2009.
DOI : 10.1109/ICCV.2009.5459183

A. Vedaldi and A. Zisserman, Efficient additive kernels via explicit feature maps, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol.34, issue.70, p.120, 2012.

V. Viitaniemi and J. Laaksonen, Experiments on Selection of Codebooks for Local Image Feature Histograms, International Conference on Visual Information Systems: Web-Based Visual Information Search and Management, pp.126-137, 2008.
DOI : 10.1007/978-3-540-85891-1_16

P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol, Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.1096-1103, 2008.
DOI : 10.1145/1390156.1390294

J. Vogel and B. Schiele, Semantic Modeling of Natural Scenes for Content-Based Image Retrieval, International Journal of Computer Vision, vol.10, issue.1, pp.133-157, 2007.
DOI : 10.1007/s11263-006-8614-1

H. Wang, A. Klaser, C. Schmid, and C. Liu, Action recognition by dense trajectories, CVPR 2011, pp.3169-3176, 2011.
DOI : 10.1109/CVPR.2011.5995407

URL : https://hal.archives-ouvertes.fr/inria-00583818

J. Wang, J. Yang, K. Yu, F. Lv, T. Huang et al., Locality-constrained Linear Coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3360-3367, 2010.
DOI : 10.1109/CVPR.2010.5540018

M. Weber, M. Welling, and P. Perona, Towards automatic discovery of object categories, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.2101-2108, 2000.
DOI : 10.1109/CVPR.2000.854754

J. Weston, F. Ratle, C. , and R. , Deep learning via semi-supervised embedding, International Conference on Machine Learning (ICML), pp.1168-1175, 2008.

J. Willamowski, D. Arregui, G. Csurka, C. R. Dance, F. et al., Categorizing nine visual classes using local appearance descriptors, International Conference on Pattern Recognition (ICPR, p.38, 2004.

C. Williams and M. Seeger, Using the nyström method to speed up kernel machines, Advances in Neural Information Processing Systems (NIPS), pp.682-688, 2001.

J. Winn and A. Criminisi, Object class recognition at a glance, Computer Vision and Pattern Recognition (CVPR, p.36, 2006.

J. Winn, A. Criminisi, and T. Minka, Object categorization by learned universal visual dictionary, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1800-1807, 2005.
DOI : 10.1109/ICCV.2005.171

L. Wolf and I. Martin, Robust Boosting for Learning from Few Examples, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.359-364, 2005.
DOI : 10.1109/CVPR.2005.305

D. H. Wolpert, Stacked generalization, Neural Networks, vol.5, issue.2, p.36, 1992.
DOI : 10.1016/S0893-6080(05)80023-1

E. S. Xioufis, G. Tsoumakas, and I. Vlahavas, MLKD's Participation at Imageof the 2012 Conference and Labs of the Evaluation Forum Photo Annotation and Concept-based Retrieval Tasks, Working Notes of the 2012 Conference and Labs of the Evaluation Forum, p.104, 2012.

J. Yang, Y. Jiang, A. G. Hauptmann, and C. Ngo, Evaluating bag-of-visual-words representations in scene classification, Proceedings of the international workshop on Workshop on multimedia information retrieval , MIR '07, pp.197-206, 2007.
DOI : 10.1145/1290082.1290111

J. Yang, Y. Li, Y. Tian, L. Duan, and W. Gao, Group sensitive multiple kernel learning for object categorization, International Conference on Computer Vision (ICCV, p.57, 2009.

J. Yang, K. Yu, Y. Gong, and T. Huang, Linear spatial pyramid matching using sparse coding for image classification, Computer Vision and Pattern Recognition (CVPR), pp.1794-1801, 2009.

M. Yang, K. Kpalma, and J. And-ronsin, A Survey of Shape Feature Extraction Techniques, Pattern Recognition, pp.43-90, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00446037

Y. Yang and D. Ramanan, Articulated pose estimation with flexible mixtures-of-parts, CVPR 2011, pp.1385-1392, 2011.
DOI : 10.1109/CVPR.2011.5995741

B. Yao, A. Khosla, and L. Fei-fei, Combining randomization and discrimination for finegrained image categorization, Computer Vision and Pattern Recognition (CVPR), p.36, 2011.

K. Yu, T. Zhang, and Y. Gong, Nonlinear learning using local coordinate coding, Advances in Neural Information Processing Systems (NIPS), pp.2223-2231, 2009.

D. Zhang and G. Lu, Review of shape representation and description techniques, Pattern Recognition, vol.37, issue.1, pp.1-19, 2004.
DOI : 10.1016/j.patcog.2003.07.008

H. Zhang, A. C. Berg, M. Maire, M. , and J. , SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2126-2136, 2006.
DOI : 10.1109/CVPR.2006.301

W. Zhang and T. G. Dietterich, Learning visual dictionaries and decision lists for object recognition, 2008 19th International Conference on Pattern Recognition, pp.1-4, 2008.
DOI : 10.1109/ICPR.2008.4761769

H. Zheng and M. Daoudi, Blocking adult images based on statistical skin detection, Electronic Letters on Computer Vision and Image Analysis, vol.4, issue.2, p.107, 2004.

X. Zhou, K. Yu, T. Zhang, and T. Huang, Image Classification Using Super-Vector Coding of Local Image Descriptors, European Conference on Computer Vision (ECCV), pp.141-154, 2010.
DOI : 10.1007/978-3-642-15555-0_11

L. Zhu, Y. Chen, A. L. Yuille, F. , and W. T. , Latent hierarchical structural learning for object detection, Computer Vision and Pattern Recognition (CVPR), pp.1062-1069, 2010.

D. Ziou and S. Tabbone, Edge Detection Techniques -An Overview, International Journal of Pattern Recognition and Image Analysis, vol.8, pp.537-559, 1998.
URL : https://hal.archives-ouvertes.fr/inria-00098446

A. Znaidia, A. Shabou, A. Popescu, H. Le-borgne, and C. Hudelot, Multimodal feature generation framework for semantic image classification, Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, ICMR '12, pp.1-8, 2012.
DOI : 10.1145/2324796.2324842

URL : https://hal.archives-ouvertes.fr/hal-00825190

H. Zuo, W. Hu, and O. Wu, Patch-based skin color detection and its application to pornography image filtering, Proceedings of the 19th international conference on World wide web, WWW '10, pp.1227-1228, 2010.
DOI : 10.1145/1772690.1772887