T. Ahonen, A. Hadid, and M. Pietikainen, Face Description with Local Binary Patterns: Application to Face Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.12, p.28, 2006.
DOI : 10.1109/TPAMI.2006.244

H. Bay, A. Ess, T. Tuytelaars, and L. V. , Speeded-Up Robust Features (SURF), Computer Vision and Image Understanding, vol.110, issue.3, pp.346-359, 2008.
DOI : 10.1016/j.cviu.2007.09.014

P. Belhumeur, J. Hespanha, and D. Kriegman, Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection, Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol.19, issue.7, pp.711-720, 1997.
DOI : 10.1007/BFb0015522

S. Belongie, J. Malik, and J. Puzicha, Shape matching and object recognition using shape contexts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.4, pp.509-522, 2001.
DOI : 10.1109/34.993558

A. C. Berg, P. N. Belhumeur, N. Kumar, and S. K. Nayar, Attribute and simile classifiers for face verification, International Conference on Computer Vision (ICCV), 2009.

H. Bilen, V. Namboodiri, and L. Van-gool, Object and Action Classification with Latent Variables, Procedings of the British Machine Vision Conference 2011, 2011.
DOI : 10.5244/C.25.17

C. M. Bishop, Pattern recognition and machine learning, 2006.

A. Bosch, A. Zisserman, and X. Munoz, Representing shape with a spatial pyramid kernel, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, 2007.
DOI : 10.1145/1282280.1282340

P. Brodatz, Textures: A Photographic Album for Artists and Designers, 1966.

C. J. Burges, A tutorial on support vector machines for pattern recognition, Data Mining and Knowledge Discovery, vol.2, issue.2, pp.121-167, 1998.
DOI : 10.1023/A:1009715923555

Y. Cao, C. Wang, Z. Li, L. Zhang, and L. Zhang, Spatial-bag-of-features, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540021

B. Caputo, E. Hayman, and P. Mallikarjuna, Class-specific material categorisation, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.54

O. Chapelle, Training a Support Vector Machine in the Primal, Neural Computation, vol.6, issue.5, pp.1155-1178, 2007.
DOI : 10.1198/106186005X25619

J. Chen, S. Shan, C. He, G. Zhao, M. Pietikainen et al., WLD: A Robust Local Image Descriptor, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1705-1720, 2010.
DOI : 10.1109/TPAMI.2009.155

M. Croiser and L. D. Griffin, Using Basic Image Features for Texture Classification, International Journal of Computer Vision, vol.62, issue.1, pp.447-460, 2010.
DOI : 10.1007/s11263-009-0315-0

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, Intl. Workshop on Stat. Learning in Comp. Vision, 2004.

O. G. Cula and K. J. Dana, Compact representation of bidirectional texture functions, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, 2001.
DOI : 10.1109/CVPR.2001.990645

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

V. Delaitre, I. Laptev, and J. Sivic, Recognizing human actions in still images: a study of bag-of-features and part-based representations, Procedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.97
URL : https://hal.archives-ouvertes.fr/hal-01060885

V. Delaitre, J. Sivic, and I. Laptev, Learning person-object interactions for action recognition in still images, Advances in Neural Information Processing Systems (NIPS), 2011.
URL : https://hal.archives-ouvertes.fr/hal-00648156

C. Desai, D. Ramanan, and C. Fowlkes, Discriminative models for static humanobject interactions, Computer Vision and Pattern Recognition (CVPR) Workshops, 2010.

R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2001.

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The PAS- CAL Visual Object Classes Challenge, 2007.

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The PAS- CAL Visual Object Classes Challenge, 2010.
DOI : 10.1007/11736790_8
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.101.6521

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The PAS- CAL Visual Object Classes Challenge, 2011.

R. Fan, K. Chang, C. Hsieh, X. Wang, and C. Lin, LIBLINEAR: A library for large linear classification, Journal of Machine Learning Research, vol.9, pp.1871-1874, 2008.

L. Fei-fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

P. Felzenszwalb, R. Girshick, D. Mcallester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp.1627-1645, 2010.
DOI : 10.1109/TPAMI.2009.167
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.2745

P. Felzenszwalb and D. P. Huttenlocher, Pictorial Structures for Object Recognition, International Journal of Computer Vision, vol.61, issue.1, pp.55-79, 2005.
DOI : 10.1023/B:VISI.0000042934.15159.49
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.12.6365

X. Feng, M. Pietikäinen, and T. Hadid, Facial expression recognition with local binary patterns and linear programming, Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp.546-548, 2005.
DOI : 10.1007/11579427_33

R. Fergus, P. Perona, and A. Zisserman, Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition, International Journal of Computer Vision, vol.20, issue.1, pp.273-303, 2007.
DOI : 10.1007/s11263-006-8707-x

M. Fischler and R. Elschlager, The Representation and Matching of Pictorial Structures, IEEE Transactions on Computers, vol.22, issue.1, pp.67-92, 1973.
DOI : 10.1109/T-C.1973.223602

D. Gao and N. Vasconcelos, Discriminant saliency for visual recognition form cluttered scenes, Advances in Neural Information Processing Systems (NIPS), 2004.

D. Gao and N. Vasconcelos, Integrated learning of saliency, complex features and object detectors from cluttered scenes, Computer Vision and Pattern Recognition (CVPR), 2005.

A. Gupta, A. Kembhavi, and L. S. Davis, Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.10, pp.1775-1789, 2009.
DOI : 10.1109/TPAMI.2009.83

T. Harada, Y. Ushiku, Y. Yamashita, and Y. Kuniyoshi, Discriminative spatial pyramid, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995691
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.662.6786

H. Harzallah, F. Jurie, and C. Schmid, Combining efficient object localization and image classification, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459257
URL : https://hal.archives-ouvertes.fr/inria-00439516

E. Hayman, B. Caputo, M. Fritz, and J. Eklundh, On the Significance of Real-World Conditions for Material Classification, European Conference on Computer Vision (ECCV), 2004.
DOI : 10.1007/978-3-540-24673-2_21

G. B. Huang, M. Ramesh, T. Berg, and E. Learned-miller, Labeled faces in the wild: A database for studying face recognition in unconstrained environments, 2007.

N. Ikizler, G. R. Cinbis, S. Pehlivan, and P. Duygulu, Recognizing actions from still images, 2008 19th International Conference on Pattern Recognition, 2008.
DOI : 10.1109/ICPR.2008.4761663
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.159.265

N. Ikizler and P. Duygulu, Histogram of oriented rectangles: A new pose descriptor for human action recognition, Image and Vision Computing, vol.27, issue.10, pp.1515-1526, 2009.
DOI : 10.1016/j.imavis.2009.02.002

N. Ikizler-cinbis, G. R. Cinbis, and S. Sclaroff, Learning actions from the Web, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459368

L. Itti, C. Koch, and E. Niebur, A model of saliency-based visual attention for rapid scene analysis, Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp.1254-1259, 1998.
DOI : 10.1109/34.730558

T. Jaakkola and D. Haussler, Exploiting generative models in discriminative classifiers, Advances in Neural Information Processing Systems (NIPS), 1998.

R. S. Javier, V. Rodrigo, and C. Mauricio, Recognition of faces in unconstrained environments: a comparative study, EURASIP Journal on Advances in Signal Processing, 2009.

F. S. Khan, J. Van-de-weijer, and M. Vanrell, Top-down color attention for object recognition, International Conference on Computer Vision, 2009.

C. Koch and S. Ullman, Shifts in Selective Visual Attention: Towards the Underlying Neural Circuitry, Human Neurobiology, vol.4, pp.219-227, 1985.
DOI : 10.1007/978-94-009-3833-5_5

J. Krapac, J. Verbeek, and F. Jurie, Learning tree-structured descriptor quantizers for image categorization, British Machine Vision Conference (BMVC), 2011.
URL : https://hal.archives-ouvertes.fr/inria-00613118

S. Lazebnik, C. Schmid, and J. Ponce, A sparse texture representation using local affine regions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.8, pp.1265-1278, 2005.
DOI : 10.1109/TPAMI.2005.151
URL : https://hal.archives-ouvertes.fr/inria-00548530

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585

A. Lehmann, B. Leibe, and L. Van-gool, Fast PRISM: Branch and Bound Hough Transform for Object Class Detection, International Journal of Computer Vision, vol.57, issue.2, pp.175-197, 2011.
DOI : 10.1007/s11263-010-0342-x

B. Leibe, A. Leonardis, and B. Schiele, Robust Object Detection with Interleaved Categorization and Segmentation, International Journal of Computer Vision, vol.73, issue.2, pp.259-289, 2008.
DOI : 10.1007/s11263-007-0095-3

T. J. Leung and J. Malik, Representing and recognizing the visual appearance of materials using three-dimensional textons, International Journal of Computer Vision, vol.43, issue.1, pp.29-44, 2001.
DOI : 10.1023/A:1011126920638

S. Liao, W. Fan, A. C. Chung, and D. Y. Yeung, Facial Expression Recognition using Advanced Local Binary Patterns, Tsallis Entropies and Global Appearance Features, 2006 International Conference on Image Processing
DOI : 10.1109/ICIP.2006.312418

D. Liu, G. Hua, P. Viola, and T. Chen, Integrated feature selection and higher-order spatial feature extraction for object categorization, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587403
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.319.2362

L. Liu, P. Fieguth, and G. Kuang, Compressed Sensing for Robust Texture Classification, Asian Conference on Computer Vision (ACCV), 2010.
DOI : 10.1007/978-3-540-24673-2_21

Y. Liu, D. Zhang, G. Lu, and W. Ma, A survey of content-based image retrieval with high-level semantics, Pattern Recognition, vol.40, issue.1, pp.262-282, 2007.
DOI : 10.1016/j.patcog.2006.04.045

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

M. J. Lyons, S. Akamatsu, M. Kamachi, and J. Gyoba, Coding facial expressions with Gabor wavelets, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998.
DOI : 10.1109/AFGR.1998.670949
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.7484

M. Marszalek, C. Schmid, H. Harzallah, and J. Van-de-weijer, Learning object representations for visual object class recognition, PASCAL Visual recognition challenge workshop, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00548669

K. Mikolajczyk and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, pp.63-86, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2
URL : https://hal.archives-ouvertes.fr/inria-00548554

K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.10, pp.1615-1630, 2005.
DOI : 10.1109/TPAMI.2005.188
URL : https://hal.archives-ouvertes.fr/inria-00548227

F. Moosmann, D. Larlus, and F. Jurie, Learning saliency maps for object categorization, European Conference on Computer Vision (ECCV) Workshops, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00203726

N. Morioka and S. Satoh, Building Compact Local Pairwise Codebook with Joint Feature Space Clustering, European Conference on Computer Vision (ECCV), 2010.
DOI : 10.1007/978-3-642-15549-9_50

N. Morioka and S. Satoh, Learning Directional Local Pairwise Bases with Sparse Coding, Procedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.32

N. Murray, M. Vanrell, X. Otazu, and C. A. Parraga, Saliency estimation using a non-parametric low-level vision model, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995506

E. Nowak, F. Jurie, and B. Triggs, Sampling Strategies for Bag-of-Features Image Classification, European Conference on Computer Vision (ECCV), 2006.
DOI : 10.1007/11744085_38
URL : https://hal.archives-ouvertes.fr/hal-00203752

T. Ojala, M. Pietikainen, and T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp.971-987, 2002.
DOI : 10.1109/TPAMI.2002.1017623

M. Pandey and S. Lazebnik, Scene recognition and weakly supervised object localization with deformable part-based models, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126383

D. Parikh, L. Zitnick, and T. Chen, Determining Patch Saliency Using Low-Level Context, European Conference on Computer Vision (ECCV), 2008.
DOI : 10.1007/978-3-540-88688-4_33

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher kernel for largescale image classification, European Conference on Computer Vision (ECCV), 2010.
URL : https://hal.archives-ouvertes.fr/inria-00548630

M. Pietikainen, A. Hadid, G. Zhao, and T. Ahonen, Computer Vision Using Local Binary Patterns, 2011.

A. Prest, C. Schmid, and V. Ferrari, Weakly Supervised Learning of Interactions between Humans and Objects, Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2011.
DOI : 10.1109/TPAMI.2011.158
URL : https://hal.archives-ouvertes.fr/inria-00516477

T. Quack, V. Ferrari, B. Leibe, and L. Van-gool, Efficient Mining of Frequent and Distinctive Feature Configurations, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408906

D. Ramanan, Learning to parse images of articulated objects, Advances in Neural Information Processing Systems (NIPS), 2006.

R. Ronfard, C. Schmid, and B. Triggs, Learning to Parse Pictures of People, European Conference on Computer Vision (ECCV), 2002.
DOI : 10.1007/3-540-47979-1_47
URL : https://hal.archives-ouvertes.fr/inria-00545109

S. Savarese, J. Winn, and A. Criminisi, Discriminative Object Class Models of Appearance and Shape by Correlatons, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.102

B. Scholkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2001.

F. Sebastiani, Machine learning in automated text categorization, ACM Computing Surveys, vol.34, issue.1, pp.1-47, 2002.
DOI : 10.1145/505282.505283
URL : http://arxiv.org/abs/cs/0110053

H. J. Seo and P. Milanfar, Face Verification Using the LARK Representation, IEEE Transactions on Information Forensics and Security, vol.6, issue.4, pp.1275-1286, 2011.
DOI : 10.1109/TIFS.2011.2159205

C. Shan, S. Gong, and P. W. Mcowan, Facial expression recognition based on Local Binary Patterns: A comprehensive study, Image and Vision Computing, vol.27, issue.6, pp.803-816, 2009.
DOI : 10.1016/j.imavis.2008.08.005

G. Sharma and F. Jurie, Learning discriminative spatial representation for image classification, Procedings of the British Machine Vision Conference 2011, 2011.
DOI : 10.5244/C.25.6
URL : https://hal.archives-ouvertes.fr/hal-00722820

G. Sharma, F. Jurie, and C. Schmid, Discriminative spatial saliency for image classification, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248093
URL : https://hal.archives-ouvertes.fr/hal-00714311

G. Sharma, S. Hussain, and F. Jurie, Local Higher-Order Statistics (LHS) for Texture Categorization and Facial Analysis, European Conference on Computer Vision (ECCV), 2012.
DOI : 10.1007/978-3-642-33786-4_1
URL : https://hal.archives-ouvertes.fr/hal-00722819

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

X. Tan and B. Triggs, Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions, TIP, vol.19, issue.6, pp.1635-1650, 2010.
DOI : 10.1007/978-3-540-75690-3_13
URL : https://hal.archives-ouvertes.fr/inria-00548674

A. M. Treisman and G. Gelade, A feature-integration theory of attention, Cognitive Psychology, vol.12, issue.1, pp.97-136, 1980.
DOI : 10.1016/0010-0285(80)90005-5

M. Turk and A. Pentland, Face recognition using eigenfaces, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1991.
DOI : 10.1109/CVPR.1991.139758

S. Hussain, Machine Learning Methods for Visual Object Detection, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00680048

E. R. Urbach, J. B. Roerdink, and M. H. Wilkinson, Connected Shape-Size Pattern Spectra for Rotation and Scale-Invariant Classification of Gray-Scale Images, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.2, pp.272-285, 2007.
DOI : 10.1109/TPAMI.2007.28

A. Vailaya, A. Jain, and H. Zhang, On image classification: city vs. landscape, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173), pp.311921-1935, 1998.
DOI : 10.1109/IVL.1998.694464

K. Valkealahti and E. Oja, Reduced multidimensional co-occurence histograms in texture classification, Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp.90-94, 1998.

M. Varma and A. Zisserman, Texture classification: Are filter banks necessary? In Computer Vision and Pattern Recognition, 2003.

M. Varma and A. Zisserman, A Statistical Approach to Texture Classification from Single Images, International Journal of Computer Vision, vol.62, issue.1-2, pp.61-81, 2005.
DOI : 10.1007/s11263-005-4635-4

A. Vedaldi and B. Fulkerson, Vlfeat, Proceedings of the international conference on Multimedia, MM '10, 2008.
DOI : 10.1145/1873951.1874249

A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman, Multiple kernels for object detection, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459183

A. Vedaldi and A. Zisserman, Efficient additive kernels using explicit feature maps, Computer Vision and Pattern Recognition (CVPR), 2010.

M. Wang, J. Konrad, P. Ishwar, K. Jing, and H. Rowley, Image saliency: From intrinsic to extrinsic context, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995743

L. Wolf, T. Hassner, and Y. Taigman, Similarity Scores Based on Background Samples, Asian Conference on Computer Vision (ACCV), 2009.
DOI : 10.1007/978-3-642-12304-7_9

J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba, SUN database: Large-scale scene recognition from abbey to zoo, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539970

Y. Xu, H. Ji, and C. Fermuller, Viewpoint Invariant Texture Description Using Fractal Analysis, International Journal of Computer Vision, vol.27, issue.2, pp.85-100, 2009.
DOI : 10.1007/s11263-009-0220-6

Y. Xu, X. Yang, H. Ling, and H. Ji, A new texture descriptor using multifractal analysis in multi-orientation wavelet pyramid, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540217

J. Yang, K. Yu, Y. Gong, and T. Huang, Linear spatial pyramid matching using sparse coding for image classification, Computer Vision and Pattern Recognition, 2009.

J. Yang, K. Yu, and T. Huang, Efficient Highly Over-Complete Sparse Coding Using a Mixture Model, European Conference on Computer Vision (ECCV), 2010.
DOI : 10.1007/978-3-642-15555-0_9

W. Yang, Y. Wang, and G. Mori, Recognizing human actions from still images with latent poses, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539879

Y. Yang and D. Ramanan, Articulated pose estimation with flexible mixtures-ofparts, Computer Vision and Pattern Recognition (CVPR), pp.1385-1392, 2011.

B. Yao and L. Fei-fei, Grouplet: A structured image representation for recognizing human and object interactions, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540234

B. Yao and L. Fei-fei, Action Recognition with Exemplar Based 2.5D Graph Matching, ECCV, 2012.
DOI : 10.1007/978-3-642-33765-9_13

B. Yao and L. Fei-fei, Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses, Transactions on Pattern Analysis and Machine Intelligence, p.2012

B. Yao, A. Khosla, and L. Fei-fei, Combining randomization and discrimination for fine-grained image categorization, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995368

J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4
URL : https://hal.archives-ouvertes.fr/inria-00548574

X. Zhou, N. Cui, Z. Li, F. Liang, and T. Huang, Hierarchical Gaussianization for image classification, International Conference on Computer Vision, 2009.

X. Zhou, K. Yu, T. Zhang, and T. S. Huang, Image Classification Using Super-Vector Coding of Local Image Descriptors, European Conference on Computer Vision (ECCV), 2010.
DOI : 10.1007/978-3-642-15555-0_11

S. C. Zhu, Y. Wu, and D. Mumford, Filters, random-fields and maximum-entropy (FRAME): Towards a unified theory for texture modeling, International Journal of Computer Vision, vol.27, issue.2, pp.107-126, 1998.
DOI : 10.1023/A:1007925832420

X. Zhu and D. Ramanan, Face detection, pose estimation, and landmark localization in the wild, Computer Vision and Pattern Recognition (CVPR), pp.2879-2886, 2012.