V. Ordonez, G. Kulkarni, and T. L. Berg, Im2text: Describing images using 1 million captioned photographs, Advances in Neural Information Processing Systems, 2011.

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, Proceedings of the European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

R. Szeliski, Computer Vision: Algorithms and Applications, 2011.
DOI : 10.1007/978-1-84882-935-0

V. Vapnik, The Nature of Statistical Learning Theory, 1995.