Tobii Studio User's Manual Version 3.4.5 (cit, p.83, 2016. ,
FoodLog: Multimedia Tool for Healthcare Applications, IEEE MultiMedia, vol.22, pp.4-8, 2015. ,
Food Balance Estimation by Using Personal Dietary Tendencies in a Multimedia Food Log, IEEE Transactions on Multimedia, vol.40, p.22, 2013. ,
Frequency Statistics of Words Used in Japanese Food Records of FoodLog, ACM UbiComp, p.23, 2014. ,
Multiple Instance Classification: Review, Taxonomy and Comparative Study, Artif. Intell, vol.201, p.35, 2013. ,
Support Vector Machines for Multiple-Instance Learning, Advances in Neural Information Processing Systems (NIPS), p.34, 2002. ,
NetVLAD: CNN Architecture for Weakly Supervised Place Recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.5297-5307, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01242052
Pooling in Image Representation: the Visual Codeword Point of View, Computer Vision and Image Understanding, vol.54, p.46, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01172709
Spotlight the Negatives: A Generalized Discriminative Latent Model, British Machine Vision Conference, pp.1-11, 2015. ,
Multiple Instance Learning: Algorithms and Applications, p.35, 2009. ,
In: Computer Vision and Image Understanding 110.3. Similarity Matching in Computer Vision and Multimedia, p.15, 2008. ,
Menu-Match: Restaurant-Specific Food Logging from Images, 2015 IEEE Winter Conference on Applications of Computer Vision, p.22, 2015. ,
DOI : 10.1109/wacv.2015.117
MUTAN: Multimodal Tucker Fusion for Visual Question Answering, p.96, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-02073637
Leveraging Context to Support Automated Food Recognition in Restaurants, Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision. WACV '15, p.22, 2015. ,
Active labeling application applied to food-related object recognition, Bolã nos, Marc, M Garolera, and P Radeva, vol.106, p.23, 2013. ,
Defending Yarbus: Eye movements reveal observers' task, In: Journal of Vision, vol.14, p.30, 2014. ,
DOI : 10.1167/14.3.29
URL : https://jov.arvojournals.org/data/journals/jov/932817/i1534-7362-14-3-29.pdf
Food-Mining-101 Discriminative Components with Random Forests, vol.23, pp.39-41, 2014. ,
DOI : 10.1007/978-3-319-10599-4_29
Eye Movement Analysis for Activity Recognition Using Electrooculography, IEEE Trans. Pattern Anal. Mach. Intell, vol.33, p.31, 2011. ,
Master's Thesis : Deep Learning for Visual Recognition, p.47, 2016. ,
Multiple Instance Learning: A Survey of Problem Characteristics and Applications, p.35, 2016. ,
Return of the Devil in the Details: Delving Deep into Convolutional Nets, British Machine Vision Conference (BMVC), 2014. ,
Cross-Modal Recipe Retrieval: How to Cook this Dish?, In: MultiMedia Modeling: 23rd International Conference, MMM 2017, p.22, 2017. ,
PFID: Pittsburgh fast-food image dataset, In: ICIP (cit, vol.21, p.20, 2009. ,
Food Recognition for Dietary Assessment Using Deep Convolutional Neural Networks, New Trends in Image Analysis and Processing-ICIAP 2015 Workshops-ICIAP 2015 International Workshops: BioFor, p.22, 2015. ,
White paper: Cisco VNI Forecast and Methodology, p.1, 2016. ,
Image retrieval using long-term semantic learning, IEEE, p.54, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00520307
Visual Categorization with Bags of Keypoints, Workshop on Statistical Learning in Computer Vision. ECCV (cit, p.15, 2004. ,
Histograms of oriented gradients for human detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol.1, p.15, 2005. ,
URL : https://hal.archives-ouvertes.fr/inria-00548512
You-Do, I-Learn: Egocentric unsupervised discovery of objects and their modes of interaction towards video-based guidance, Computer Vision and Image Understanding, vol.149, p.31, 2016. ,
ImageNet: A Large-Scale Hierarchical Image Database, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.248-255, 2009. ,
A Conditional Random Field for Multiple-Instance Learning, Proceedings of the 27th International Conference on Machine Learning (ICML-10). Ed. by Johannes F ¨ urnkranz and Thorsten Joachims. Omnipress, p.35, 2010. ,
Solving the multiple instance problem with axis-parallel rectangles, Artificial Intelligence, vol.89, p.33, 1997. ,
MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking, International Conference on Computer Vision, pp.2713-2721, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01343784
WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks, IEEE Conference on Computer Vision and Pattern Recognition, pp.4743-4752, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01343785
Incremental learning of latent structural SVM for weakly supervised image classification, IEEE International Conference on Image Processing, p.35, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01077058
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01515640
Comparative Study of Fixation Density Maps, IEEE Transactions on Image Processing, vol.22, p.31, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00757423
The Pascal Visual Object Classes Challenge: A Retrospective, In: International Journal of Computer Vision, vol.111, pp.98-136, 2015. ,
LIBLINEAR: A library for large linear classification, In: JMLR (cit, p.45, 2008. ,
Classifying food images represented as Bag of Textons, 2014 IEEE International Conference on Image Processing (ICIP), p.22, 2014. ,
A Benchmark Dataset to Study Representation of Food Images, In: ECCV workshop, vol.21, p.20, 2014. ,
Latent Semantic Analysis of Facial Action Codes for Automatic Facial Expression Recognition, Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval. MIR '04, p.15, 2004. ,
Learning to Recognize Daily Actions Using Gaze, European Conference on Computer Vision, pp.314-327, 2012. ,
What do we perceive in a glance of a real-world scene?, In: Journal of Vision, vol.7, p.65, 2007. ,
Object Detection with Discriminatively Trained Part-Based Models, IEEE Trans. Pattern Anal. Mach. Intell, vol.32, pp.1627-1645, 2010. ,
A review of multi-instance learning assumptions, In: The Knowledge Engineering Review, vol.25, p.35, 2010. ,
Backpropagation algorithm for relevance feedback in image retrieval, Proceedings. 2001 International Conference on, vol.1, p.54, 2001. ,
Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position, In: Biological Cybernetics, vol.36, p.4, 1980. ,
Multi-instance learning using recurrent neural networks, The 2012 International Joint Conference on Neural Networks (IJCNN), p.35, 2012. ,
Action classification in still images using human eye movements, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.16-23, 2015. ,
PET: An eye-tracking dataset for animal-centric Pascal object classes, IEEE International Conference on Multimedia and Expo, p.32, 2015. ,
Actions and Attributes from Wholes and Parts, International Conference on Computer Vision (ICCV), p.86, 2015. ,
Multiscale Orderless Pooling of Deep Convolutional Activation Features, European Conference on Computer Vision (ECCV), pp.392-407, 2014. ,
Deep Fishing: Gradient Features from Deep Nets, British Machine Vision Conference, vol.87, p.86, 2015. ,
SALSAS: Sub-linear active learning strategy with approximate k-NN search, Pattern Recognition 44, vol.10, p.54, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00773102
Active learning methods for interactive image retrieval, Image Processing, p.54, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-00520292
RETIN AL: An active learning strategy for image category retrieval, Image Processing, vol.4, p.54, 2004. ,
URL : https://hal.archives-ouvertes.fr/hal-00520315
Multi-Instance Kernels, Proc. 19th International Conf. on Machine Learning, p.33, 2002. ,
Visual Perception and Sequences of Eye Movement Fixations: A Stochastic Modeling Approach, IEEE Transactions on Systems, Man and Cybernetics, vol.22, p.31, 1992. ,
An inverse Yarbus process: Predicting observers' task from eye movement patterns, Vision Research 103, p.30, 2014. ,
Food Image Recognition Using Very Deep Convolutional Networks, Proceedings of the 2Nd International Workshop on Multimedia Assisted Dietary Management. MADiMa '16, vol.40, p.23, 2016. ,
DietCam: Multiview Food Recognition Using a Multikernel SVM, IEEE Journal of Biomedical and Health Informatics, vol.20, pp.20-22, 2016. ,
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision, ICCV 2015, pp.1026-1034, 2015. ,
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, IEEE Trans. Pattern Anal. Mach. Intell, vol.37, issue.6, pp.1904-1916, 2015. ,
Deep Residual Learning for Image Recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.18, pp.770-778, 2016. ,
Analysis of food images: Features and classification, 2014 IEEE International Conference on Image Processing (ICIP), vol.21, p.20, 2014. ,
Modeling Restaurant Context for Food Recognition, IEEE Transactions on Multimedia 19, vol.2, p.22, 2017. ,
Multiple instance learning : foundations and algorithms, Dánel Sánchez-TarragóTarrag´Tarragó, and Sarah Vluymans, p.35, 2016. ,
Regularized max pooling for image categorization, British Machine Vision Conference (BMVC), p.86, 2014. ,
DOI : 10.5244/c.28.32
URL : http://www.bmva.org/bmvc/2014/files/abstract072.pdf
Image Recognition of 85 Food Categories by Feature Fusion, 2010 IEEE International Symposium on Multimedia, p.22, 2010. ,
Receptive Fields, Binocular Interaction, and Functional Architecture in the Cat's Visual Cortex, In: Journal of Physiology, vol.160, pp.106-154, 1962. ,
The psychology and pedagogy of reading, p.24, 1908. ,
PinView: Implicit Feedback in Content-Based Image Retrieval, p.31, 2014. ,
Image Representations and New Domains in Neural Image Captioning, EMNLP Vision + Learning workshop, p.22, 2015. ,
Eye Tracking in Human-Computer Interaction and Usability Research: Ready to Deliver the Promises, p.30, 2003. ,
In: Virtual environments and advanced interface design, p.30, 1995. ,
Deep-based Ingredient Recognition for Cooking Recipe Retrieval, In: ACMMM (cit. on pp. 9, vol.40, pp.20-23, 2016. ,
Cuttingplane training of structural SVMs, Machine Learning 77.1, p.61, 2009. ,
Blocks That Shout: Distinctive Parts for Scene Classification, IEEE Conference on Computer Vision and Pattern Recognition, p.34, 2013. ,
A theory of reading: From eye fixations to comprehension, In: Psychological Review, vol.87, p.24, 1980. ,
Food Detection and Recognition Using Convolutional Neural Network, Proceedings of the 22Nd ACM International Conference on Multimedia. MM '14, p.22, 2014. ,
DOI : 10.1145/2647868.2654970
From Where and How to What We See, International Conference on Computer Vision, pp.625-632, 2013. ,
DOI : 10.1109/iccv.2013.83
Eye tracking assisted extraction of attentionally important objects from videos, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.31, 2015. ,
Markov models for ocular fixation locations in the presence and absence of colour, In: Journal of the Royal Statistical Society: Series C (Applied Statistics, p.31, 2017. ,
Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation, Proc. of ECCV Workshop on TASK-CV (cit, p.41, 2014. ,
Food image recognition with deep convolutional features, In: ACM UbiComp (cit. on pp, vol.22, 2014. ,
FoodCam-256: A Large-scale Realtime Mobile Food RecognitionSystem employing High-Dimensional Features and Compression of Classifier Weights, ACM International Conference on Multimedia, pp.761-762, 2014. ,
FoodCam-256: A Large-scale Realtime Mobile Food RecognitionSystem Employing High-Dimensional Features and Compression of Classifier Weights, Proceedings of the 22Nd ACM International Conference on Multimedia. MM '14, pp.761-762, 2014. ,
FoodCam: A real-time food recognition system on a smartphone, Multimedia Tools and Applications 74, vol.14, p.23, 2015. ,
FOOD IMAGE RECOGNITION USING DEEP CONVOLUTIONAL NETWORK WITH PRE-TRAINING AND FINETUNING, IEEE Internatinal Conference on Multimedia and Exposition, workshop CEA, 2015. ,
An Enhanced Bag-of-Visual Word Vector Space Model to Represent Visual Content in Athletics Images, IEEE Transactions on Multimedia, issue.1, p.16, 2012. ,
An Overview of the Technology Assisted Dietary Assessment Project at Purdue University, Proc. IEEE Int. Symp. Multimedia (cit, p.24, 2010. ,
Image processing based approach to food balance analysis for personal food logging, 2010 IEEE International Conference on Multimedia and Expo, vol.40, p.22, 2010. ,
, , 2008.
, Can relevance of images be inferred from eye movements?, In: Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, p.26, 2008.
Eye Tracking for Everyone, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (cit, p.31, 2016. ,
ImageNet Classification with Deep Convolutional Neural Networks, Advances in Neural Information Processing Systems (NIPS), vol.18, pp.1097-1105, 2012. ,
Saliency Unified: A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation, IEEE Conference on Computer Vision and Pattern Recognition, p.31, 2016. ,
Self-Paced Learning for Latent Variable Models, Advances in Neural Information Processing Systems (NIPS), p.35, 2010. ,
Distributed Representations of Sentences and Documents, In: ICML (cit, p.51, 2014. ,
Visual Attention and Applications in Multimedia Technologies, Proceedings of the IEEE 101.9, p.30, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00951728
What we see is most likely to be what matters: Visual attention and applications, 16th IEEE International Conference on Image Processing (ICIP), p.30, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00441011
A coherent computational approach to model bottom-up visual attention, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, p.30, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00669578
Predicting visual fixations on video based on low-level visual features, Vision Research 47, vol.19, p.96, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-00287424
Labeled Faces in the Wild: A Survey, Advances in Face Detection and Facial Image Analysis, pp.189-248, 2016. ,
Backpropagation applied to handwritten zip code recognition, Neural computation 1.4, vol.18, pp.541-551, 1989. ,
Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification, Advances in Neural Information Processing Systems, p.66, 2010. ,
Multiple instance learning for soft bags via top instances, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.4277-4285, 2015. ,
Investigating the Bag-of-words Method for 3D Shape Retrieval, In: EURASIP J. Adv. Signal Process, vol.5, p.16, 2010. ,
Microsoft COCO: Common Objects in Context, Computer Vision-ECCV 2014: 13th European Conference, pp.740-755, 2014. ,
DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment, Inclusive Smart Cities and Digital Health-14th International Conference on Smart Homes and Health Telematics, ICOST 2016, pp.37-48, 2016. ,
One gaze is worth ten thousand (key-)words, IEEE International Conference on Image Processing (ICIP), vol.32, p.31, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01323204
Distinctive Image Features from Scale-Invariant Keypoints, In: Int. J. Comput. Vision, vol.60, issue.2, pp.91-110, 2004. ,
A PROBABILISTIC MODEL FOR FOOD IMAGE RECOGNITION IN RESTAURANTS, IEEE Internatinal Conference on Multimedia and Exposition, pp.20-22, 2015. ,
Regularization of Neural Networks using DropConnect, In: ICML (cit, p.19, 2013. ,
NeTra: A Toolbox for Navigating Large Image Databases, In: Multimedia Syst, p.3, 1999. ,
Eye Tracking and Eye-Based HumanComputer Interaction, p.30, 2014. ,
A framework for multiple-instance learning, Advances in Neural Information Processing Systems (NIPS) (cit, p.33, 1998. ,
Multiple-Instance Learning for Natural Scene Classification, In: ICML (cit, p.33, 1998. ,
Reinforcement Learning for Visual Object Detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.56, p.35, 2016. ,
Action from Still Image Dataset and Inverse Optimal Control to Learn Task Specific Visual Scanpaths, Advances in Neural Information Processing Systems, vol.63, pp.1923-1931, 2013. ,
Multiple Instance Reinforcement Learning for Efficient Weakly-Supervised Detection in Images, vol.56, p.35, 2014. ,
Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition, IEEE Trans. Pattern Anal. Mach. Intell. 37, vol.7, p.31, 2015. ,
Multiple-food recognition considering co-occurrence employing manifold ranking, In: ICPR (cit, p.22, 2012. ,
Tastes and Textures Estimation of Foods Based on the Analysis of Its Ingredients List and Image, New Trends in Image Analysis and Processing-ICIAP 2015 Workshops: ICIAP 2015 International Workshops, p.22, 2015. ,
Tutorial: Visual Learning with Weak Supervision, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013. ,
ORIENTING OF ATTENTION, The Quarterly Journal of Experimental Psychology, vol.32, p.24, 1980. ,
Distributed representations of words and phrases and their compositionality, NIPS (cit, vol.50, p.40, 2013. ,
Being a Super Cook: Joint Food Attributes and Multi-Modal Content Modeling for Recipe Retrieval and Exploration, IEEE Transactions on Multimedia X.XX, pp.20-22, 2016. ,
Active segmentation with fixation, IEEE International Conference on Computer Vision, p.31, 2009. ,
Image-based Calorie Content Estimation for Dietary Assessment, 2011 IEEE International Symposium on Multimedia, p.22, 2011. ,
Deformable Part-based Fully Convolutional Network for Object Detection, Proceedings of the British Machine Vision Conference (BMVC) (cit, p.96, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01637070
Im2Calories: Towards an automated mobile vision food diary, Proceedings of the IEEE International Conference on Computer Vision, vol.11, pp.20-23, 2016. ,
Does where you Gaze on an Image Affect your Perception of Quality? Applying Visual Attention to Image Quality Metric, 2007 IEEE International Conference on Image Processing, vol.2, p.30, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-00342599
Considering Temporal Variations of Spatial Visual Distortions in Video Quality Assessment, IEEE Journal of Selected Topics in Signal Processing, issue.2, p.30, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00345895
Platemate: Crowdsourcing Nutritional Analysis from Food Photographs, Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology. UIST '11, p.23, 2011. ,
A mobile, lightweight, poll-based food identification system, Pattern Recognition 47, vol.5, pp.1941-1952, 2014. ,
The Tobii I-VT Fixation Filter (cit, p.80, 2012. ,
Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks, IEEE Conference on Computer Vision and Pattern Recognition, vol.87, pp.1717-1724, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00911179
Shallow and Deep Convolutional Networks for Saliency Prediction, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.598-606, 2016. ,
Scene recognition and weakly supervised object localization with deformable part-based models, IEEE International Conference on Computer Vision, p.34, 2011. ,
Training Object Class Detectors from Eye Tracking Data, European Conference on Com-puter Vision (ECCV), vol.32, p.63, 2014. ,
Gaze-Based Relevance Feedback for Realizing Region-Based Image Retrieval, IEEE Transactions on Multimedia 16, p.31, 2014. ,
Image retrieval over networks: Active learning using ant algorithm, Multimedia, IEEE Transactions on 10, vol.7, p.54, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-00656363
Measuring Calorie and Nutrition From Food Image, IEEE Transactions on Instrumentation and Measurement, vol.63, p.22, 2014. ,
FooDD: Food Detection Dataset for Calorie Measurement Using Food Images, New Trends in Image Analysis and Processing-ICIAP 2015 Workshops: ICIAP 2015 International Workshops, pp.20-22, 2015. ,
Robust Higher Order Potentials for Enforcing Label Consistency, In: Int. J. Comput. Vision, vol.82, p.31, 2009. ,
Can computers learn from humans to see better?: inferring scene semantics from viewers' eye movements, International Conference on Multimedia, p.30, 2011. ,
An Eye Fixation Database for Saliency Detection in Images, European Conference on Computer Vision, p.31, 2010. ,
Generative Adversarial Text-to-Image Synthesis, Proceedings of The 33rd International Conference on Machine Learning, p.97, 2016. ,
Weakly Supervised Large Scale Object Localization with Multiple Instance Learning and Bag Splitting, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, pp.405-416, 2016. ,
A database for fine grained activity detection of cooking activities, In: CVPR (cit, p.21, 2012. ,
The perceptron, a perceiving and recognizing automaton Project Para, 1957. ,
Object-Centric Spatial Pooling for Image Classification, Computer Vision-ECCV 2012: 12th European Conference on Computer Vision, pp.1-15, 2012. ,
Object-Centric Spatial Pooling for Image Classification, European Conference on Computer Vision, p.35, 2012. ,
What's the point: Semantic segmentation with point supervision, 2016. ,
Introduction to Modern Information Retrieval, p.3, 1986. ,
Learning Cross-modal Embeddings for Cooking Recipes and Food Images, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.22, 2017. ,
The IBM 2016 English conversational telephone speech recognition system, pp.7-11, 2016. ,
Predicting the Category and Attributes of Mental Pictures Using Deep Gaze Pooling, vol.97, p.31, 2016. ,
Prediction of Search Targets From Fixations in Open-World Settings, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
Harvesting Image Databases from the Web, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, p.41, 2011. ,
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization, Advances in Neural Information Processing Systems, pp.2409-2417, 2013. ,
, GazeDPM: Early Integration of Gaze Information in Deformable Part Models, vol.35, p.31, 2015.
Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images, Pattern Recognition, vol.52, p.34, 2016. ,
Generalized Dictionaries for Multiple Instance Learning, In: Int. J. Comput. Vision, vol.114, issue.2-3, p.34, 2015. ,
Mastering the game of Go with deep neural networks and tree search, pp.484-489, 2016. ,
Very deep convolutional networks for large-scale image recognition, 2015. ,
Video Google: A Text Retrieval Approach to Object Matching in Videos, International Conference on Computer Vision (ICCV), 2003. ,
Content-Based Image Retrieval at the End of the Early Years, IEEE Trans. Pattern Anal. Mach. Intell. 22, vol.12, issue.2, pp.1349-1380, 2000. ,
Contextualizing object detection and classification, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol.87, p.86, 2011. ,
DOI : 10.1109/cvpr.2011.5995330
URL : https://pure.qub.ac.uk/portal/files/148946383/Contextualizing_Object_Detection_and_Classification_TPAMI2015.pdf
Dropout: A Simple Way to Prevent Neural Networks from Overfitting, In: J. Mach. Learn. Res, vol.15, p.19, 2014. ,
Discovery of Everyday Human Activities from Long-term Visual Behaviour Using Topic Models, Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. UbiComp '15, p.31, 2015. ,
User-adaptive Models for Recognizing Food Preparation Activities, ACM MM workshop CEA, p.21, 2013. ,
DOI : 10.1145/2506023.2506031
Automatic Recipe Cuisine Classification by Ingredients, Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication. UbiComp '14 Adjunct, p.22, 2014. ,
DOI : 10.1145/2638728.2641335
Crowdsourcing Annotations for Visual Object Detection, AAAI Workshop, p.31, 2012. ,
Learning Discriminative Part Detectors for Image Classification and Cosegmentation, International Conference on Computer Vision (ICCV), p.34, 2013. ,
DOI : 10.1109/iccv.2013.422
URL : https://hal.archives-ouvertes.fr/hal-00932380
Going Deeper with Convolutions, Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/cvpr.2015.7298594
URL : http://arxiv.org/pdf/1409.4842
Rethinking the Inception Architecture for Computer Vision, 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, p.40, 2016. ,
A Bag of Words Approach for 3D Object Categorization, Proceedings of the 4th International Conference on Computer Vision/Computer Graphics CollaborationTechniques. MIRAGE '09, p.16, 2009. ,
Learning Using Privileged Information: Similarity Control and Knowledge Transfer, In: J. Mach. Learn. Res, vol.16, p.63, 2015. ,
DOI : 10.1007/978-3-319-17091-6_1
Attention Is All You Need, In: Arxiv (cit, p.96, 2017. ,
Saliency-based selection of sparse descriptors for action recognition, IEEE International Conference on Image Processing (ICIP) (cit, p.26, 2012. ,
Show and Tell: A Neural Image Caption Generator, In: CVPR (cit, p.97, 2015. ,
DOI : 10.1109/cvpr.2015.7298935
URL : http://arxiv.org/pdf/1411.4555
3d object detection with multiple kinects, Computer Vision-ECCV 2012. Workshops and Demonstrations, p.21, 2012. ,
DOI : 10.1007/978-3-642-33868-7_10
Can You See It? Two Novel Eye-Tracking-Based Measures for Assigning Tags to Image Regions, Advances in Multimedia Modeling, International Conference, p.31, 2013. ,
DOI : 10.1007/978-3-642-35725-1_4
The attraction of visual attention to texts in real-world scenes, In: Journal of Vision, vol.12, p.80, 2012. ,
Computational Model of Stereoscopic 3D Visual Saliency, IEEE Transactions on Image Processing, vol.22, p.31, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00788847
Bag-of-Features Based Medical Image Retrieval via Multiple Assignment and Visual Words Weighting, IEEE Trans. Med. Imaging, vol.30, p.15, 2011. ,
Solving the Multiple-Instance Problem: A Lazy Learning Approach, Proceedings of the Seventeenth International Conference on Machine Learning. ICML '00, p.33, 2000. ,
Quantifying the relationship between visual salience and visual importance, vol.7527, p.30, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00494592
Gaze latent support vector machine for image classification, IEEE International Conference on Image Processing (ICIP), vol.12, pp.236-240, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01342580
Gaze Latent Support Vector Machine for Image Classification Improved by Weakly Supervised Region Selection, Pattern Recognition, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01557368
Recipe recognition with large multimodal food dataset, IEEE International Conference on Multimedia & Expo Workshops, pp.1-6, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01196959
Max-Margin Multiple-Instance Dictionary Learning, International Conference on Machine Learning, p.34, 2013. ,
Relaxed Multiple-Instance SVM with Application to Object Discovery, International Conference on Computer Vision (ICCV), p.34, 2015. ,
Revisiting Multiple Instance Neural Networks, p.33, 2016. ,
Human Action Recognition by Semilatent Topic Models, IEEE Transactions on Pattern Analysis and Machine Intelligence 31, vol.10, p.15, 2009. ,
Fast food recognition from videos of eating for calorie estimation, 2009 IEEE International Conference on Multimedia and Expo, p.22, 2009. ,
A Hybrid Semantic Item Model for Recipe Search by Example, 2010 IEEE International Symposium on Multimedia, p.22, 2010. ,
Gaze-enabled egocentric video summarization via constrained submodular maximization, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.31, 2015. ,
Geolocalized Modeling for Dish Recognition, IEEE Transactions on Multimedia 17, vol.8, p.22, 2015. ,
Geolocalized Modeling for Dish Recognition, IEEE Transactions on Multimedia 17, vol.8, p.20, 2015. ,
Food recognition using statistics of pairwise local features, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol.40, p.22, 2010. ,
Review Networks for Caption Generation, In: NIPS (cit, p.96, 2016. ,
Eye Movements and Vision, Plenum, vol.25, p.24, 1967. ,
No-Reference Image Quality Assessment Using Visual Codebooks, IEEE Transactions on Image Processing 21, vol.7, p.16, 2012. ,
How transferable are features in deep neural networks?, In: Advances in Neural Information Processing Systems (NIPS), pp.3320-3328, 2014. ,
Learning structural SVMs with latent variables, In: ICML (cit, p.96, 2009. ,
The Concave-Convex Procedure (CCCP), Advances in Neural Information Processing Systems, vol.14, p.62, 2001. ,
Studying Relationships between Human Gaze, Description, and Computer Vision, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.739-746, 2013. ,
Visualizing and Understanding Convolutional Networks, European Conference on Computer Vision, vol.87, p.86, 2014. ,
Eye can read your mind: Decoding gaze fixations to reveal categorical search targets, In: Journal of Vision, vol.13, p.31, 2013. ,
Improve Multi-Instance Neural Networks through Feature Selection, In: Neural Processing Letters, vol.19, p.33, 2004. ,
EM-DD: An Improved Multiple-Instance Learning Technique, Advances in Neural Information Processing Systems, p.33, 2001. ,
The Application of Visual Saliency Models in Objective Image Quality Assessment: A Statistical Evaluation, IEEE Transactions on Neural Networks and Learning Systems 27, vol.6, p.30, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01675144
Fine-Grained Image Classification by Exploring Bipartite-Graph Labels, 2016 IEEE Conference on Computer Vision and Pattern Recognition, p.22, 2016. ,
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation, Transactions of the Association for Computational Linguistics (TACL), vol.4, pp.371-383, 2016. ,
Multi-Instance Learning: A Survey, rep. National Laboratory for Novel Software Technology (cit, p.35, 2004. ,
Multi-instance Learning by Treating Instances As non-I.I.D. Samples, Proceedings of the 26th Annual International Conference on Machine Learning. ICML '09, p.35, 2009. ,
Ensembles of Multi-instance Learners, Machine Learning: ECML 2003: 14th European Conference on Machine Learning, p.33, 2003. ,