Deep learning with differential privacy, Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, CCS '16, pp.308-318, 2016. ,
Learning from hints in neural networks, Journal of Complexity, vol.6, issue.2, pp.192-198, 1990. ,
A method for learning from hints, Advances in Neural Information Processing Systems, vol.5, pp.73-80, 1992. ,
Hints and the vc dimension, Neural Computation, vol.5, issue.2, pp.278-288, 1993. ,
Learning From Data, 2012. ,
A learning algorithm for Boltzmann machines, Cognitive Science, vol.9, pp.147-169, 1985. ,
Data Classification: Algorithms and Applications, 2014. ,
Multi-Valued and Universal Binary Neurons: Theory, Learning and Applications, 2000. ,
Intelligent Systems II: Complete Approximation by Neural Network Operators, 2016. ,
, Intelligent Mathematics II: Applied Mathematics and Approximation Theory, vol.441, 2016.
Approximation theory. moduli of continuity and global smoothness preservation, 2002. ,
Deep reinforcement learning using capsules in advanced game environments, 2018. ,
Multi-task feature learning, Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, pp.41-48, 2006. ,
A spectral regularization framework for multi-task structure learning, Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, pp.25-32, 2007. ,
A closer look at memorization in deep networks, ICML, vol.70, pp.233-242, 2017. ,
Deep reinforcement learning: A brief survey, IEEE Signal Processing Magazine, vol.34, issue.6, pp.26-38, 2017. ,
Sarcopenic obesity and risk of cardiovascular disease and mortality: a population-based cohort study of older men, Journal of the American Geriatrics Society, vol.62, issue.2, pp.253-60, 2014. ,
Joint language and translation modeling with recurrent neural networks, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, vol.2013, pp.1044-1054, 2013. ,
Do deep nets really need to be deep?, Proceedings of the 27th International Conference on Neural Information Processing Systems, vol.2, pp.2654-2662, 2014. ,
Neural machine translation by jointly learning to align and translate, 2014. ,
Document image defect models, Proceddings, IAPR Workshop on Syntactic and Structural Pattern Recognition, 1990. ,
Exploiting the past and the future in protein secondary structure prediction, Bioinformatics, vol.15, issue.11, pp.937-946, 1999. ,
Modular learning in neural networks, Proc. AAAI, pp.279-284, 1987. ,
Deep learning with non-medical training used for chest pathology identification, Proc. SPIE, Medical Imaging: Computer-Aided Diagnosis, vol.9414, pp.94140-94147, 2015. ,
Universal approximation bounds for superpositions of a sigmoidal function. Information Theory, IEEE Transactions on, vol.39, issue.3, pp.930-945, 1993. ,
A model of inductive bias learning, J. Artif. Int. Res, vol.12, issue.1, pp.149-198, 2000. ,
Structured prediction energy networks, Proceedings of the 33nd International Conference on Machine Learning, pp.983-992, 2016. ,
Learning structured output dependencies using deep neural networks, Deep Learning Workshop in the 32nd International Conference on Machine Learning (ICML), 2015. ,
A unified neural based model for structured output problems, Conférence Francophone sur l'Apprentissage Automatique (CAP), 2015. ,
Neural networks regularization through class-wise invariant representation learning, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-02129472
Spotting l3 slice in ct scans using deep convolutional network and transfer learning, Computers in Biology and Medicine, vol.87, pp.95-103, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01643960
Spotting l3 slice in ct scans using deep convolutional network and transfer learning, Computers in Biology and Medicine, vol.87, pp.95-103, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01643960
Deep multi-task learning with evolving weights, European Symposium on Artificial Neural Networks (ESANN), 2016. ,
Pondération dynamique dans un cadre multi-tâche pour réseaux de neurones profonds, Apprentissage et, 2016. ,
Deep neural networks regularization for structured output prediction, Neurocomputing, vol.281, pp.169-177, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02094963
Deep multi-task learning with evolving weights, European Symposium on Artificial Neural Networks (ESANN), 2016. ,
Localizing parts of faces using a consensus of exemplars, CVPR, pp.545-552, 2011. ,
Dynamic Programming, 1957. ,
A theory of learning from different domains, Machine Learning, vol.79, pp.151-175, 2010. ,
Analysis of representations for domain adaptation, Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, pp.137-144, 2006. ,
A notion of task relatedness yielding provable multipletask learning guarantees, Machine Learning, vol.73, pp.273-287, 2008. ,
A theoretical framework for learning from a pool of disparate data sources, KDD, pp.443-449, 2002. ,
Exploiting task relatedness for multiple task learning, Learning Theory and Kernel Machines, pp.567-580, 2003. ,
Learning Deep Architectures for AI. Found, Trends Mach. Learn, vol.2, issue.1, pp.1-127, 2009. ,
Practical recommendations for gradient-based training of deep architectures, Neural Networks: Tricks of the Trade, vol.7700, pp.437-478, 2012. ,
Deep learning of representations: Looking forward, 2013. ,
Representation learning: A review and new perspectives, IEEE Trans. Pattern Anal. Mach. Intell, vol.35, issue.8, pp.1798-1828, 2013. ,
Representation Learning: A Review and New Perspectives, IEEE PAMI, vol.35, issue.8, pp.1798-1828, 2013. ,
Greedy layer-wise training of deep networks, Advances in Neural information Processing Systems, vol.19, pp.153-160, 2006. ,
Greedy Layer-Wise Training of Deep Networks, pp.153-160, 2007. ,
Scaling learning algorithms towards AI, 2007. ,
Learning long-term dependencies with gradient descent is difficult, Trans. Neur. Netw, vol.5, issue.2, pp.157-166, 1994. ,
Application of high-dimensional feature selection: evaluation for genomic prediction in man, Scientific reports, vol.5, p.10312, 2015. ,
An algorithm that learns what's in a name, Machine learning, vol.34, issue.1-3, pp.211-231, 1999. ,
Regularization and complexity control in feed-forward networks, Proceedings International Conference on Artificial Neural Networks ICANN'95, vol.1, pp.141-148, 1995. ,
Learning bounds for domain adaptation, Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, pp.129-136, 2007. ,
Multi-task gaussian process prediction, Advances in Neural Information Processing Systems 20, Proceedings of the TwentyFirst Annual Conference on Neural Information Processing Systems, pp.153-160, 2007. ,
Domain separation networks, NIPS, pp.343-351, 2016. ,
Convex Optimization, 2004. ,
Bagging predictors, Machine Learning, vol.24, pp.123-140, 1996. ,
Random forests, Mach. Learn, vol.45, issue.1, pp.5-32, 2001. ,
Recnorm: Simultaneous normalisation and classification applied to speech recognition, NIPS, pp.234-240, 1990. ,
Signature verification using a "siamese" time delay neural network, Advances in Neural Information Processing Systems, vol.6, pp.737-744, 1994. ,
Applied optimal control: optimization, estimation, and control, 1969. ,
A gradient method for optimizing multi-stage allocation processes, Proc. Harvard Univ. Symposium on digital computers and their applications, 1961. ,
A steepest-ascent method for solving optimum programming problems, 1961. ,
Decoding by linear programming, IEEE Trans. Inf. Theor, vol.51, issue.12, pp.4203-4215, 2005. ,
The secret sharer: Measuring unintended neural network memorization & extracting secrets, 2018. ,
Multitask learning: A knowledge-based source of inductive bias, ICML, pp.41-48, 1993. ,
Multitask learning, Machine Learning, vol.28, pp.41-75, 1997. ,
Semi-supervised learning. Adaptive computation and machine learning, 2006. ,
Exact reconstruction of sparse signals via nonconvex minimization, IEEE Signal Processing Letters, vol.14, issue.10, pp.707-710, 2007. ,
Fast algorithms for nonconvex compressive sensing: Mri reconstruction from very few data, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp.262-265, 2009. ,
Cnn-based algorithm for drusen identification, International Symposium on Circuits and Systems, 2006. ,
Capturing long-term dependencies for protein secondary structure prediction, Advances in Neural Networks-ISNN 2004, International Symposium on Neural Networks, pp.494-500, 2004. ,
Marginalized denoising autoencoders for domain adaptation, ICML. icml.cc / Omnipress, 2012. ,
Lower Bound Theory of Nonzero Entries in Solutions of l2-lp Minimization, 2009. ,
Deep steering: Learning end-to-end driving model from spatial and temporal visual cues, 2017. ,
Deep autoencoder neural networks for gene ontology annotation predictions, Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB '14, pp.533-540, 2014. ,
On the properties of neural machine translation: Encoder-decoder approaches, Proceedings of SSST@EMNLP 2014, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp.103-111, 2014. ,
, , 2015.
Learning a similarity metric discriminatively, with application to face verification, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005, pp.539-546, 2005. ,
Automated segmentation of muscle and adipose tissue on CT images for human body composition analysis, Proceedings of SPIE, vol.7261, pp.72610-72610, 2009. ,
Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014. ,
Gated feedback recurrent neural networks, Proceedings of the 32nd International Conference on Machine Learning, pp.2067-2075, 2015. ,
Deep, big, simple neural nets for handwritten digit recognition, Neural Comput, vol.22, issue.12, pp.3207-3220, 2010. ,
Deep, big, simple neural nets for handwritten digit recognition, Neural Computation, vol.22, issue.12, pp.3207-3220, 2010. ,
Multi-column deep neural networks for image classification, PROCEEDINGS OF THE 25TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2012, pp.3642-3649, 2012. ,
Deep neural networks segment neuronal membranes in electron microscopy images, Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held, pp.2852-2860, 2012. ,
Transfer learning for latin and chinese characters with deep neural networks, International Joint Conference on Neural Networks, pp.1-6, 2012. ,
A unified architecture for natural language processing: deep neural networks with multitask learning, Machine Learning, Proceedings of he 25th International Conference, pp.160-167, 2008. ,
A unified architecture for natural language processing: deep neural networks with multitask learning, Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), pp.160-167, 2008. ,
Support-vector networks, Machine Learning, vol.20, issue.3, pp.273-297, 1995. ,
Learning from multiple sources, Journal of Machine Learning Research, vol.9, pp.1757-1774, 2008. ,
Feature Detection and Tracking with Constrained Local Models, BMVC, vol.10, pp.95-96, 2006. ,
Approximation with artificial neural networks, 2001. ,
Comparison of Two Deformable Registration Algorithms in the Presence of Radiologic Change Between Serial Lung CT Scans, Journal of Digital Imaging, vol.28, issue.6, pp.755-760, 2015. ,
Approximation by superpositions of a sigmoidal function, Mathematics of Control, Signals and Systems, vol.2, issue.4, pp.303-314, 1989. ,
Boosting for transfer learning, Machine Learning, Proceedings of the Twenty-Fourth International Conference (ICML 2007), pp.193-200, 2007. ,
Deep learning vector quantization, European Symposium on Artificial Neural Networks (ESANN), 2016. ,
Large scale distributed deep networks, Proceedings of the 25th International Conference on Neural Information Processing Systems, vol.1, pp.1223-1231, 2012. ,
Learning while searching in constraint-satisfaction-problems, Morgan Kaufmann. References, vol.113, pp.178-185, 1986. ,
Regularization methods for neural networks and related models, 2015. ,
ImageNet: A Large-Scale Hierarchical Image Database, CVPR09, 2009. ,
Imagenet: A large-scale hierarchical image database, CVPR, pp.248-255, 2009. ,
, Deep learning: Methods and applications. Found. Trends Signal Process, vol.7, pp.197-387, 2014.
Natural neural networks, Advances in Neural Information Processing Systems, vol.28, pp.2071-2079, 2015. ,
Tutorial on variational autoencoders, 2016. ,
For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest Solution, 2004. ,
Learning to generate chairs, tables and cars with convolutional networks, IEEE Trans. Pattern Anal. Mach. Intell, vol.39, issue.4, pp.692-705, 2017. ,
Incorporating nesterov momentum into adam, 2016. ,
The numerical solution of variational problems, Journal of Mathematical Analysis and Applications, vol.5, issue.1, pp.30-45, 1962. ,
Investigating human priors for playing video games, 2018. ,
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization, COLT, pp.257-269, 2010. ,
Low resource dependency parsing: Crosslingual parameter sharing in a neural network parser, ACL (2), pp.845-850, 2015. ,
A firm foundation for private data analysis, Commun. ACM, vol.54, issue.1, pp.86-95, 2011. ,
Calibrating noise to sensitivity in private data analysis, TCC, vol.3876, pp.265-284, 2006. ,
The algorithmic foundations of differential privacy, Foundations and Trends in Theoretical Computer Science, vol.9, issue.3-4, pp.211-407, 2014. ,
A statistical approach for phrase location and recognition within a text line: An application to street name recognition, IEEE PAMI, vol.24, issue.2, pp.172-188, 2002. ,
Invariant Subspaces, 1965. ,
Why does unsupervised pre-training help deep learning?, J. Mach. Learn. Res, vol.11, pp.625-660, 2010. ,
Scalable object detection using deep neural networks, CVPR, pp.2155-2162, 2014. ,
Regularized multi-task learning, Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.109-117, 2004. ,
Massively parallel architectures for AI: netl, thistle, and boltzmann machines, Proceedings of the National Conference on Artificial Intelligence, pp.109-113, 1983. ,
Variable selection via nonconcave penalized likelihood and its oracle properties, Journal of the American Statistical Association, vol.96, pp.1348-1360, 2001. ,
Learning Hierarchical Features for Scene Labeling, IEEE PAMI, vol.35, issue.8, pp.1915-1929, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00742077
Wasserstein discriminant analysis. Machine learning, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02112754
Deeptraffic: Driving fast through dense traffic with deep reinforcement learning, 2018. ,
Hidden markov model regression, Graduate School of Arts and Sciences, 1993. ,
Phoneme boundary estimation using bidirectional recurrent neural networks and its applications. Systems and Computers in Japan, vol.30, pp.20-30, 1999. ,
Neural network model for a mechanism of pattern recognition unaffected by shift in position-Neocognitron, Trans. IECE, J62-A, issue.10, pp.658-665, 1979. ,
Neocognitron: A self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position, Biological Cybernetics, vol.36, issue.4, pp.193-202, 1980. ,
Increasing robustness against background noise: visual pattern recognition by a Neocognitron, Neural Networks, vol.24, issue.7, pp.767-778, 2011. ,
Training multi-layered neural network Neocognitron, Neural Networks, vol.40, pp.18-31, 2013. ,
Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position, Pattern Recognition, vol.15, issue.6, pp.455-469, 1982. ,
On the approximate realization of continuous mappings by neural networks, Neural Networks, vol.2, issue.3, pp.183-192, 1989. ,
Further experiments with papa, Il Nuovo Cimento, vol.20, issue.2, pp.112-115, 1955. ,
Domain-adversarial training of neural networks, Journal of Machine Learning Research, vol.17, issue.59, pp.1-35, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01624607
Knowledge transfer via multiple model local structure mapping, Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.283-291, 2008. ,
A note on the complexity of lp minimization, Mathematical Programming, vol.129, issue.2, pp.285-299, 2011. ,
Neural networks and the bias/variance dilemma, Neural Comput, vol.4, issue.1, pp.1-58, 1992. ,
, Pac-bayes and domain adaptation. arXiv, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01563152
Automatic lumbar vertebra segmentation from clinical CT for wedge compression fracture diagnosis, Proceedings of the SPIE, vol.3, pp.796303-796312, 2011. ,
Representation properties of networks: Kolmogorov's theorem is irrelevant, Neural Computation, vol.1, issue.4, pp.465-469, 1989. ,
Vertebrae localization in pathological spine CT via dense classification from sparse annotations, MICCAI, pp.262-70, 2013. ,
Automatic Localization and Identification of Vertebrae in Arbitrary Field-of-View CT Scans, pp.590-598, 2012. ,
Robust Registration of Longitudinal Spine CT, pp.251-258, 2014. ,
Understanding the difficulty of training deep feedforward neural networks, International conference on artificial intelligence and statistics, pp.249-256, 2010. ,
Deep sparse rectifier neural networks, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11), vol.15, pp.315-323, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00752497
Domain adaptation for large-scale sentiment classification: A deep learning approach, Proceedings of the 28th International Conference on Machine Learning, pp.513-520, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00752091
Are there deep reasons underlying the pathologies of today's deep learning algorithms?, Artificial General Intelligence, pp.70-79, 2015. ,
Automatic spine identification in abdominal CT slices using image partition forests, International Symposium on Image and Signal Processing and Analysis, 2009. ,
Co-evolving recurrent neurons learn deep memory pomdps, Proceedings of the 7th Annual Conference on Genetic and Evolutionary Computation, GECCO '05, pp.491-498, 2005. ,
Deep Learning, 2016. ,
Multi-digit number recognition from street view imagery using deep convolutional neural networks, International Conference on Learning Representations, 2014. ,
Generative adversarial nets, Advances in Neural Information Processing Systems, vol.27, pp.2672-2680, 2014. ,
Explaining and harnessing adversarial examples, 2014. ,
Maxout networks, Proceedings of the 30th International Conference on Machine Learning, ICML 2013, pp.1319-1327, 2013. ,
Evaluation and selection of biases in machine learning, Machine Learning, vol.20, pp.5-22, 1995. ,
A higher body mass index and fat mass are factors predictive of docetaxel dose intensity, Anticancer research, vol.33, issue.12, p.5655, 2013. ,
Supervised Sequence Labelling with Recurrent Neural Networks, Studies in Computational Intelligence, vol.385, 2012. ,
DOI : 10.1007/978-3-642-24797-2
URL : http://mediatum.ub.tum.de/doc/673554/document.pdf
Generating sequences with recurrent neural networks, 2013. ,
Generating sequences with recurrent neural networks, 2013. ,
Towards end-to-end speech recognition with recurrent neural networks, Proceedings of the 31th International Conference on Machine Learning, pp.1764-1772, 2014. ,
Towards end-to-end speech recognition with recurrent neural networks, Proceedings of the 31st International Conference on International Conference on Machine Learning, vol.32, 2014. ,
A novel connectionist system for unconstrained handwriting recognition, vol.31, pp.855-868, 2009. ,
DOI : 10.1109/tpami.2008.137
URL : http://www.idsia.ch/~juergen/tpami_2008.pdf
Speech recognition with deep recurrent neural networks, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6645-6649, 2013. ,
DOI : 10.1109/icassp.2013.6638947
Offline handwriting recognition with multidimensional recurrent neural networks, Advances in Neural Information Processing Systems, vol.21, pp.545-552, 2009. ,
DOI : 10.1007/978-1-4471-4072-6_12
, Neural turing machines, 2014.
Hybrid computing using a neural network with dynamic external memory, Nature, vol.538, issue.7626, pp.471-476, 2016. ,
DOI : 10.1038/nature20101
, Documenta Mathematica-Extra, vol.ISMP, pp.389-400, 2012.
Contour enhancement, short term memory, and constancies in reverberating neural networks, Studies in Applied Mathematics, vol.52, issue.3, pp.213-257, 1973. ,
DOI : 10.1007/978-94-009-7758-7_8
, Contour Enhancement, Short Term Memory, and Constancies in Reverberating Neural Networks, pp.332-378, 1982.
The Minimum Description Length Principle (Adaptive Computation and Machine Learning), References, vol.117, 2007. ,
A tutorial introduction to the minimum description length principle, Advances in Minimum Description Length: Theory and Applications, 2005. ,
Towards deep neural network architectures robust to adversarial examples, 2014. ,
Mémoire sur le problème d'analyse relatif à l'équilibre des plaques élastiques encastrées, vol.33, 1908. ,
Dimensionality reduction by learning an invariant mapping, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.1735-1742, 2006. ,
DOI : 10.1109/cvpr.2006.100
Writer-independent feature learning for offline signature verification using deep convolutional neural networks, 2016. ,
DOI : 10.1109/ijcnn.2016.7727521
URL : http://arxiv.org/pdf/1604.00974
On the approximation capability of recurrent neural networks, International Symposium on Neural Computation, pp.12-16, 1998. ,
Memorization without generalization in a multilayered neural network, Europhysics Letters), vol.20, issue.5, p.471, 1992. ,
A joint many-task model: Growing a neural network for multiple NLP tasks, EMNLP, pp.1923-1933, 2017. ,
Fundamentals of Artificial Neural Networks, 1995. ,
Statistical Learning with Sparsity: The Lasso and Generalizations, 2015. ,
Brain tumor segmentation with deep neural networks, 2015. ,
Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, ICCV 2015, pp.1026-1034, 2015. ,
Deep residual learning for image recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016. ,
Identity mappings in deep residual networks, Computer Vision-ECCV 2016-14th European Conference, pp.630-645, 2016. ,
The organization of behavior: A neuropsychological theory, 1949. ,
Neurocomputing, 1989. ,
Theory of the backpropagation neural network, International Joint Conference on Neural Networks (IJCNN), pp.593-605, 1989. ,
Learning bayesian networks: The combination of knowledge and statistical data, Machine Learning, vol.20, issue.3, pp.197-243, 1995. ,
Training products of experts by minimizing contrastive divergence, Neural Comput, vol.14, issue.8, pp.1771-1800, 2002. ,
Learning multiple layers of representation, Trends in Cognitive Sciences, vol.11, pp.428-434, 2007. ,
Learning multiple layers of representation, Trends in Cognitive Sciences, vol.11, pp.428-434, 2007. ,
A practical guide to training restricted boltzmann machines, Neural Networks: Tricks of the Trade, vol.7700, pp.599-619, 2012. ,
Parallel distributed processing: Explorations in the microstructure of cognition, chapter Distributed Representations, vol.1, pp.77-109, 1986. ,
A fast learning algorithm for deep belief nets, Neural Comput, vol.18, issue.7, pp.1527-1554, 2006. ,
A fast learning algorithm for deep belief nets, Neural Computation, vol.18, issue.7, pp.1527-1554, 2006. ,
Matrix capsules with EM routing, International Conference on Learning Representations, 2018. ,
Reducing the dimensionality of data with neural networks, Science, vol.313, issue.5786, pp.504-507, 2006. ,
Boltzmann machines: Constraint satisfaction networks that learn, 1984. ,
Random decision forests, Proceedings of the Third International Conference on Document Analysis and Recognition, vol.1, p.278, 1995. ,
Untersuchungen zu dynamischen neuronalen Netzen, 1991. ,
Gradient flow in recurrent nets: the difficulty of learning long-term dependencies, A Field Guide to Dynamical Recurrent Neural Networks, 2001. ,
Long short-term memory, Neural Comput, vol.9, issue.8, pp.1735-1780, 1997. ,
Fcns in the wild: Pixel-level adversarial and constraint-based adaptation, 2016. ,
Multilayer feedforward networks are universal approximators, Neural Networks, vol.2, issue.5, pp.359-366, 1989. ,
Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks, Neural Netw, vol.3, issue.5, pp.551-560, 1990. ,
Playing atari games with deep reinforcement learning and human checkpoint replay, 2016. ,
Deep and wide multiscale recursive networks for robust image labeling, 2013. ,
Learning-Based Vertebra Detection and Iterative Normalized-Cut Segmentation for Spinal MRI, IEEE Transactions on Medical Imaging, vol.28, issue.10, pp.1595-1605, 2009. ,
Receptive fields, binocular interaction, and functional architecture in the cat's visual cortex, Journal of Physiology, vol.160, pp.106-154, 1962. ,
Frustratingly easy domain adaptation, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.256-263, 2007. ,
An Adaptive Logic System with Generalizing Properties, 1962. ,
Batch normalization: Accelerating deep network training by reducing internal covariate shift, Proceedings of the 32nd International Conference on Machine Learning, pp.448-456, 2015. ,
Capabilities of three-layered perceptrons, IEEE International Conference on Neural Networks, vol.1, p.218, 1988. ,
Polynomial theory of complex systems, IEEE Transactions on Systems, Man and Cybernetics, issue.4, pp.364-378, 1971. ,
Cybernetic Predicting Devices, 1965. ,
Cybernetics and forecasting techniques, 1967. ,
Deep structured output learning for unconstrained text recognition, 2014. ,
An Introduction to Statistical Learning: With Applications in R, 2014. ,
What is the best multi-stage architecture for object recognition, ICCV 2009, pp.2146-2153, 2009. ,
Instance weighting for domain adaptation in NLP, ACL 2007, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, 2007. ,
Representational transfer in deep belief networks, 28th Canadian Conference on Artificial Intelligence, pp.338-342, 2015. ,
Protein secondary structure prediction based on position-specific scoring matrices, Journal of Molecular Biology, vol.292, issue.2, pp.195-202, 1999. ,
Inferring algorithmic patterns with stack-augmented recurrent nets, Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems, pp.190-198, 2015. ,
Exploring the limits of language modeling, 2016. ,
Automatic inference of articulated spine models in CT images using high-order markov random fields, Medical Image Analysis, vol.15, issue.4, pp.426-437, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00856308
Impact of sarcopenia on survival in patients undergoing living donor liver transplantation, American Journal of Transplantation, vol.13, issue.6, pp.1549-1556, 2013. ,
Deep visual-semantic alignments for generating image descriptions, IEEE Conference on Computer Vision and Pattern Recognition, pp.3128-3137, 2015. ,
An Introduction to Computational Learning Theory, 1994. ,
Gradient theory of optimal flight paths, Ars Journal, vol.30, issue.10, pp.947-954, 1960. ,
Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, 2017. ,
Convolutional neural networks for sentence classification, 2014. ,
Convolutional neural networks for sentence classification, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp.1746-1751, 2014. ,
Adam: A method for stochastic optimization, 2014. ,
Auto-encoding variational bayes, 2013. ,
Unifying visual-semantic embeddings with multimodal neural language models, 2014. ,
Tensor decompositions and applications, SIAM Rev, vol.51, issue.3, pp.455-500, 2009. ,
On the representation of continuous functions of several variables by superposition of continuous functions of one variable and addition, Doklady Akademii Nauk SSSR, vol.114, pp.369-373, 1957. ,
On the representation of continuous functions of several variables by superposition of continuous functions of one variable and addition, Doklady Akademii. Nauk USSR, vol.114, pp.679-681, 1965. ,
Learning multiple layers of features from tiny images, 2009. ,
ImageNet Classification with Deep Convolutional Neural Networks, Advances in Neural Information Processing Systems, vol.25, pp.1097-1105, 2012. ,
Deep nets don't learn via memorization, 2017. ,
Incorporating prior knowledge on features into learning, Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, pp.227-234, 2007. ,
Ask me anything: Dynamic memory networks for natural language processing, Proceedings of The 33rd International Conference on Machine Learning, vol.48, pp.1378-1387, 2016. ,
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, ICML, pp.282-289, 2001. ,
Deep learning for medical image segmentation, 2015. ,
Sarcopenia is an independent prognostic factor in elderly patients with diffuse large b-cell lymphoma treated with immunochemotherapy, Leukemia & Lymphoma, vol.55, issue.4, pp.817-823, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01141161
Learning to learn with the informative vector machine, Machine Learning, Proceedings of the Twenty-first International Conference (ICML 2004), 2004. ,
Interactive Facial Feature Localization, ECCV, 2012, Proceedings, Part III, pp.679-692, 2012. ,
Une procédure d'apprentissage pour réseau à seuil asymétrique, Proceedings of Cognitiva 85, pp.599-604, 1985. ,
Back-propagation applied to handwritten zip code recognition, Neural Computation, vol.1, issue.4, pp.541-551, 1989. ,
Backpropagation applied to handwritten zip code recognition, Neural Comput, vol.1, issue.4, pp.541-551, 1989. ,
Handwritten digit recognition with a back-propagation network, Advances in Neural Information Processing Systems, vol.2, pp.396-404, 1990. ,
Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998. ,
Efficient backprop, Neural Networks: Tricks of the Trade, This Book is an Outgrowth of a 1996 NIPS Workshop, pp.9-50, 1998. ,
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.609-616, 2009. ,
Memoir using the chain rule, vol.7, pp.2-3, 1676. ,
IODA : An input / output deep architecture for image labeling, Pattern Recognition, vol.48, issue.9, pp.2847-2858, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-02094941
Multilayer feedforward networks with a nonpolynomial activation function can approximate any function, Neural Networks, vol.6, issue.6, pp.861-867, 1993. ,
Analyse des infiniment petits, pour l'intelligence des lignes courbes, L'Imprimerie Royale, 1696. ,
Pruning filters for efficient convnets, 2016. ,
Socializing the semantic gap: A comparative survey on image tag assignment, refinement, and retrieval, ACM Comput. Surv, vol.49, issue.1, p.39, 2016. ,
Ridge functions, sigmoidal functions and neural networks. Approximation theory VII, pp.163-206, 1992. ,
The representation of the cumulative rounding error of an algorithm as a Taylor expansion of the local rounding errors, 1970. ,
Taylor expansion of the accumulated rounding error, BIT Numerical Mathematics, vol.16, issue.2, pp.146-160, 1976. ,
An introduction to computing with neural nets, SIGARCH Computer Architecture News, vol.16, issue.1, pp.7-25, 1988. ,
On the limited memory bfgs method for large scale optimization, Math. Program, vol.45, issue.3, pp.503-528, 1989. ,
A recursive recurrent neural network for statistical machine translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1491-1500, 2014. ,
Fully convolutional networks for semantic segmentation, IEEE Conference on Computer Vision and Pattern Recognition, pp.3431-3440, 2015. ,
Learning multiple tasks with deep relationship networks, 2015. ,
Fully-adaptive feature sharing in multi-task networks with applications in person attribute classification, CVPR, pp.1131-1140, 2017. ,
Optimization by vector space methods. Decision and control, 1969. ,
Hierarchical segmentation and identification of thoracic vertebra using learning-based edge detection and coarse-to-fine deformable model, Computer Vision and Image Understanding, vol.117, issue.9, pp.1072-1083, 2013. ,
Rectifier nonlinearities improve neural network acoustic models, ICML Workshop on Deep Learning for Audio, Speech and Language Processing, 2013. ,
Transfer learning using kolmogorov complexity: Basic theory and empirical evaluations, Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, pp.985-992, 2007. ,
Automated landmarking and labeling of fully and partially scanned spinal columns in CT images, Medical Image Analysis, vol.17, issue.8, pp.1151-1163, 2013. ,
, Adversarial autoencoders. CoRR, 2015.
Identifying histological elements with convolutional neural networks, Int. Conf. on Soft Computing As Transdisciplinary Science and Technology, pp.450-456, 2008. ,
DOI : 10.1145/1456223.1456316
Deep learning: A critical appraisal, 2018. ,
The algebraic mind: Integrating connectionism and cognitive science, 2003. ,
Deep learning via hessian-free optimization, Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp.735-742, 2010. ,
DOI : 10.1007/978-3-642-35289-8_27
URL : http://www.cs.toronto.edu/~jmartens/docs/HF_book_chapter.pdf
Learning recurrent neural networks with Hessian-free optimization, ICML 2011, pp.1033-1040, 2011. ,
DOI : 10.1007/978-3-642-35289-8_27
URL : http://www.cs.toronto.edu/~jmartens/docs/HF_book_chapter.pdf
Cancer cachexia in the age of obesity: Skeletal muscle depletion is a powerful prognostic factor, independent of body mass index, Journal of Clinical Oncology, vol.31, issue.12, pp.1539-1547, 2013. ,
Stacked Convolutional AutoEncoders for Hierarchical Feature Extraction, pp.52-59, 2011. ,
DOI : 10.1007/978-3-642-21735-7_7
URL : http://www.idsia.ch/~juergen/icann2011stack.pdf
A logical calculus of the ideas immanent in nervous activity, The bulletin of mathematical biophysics, vol.5, issue.4, pp.115-133, 1943. ,
Deformable models in medical image analysis: a survey, Medical image analysis, vol.1, issue.2, pp.91-108, 1996. ,
Spine detection in CT and MR using iterated marginal space learning, Medical Image Analysis, vol.17, issue.8, pp.1283-1292, 2013. ,
Mapping and revising markov logic networks for transfer learning, Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, pp.608-614, 2007. ,
Transfer learning with markov logic networks, ICML workshop on structural knowledge transfer for machine learning, 2006. ,
Transfer learning by mapping with minimal target data, Proceedings of the AAAI-08 workshop on transfer learning for complex tasks, 2008. ,
Statistical language models based on neural networks, 2012. ,
Distributed representations of words and phrases and their compositionality, NIPS, pp.3111-3119, 2013. ,
Perceptrons: An Introduction to Computational Geometry, 1969. ,
Perceptrons: Expanded Edition, 1988. ,
Cross-stitch networks for multi-task learning, CVPR, pp.3994-4003, 2016. ,
DOI : 10.1109/cvpr.2016.433
URL : http://arxiv.org/pdf/1604.03539
The need for biases in learning generalizations, 1980. ,
, Machine Learning, 1997.
Cadaver validation of skeletal muscle measurement by magnetic resonance imaging and computerized tomography, Journal of applied physiology, vol.85, issue.1, pp.115-122, 1998. ,
Playing atari with deep reinforcement learning, NIPS Deep Learning Workshop, 2013. ,
Human-level control through deep reinforcement learning, Nature, vol.518, issue.7540, pp.529-533, 2015. ,
Conditional restricted boltzmann machines for structured output prediction, UAI 2011, Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence, pp.514-522, 2011. ,
Foundations of Machine Learning, 2012. ,
Pruning convolutional neural networks for resource efficient transfer learning, 2016. ,
Exact calculation of the product of the Hessian matrix of feed-forward network error functions and a vector in O(N) time, 1993. ,
On the number of linear regions of deep neural networks, Proceedings of the 27th International Conference on Neural Information Processing Systems, vol.2, pp.2924-2932, 2014. ,
Minimizing nonconvex functions for sparse vector reconstruction, IEEE Trans. Signal Processing, vol.58, issue.7, pp.3485-3496, 2010. ,
Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp.807-814, 2010. ,
Local binary patterns variants as texture descriptors for medical image analysis, Artificial Intelligence in Medicine, vol.49, issue.2, pp.117-125, 2010. ,
Sparse approximate solutions to linear systems, SIAM J. Comput, vol.24, issue.2, pp.227-234, 1995. ,
A method of solving a convex programming problem with convergence rate O(1/sqr(k)), Soviet Mathematics Doklady, vol.27, pp.372-376, 1983. ,
Deep neural networks are easily fooled: High confidence predictions for unrecognizable images, CVPR, pp.427-436, 2015. ,
Automatic image filtering on social networks using deep learning and perceptual hashing during crises, 2017. ,
A Markovian Approach for Handwritten Document Segmentation, ICPR (3), pp.292-295, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00509210
Kolmogorov's mapping neural network existence theorem, Proceedings of the IEEE First International Conference on Neural Networks, vol.III, pp.11-13, 1987. ,
Toward automatic phenotyping of developing embryos from videos, IEEE Trans. Image Processing, vol.14, issue.9, pp.1360-1371, 2005. ,
URL : https://hal.archives-ouvertes.fr/hal-00114920
Incorporating prior information in machine learning by creating virtual examples, Proceedings of the IEEE, vol.86, issue.11, pp.2196-2209, 1998. ,
Learning deconvolution network for semantic segmentation, 2015 IEEE International Conference on Computer Vision, ICCV 2015, pp.1520-1528, 2015. ,
Learning Hidden Markov Models for Regression using Path Aggregation. CoRR, abs/1206, p.3275, 2012. ,
On convergence proofs on perceptrons, Proceedings of the Symposium on the Mathematical Theory of Automata, 1962. ,
Minimum error rate training in statistical machine translation, Proceedings of the ACL, vol.1, 2003. ,
Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.7, pp.971-987, 2002. ,
Localization of the lumbar discs using machine learning and exact probabilistic inference, Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp.158-165, 2011. ,
A sociological study of the official history of the perceptrons controversy, Social Studies of Science, vol.26, issue.3, pp.611-659, 1996. ,
, Handbook Of Research On Machine Learning Applications and Trends: Algorithms, Methods and Techniques-2 Volumes, 2009.
Computation with spikes in a winner-take-all network, Neural Computation, vol.21, issue.9, pp.2437-2465, 2009. ,
A survey on transfer learning, IEEE Trans. on Knowl. and Data Eng, vol.22, issue.10, pp.1345-1359, 2010. ,
To go deep or wide in learning?, Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, vol.33, pp.724-732, 2014. ,
Semisupervised knowledge transfer for deep learning from private training data, 2016. ,
Exploiting unrelated tasks in multi-task learning, Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, vol.22, pp.951-959, 2012. ,
Faster cnns with direct sparse convolutions and guided pruning, 2016. ,
Learning-logic, 1985. ,
Understanding the exploding gradient problem, 2012. ,
On the difficulty of training recurrent neural networks, Proceedings of the 30th International Conference on Machine Learning, ICML 2013, pp.1310-1318, 2013. ,
On the difficulty of training recurrent neural networks, Proceedings of the 30th International Conference on International Conference on Machine Learning, vol.28, 2013. ,
Fast exact multiplication by the Hessian, Neural Computation, vol.6, issue.1, pp.147-160, 1994. ,
Sarcopenia negatively impacts short-term outcomes in patients undergoing hepatic resection for colorectal liver metastasis, HPB, vol.13, issue.7, pp.439-446, 2011. ,
Computational optimal transport, 2017. ,
Current methods in medical image segmentation 1, Annual review of biomedical engineering, vol.2, issue.1, pp.315-337, 2000. ,
Some methods of speeding up the convergence of iteration methods, USSR Computational Mathematics and Mathematical Physics, vol.4, issue.5, pp.1-17, 1964. ,
Analyzing noise in autoencoders and deep networks, 2014. ,
Parallel training of deep neural networks with natural gradient and parameter averaging, 2014. ,
Introduction to tensor decompositions and their applications in machine learning, 2017. ,
A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, vol.77, issue.2, pp.257-286, 1989. ,
Unsupervised representation learning with deep convolutional generative adversarial networks, 2015. ,
Deep learning made easier by linear transformations in perceptrons, Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, vol.22, pp.924-932, 2012. ,
Self-taught learning: transfer learning from unlabeled data, Machine Learning, Proceedings of the Twenty-Fourth International Conference (ICML 2007), pp.759-766, 2007. ,
Large-scale deep unsupervised learning using graphics processors, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.873-880, 2009. ,
Unsupervised pretraining for sequence to sequence learning, EMNLP, pp.383-391, 2017. ,
Massively multitask networks for drug discovery, 2015. ,
Efficient Learning of Sparse Representations with an Energy-Based Model, NIPS, pp.1137-1144, 2007. ,
Sparse feature learning for deep belief networks, Advances in Neural Information Processing Systems 20, Proceedings of the TwentyFirst Annual Conference on Neural Information Processing Systems, pp.1185-1192, 2007. ,
Unsupervised learning of invariant feature hierarchies with applications to object recognition, Proc. Computer Vision and Pattern Recognition Conference (CVPR'07), 2007. ,
Efficient learning of sparse representations with an energy-based model, Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, pp.1137-1144, 2006. ,
CNN features off-the-shelf: An astounding baseline for recognition, CVPR Workshops, pp.512-519, 2014. ,
Faster r-cnn: Towards real-time object detection with region proposal networks, NIPS, vol.28, pp.91-99, 2015. ,
Hierarchical models of object recognition in cortex, Nature Neuroscience, vol.2, issue.11, 1999. ,
Higher order contractive auto-encoder, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2011. ,
Contractive auto-encoders: Explicit invariance during feature extraction, Proceedings of the 28th International Conference on Machine Learning, pp.833-840, 2011. ,
Modeling by shortest data description, Automatica, vol.14, issue.5, pp.465-471, 1978. ,
DOI : 10.1016/0005-1098(78)90005-5
U-net: Convolutional networks for biomedical image segmentation, Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015-18th International Conference, pp.234-241, 2015. ,
DOI : 10.1007/978-3-319-24574-4_28
URL : http://arxiv.org/pdf/1505.04597
The perceptron: A probabilistic model for information storage and organization in the brain, Psychological Review, pp.65-386, 1958. ,
Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms, 1962. ,
Detection of sclerotic spine metastases via random aggregation of deep convolutional neural network classifications, 2014. ,
Sluice networks: Learning what to share between loosely related tasks, 2017. ,
Principles of mathematical analysis, 1964. ,
Learning internal representations by error propagation, Parallel Distributed Processing, vol.1, pp.318-362, 1986. ,
Dynamic routing between capsules, Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems, pp.3859-3869, 2017. ,
A semi-automatic methodology for facial landmark annotation, CVPR Workshops, pp.896-903, 2013. ,
DOI : 10.1109/cvprw.2013.132
Deep Boltzmann machines, Proceedings of the International Conference on Artificial Intelligence and Statistics, vol.5, pp.448-455, 2009. ,
Markov chain monte carlo and variational inference: Bridging the gap, Proceedings of the 32nd International Conference on Machine Learning, pp.1218-1226, 2015. ,
Sleep quality prediction from wearable data using deep learning, JMIR mHealth and uHealth, vol.4, issue.4, 2016. ,
DOI : 10.2196/mhealth.6562
URL : https://doi.org/10.2196/mhealth.6562
Geometry-based vs. intensity-based medical image registration: A comparative study on 3D CT data, Computers in Biology and Medicine, vol.69, pp.120-133, 2016. ,
Part-of-speech tagging with neural networks, conference on Computational linguistics, vol.12, pp.44-49, 1994. ,
DOI : 10.3115/991886.991915
URL : http://arxiv.org/pdf/cmp-lg/9410018
A local learning algorithm for dynamic feedforward and recurrent networks, Connection Science, vol.1, issue.4, pp.403-412, 1989. ,
Learning complex, extended sequences using the principle of history compression, Neural Computation, vol.4, issue.2, pp.234-242, 1992. ,
, My first Deep Learning system of 1991 + Deep Learning timeline 1962-2013, 2013.
Deep learning in neural networks: An overview, Neural Networks, vol.61, pp.85-117, 2014. ,
DOI : 10.1016/j.neunet.2014.09.003
URL : http://arxiv.org/pdf/1404.7828
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2001. ,
On supervised learning from sequential data with applications for speech recognition, 1999. ,
Bidirectional recurrent neural networks, Trans. Sig. Proc, vol.45, issue.11, pp.2673-2681, 1997. ,
Pedestrian detection with unsupervised multi-stage feature learning, Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR '13, pp.3626-3633, 2013. ,
Understanding Machine Learning: From Theory to Algorithms, 2014. ,
Total body skeletal muscle and adipose tissue volumes: estimation from a single abdominal cross-sectional image, Journal of applied physiology, vol.97, issue.6, pp.2333-2338, 2004. ,
A latent semantic model with convolutional-pooling structure for information retrieval, CIKM, pp.101-110, 2014. ,
Improving predictive inference under covariate shift by weighting the log-likelihood function, Journal of Statistical Planning and Inference, vol.90, issue.2, pp.227-244, 2000. ,
Deep convolutional neural networks for computer-aided detection: Cnn architectures, dataset characteristics and transfer learning, IEEE Transactions on Medical Imaging, vol.35, issue.5, pp.1285-1298, 2016. ,
Creating artificial neural networks that generalize, Neural Networks, vol.4, issue.1, pp.67-79, 1991. ,
Mastering the game of go with deep neural networks and tree search, Nature, vol.529, issue.7587, pp.484-489, 2016. ,
Efficient Pattern Recognition Using a New Transformation Distance, Advances in Neural Information Processing Systems, vol.5, pp.50-58, 1993. ,
Tangent Prop-a formalism for specifying selected invariances in an adaptive network, Advances in Neural Information Processing Systems, vol.4, pp.895-903, 1992. ,
Best practices for convolutional neural networks applied to visual document analysis, Proceedings of the Seventh International Conference on Document Analysis and Recognition, vol.2, p.958, 2003. ,
Deep inside convolutional networks: Visualising image classification models and saliency maps, 2013. ,
Very deep convolutional networks for large-scale image recognition, 2014. ,
Overtraining, regularization and searching for a minimum, with application to neural networks, International Journal of Control, vol.62, pp.1391-1407, 1995. ,
Parsing English with a link grammar, Proc. Third International Workshop on Parsing Technologies, pp.277-292, 1993. ,
Parallel distributed processing: Explorations in the microstructure of cognition, chapter Information Processing in Dynamical Systems: Foundations of Harmony Theory, vol.1, pp.194-281, 1986. ,
Recursive deep models for semantic compositionality over a sentiment treebank, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp.1631-1642, 2013. ,
Deep multi-task learning with low level tasks supervised at lower layers. In ACL (2). The Association for Computer Linguistics, 2016. ,
Learning structured output representation using deep conditional generative models, NIPS 2015, pp.3483-3491, 2015. ,
Improving Neural Networks with Dropout, 2013. ,
Dropout: A simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, pp.1929-1958, 2014. ,
Highway Networks, 2016. ,
, Dictionary Learning, pp.263-274, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01717943
The history of approximation theory: from Euler to Bernstein, 2007. ,
Using gpus for machine learning algorithms, Proceedings of the Eighth International Conference on Document Analysis and Recognition, ICDAR '05, pp.1115-1119, 2005. ,
Cohort of LSTM and lexicon verification for handwriting recognition with gigantic lexicon, 2016. ,
Rule-injection hints as a means of improving network performance and learning time, Neural Networks, EURASIP workshop 1990, pp.120-129, 1990. ,
Dimensionality reduction of multimodal labeled data by local fisher discriminant analysis, J. Mach. Learn. Res, vol.8, pp.1027-1061, 2007. ,
Weakly supervised memory networks, 2015. ,
The fundamentality of sets of ridge functions. aequationes mathematicae, vol.44, pp.226-235, 1992. ,
On the importance of initialization and momentum in deep learning, ICML, vol.28, pp.1139-1147, 2013. ,
Sequence to sequence learning with neural networks, Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems, pp.3104-3112, 2014. ,
Sequence to sequence learning with neural networks, Proceedings of the 27th International Conference on Neural Information Processing Systems, vol.2, pp.3104-3112, 2014. ,
Enzyme function prediction with interpretable models. Computational Systems Biology, pp.373-420, 2009. ,
Efficient processing of deep neural networks: A tutorial and survey, Proceedings of the IEEE, vol.105, issue.12, pp.2295-2329, 2017. ,
Going Deeper with Convolutions, 2014. ,
, , 2014.
Deep neural networks for object detection, vol.26, pp.2553-2561, 2013. ,
Intriguing properties of neural networks, 2013. ,
Contextual Recognition of Hand-drawn Diagrams with Conditional Random Fields, IWFHR, pp.32-37, 2004. ,
Deep networks for robust visual recognition, Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp.1055-1062, 2010. ,
On lp programming, European Journal of Operational Research, vol.22, issue.1, pp.70-100, 1985. ,
Is learning the n-th thing any easier than learning the first?, Advances in Neural Information Processing Systems, pp.640-646, 1996. ,
Learning one more thing, Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, vol.95, pp.1217-1225, 1995. ,
Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B, vol.58, pp.267-288, 1994. ,
On solving ill-posed problem and method of regularization, Doklady Akademii Nauk USSR, vol.153, pp.501-504, 1963. ,
Solutions of Ill-posed problems, 1977. ,
Coupling CRFs and Deformable Models for 3D Medical Image Segmentation, ICCV, pp.1-8, 2007. ,
Adversarial discriminative domain adaptation, CVPR, 2017. ,
, Deep image prior, 2017.
Multi-modal brain tumor segmentation using deep convolutional neural networks, MICCAI BraTS Challenge Proceedings, pp.31-35, 2014. ,
, Machine Learning of Inductive Bias, 1986.
A theory of the learnable, Commun. ACM, vol.27, issue.11, pp.1134-1142, 1984. ,
Deep content-based music recommendation, NIPS, pp.2643-2651, 2013. ,
On the uniform convergence of relative frequencies of events to their probabilities, Theory of Probability and its Applications, vol.16, pp.264-280, 1971. ,
Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.1096-1103, 2008. ,
Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion, JMLR, vol.11, pp.3371-3408, 2010. ,
Grammar as a foreign language, Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems, pp.2773-2781, 2015. ,
Show and tell: A neural image caption generator, IEEE Conference on Computer Vision and Pattern Recognition, pp.3156-3164, 2015. ,
DOI : 10.1109/cvpr.2015.7298935
URL : http://arxiv.org/pdf/1411.4555
Dropout training as adaptive regularization, Advances in Neural Information Processing Systems, vol.26, pp.351-359, 2013. ,
Three-dimensional display in nuclear medicine and radiology, Society of Nuclear Medicine, vol.32, issue.3, pp.534-546, 1991. ,
DOI : 10.1109/42.41482
, Cardiovascular Nuclear Medicine and MRI: Quantitation and Clinical Applications, pp.89-100, 1992.
Three-dimensional display in nuclear medicine, IEEE Trans. on Medical Imaging, vol.8, issue.4, pp.297-230, 1989. ,
DOI : 10.1109/42.41482
Regularization of neural networks using dropconnect, ICML, vol.28, pp.1058-1066, 2013. ,
Manifold alignment using procrustes analysis, Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), pp.1120-1127, 2008. ,
DOI : 10.1145/1390156.1390297
URL : https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1061&context=cs_faculty_pubs
Self-organizing polynomial neural network for modelling complex hydrological processes, 2005. ,
Improving content-based and hybrid music recommendation using deep learning, ACM Multimedia, pp.627-636, 2014. ,
DOI : 10.1145/2647868.2654940
An empirical analysis of dropout in piecewise linear networks, International Conference on Learning Representations, 2014. ,
Cresceptron: a self-organizing neural network which grows adaptively, International Joint Conference on Neural Networks (IJCNN), vol.1, pp.576-581, 1992. ,
DOI : 10.1109/ijcnn.1992.287150
URL : http://vision.ai.uiuc.edu/publications/cresceptron_1992.pdf
Learning recognition and segmentation using the cresceptron, International Journal of Computer Vision, vol.25, issue.2, pp.109-143, 1997. ,
Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences, 1974. ,
Applications of advances in nonlinear sensitivity analysis, Proceedings of the 10th IFIP Conference, 31.8-4.9, pp.762-770, 1981. ,
, Memory networks, 2014.
Deep learning via semi-supervised embedding, Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), pp.1168-1175, 2008. ,
DOI : 10.1145/1390156.1390303
URL : http://icml2008.cs.helsinki.fi/papers/340.pdf
Deep learning via semi-supervised embedding, Neural Networks: Tricks of the Trade, 2012. ,
DOI : 10.1145/1390156.1390303
URL : http://icml2008.cs.helsinki.fi/papers/340.pdf
An Adaptive "ADALINE" Neuron Using Chemical "Memistors, 1960. ,
Receptive fields of single neurones in the cat's striate cortex, J. Physiol, vol.148, pp.574-591, 1959. ,
Mean-normalized stochastic gradient for large-scale deep learning, International Conference on Acoustics Speech and Signal Processing ICASSP, pp.180-184, 2014. ,
A convergence analysis of log-linear training and its application to speech recognition, IEEE Workshop on Automatic Speech Recognition Understanding, pp.1-6, 2011. ,
Madaline rule ii: a training algorithm for neural networks, IEEE 1988 International Conference on Neural Networks, vol.1, pp.401-408, 1988. ,
The lack of a priori distinctions between learning algorithms, Neural Comput, vol.8, issue.7, pp.1341-1390, 1996. ,
The influence of improvement in one mental function upon the efficiency of other functions.(i), Psychological review, vol.8, issue.3, p.247, 1901. ,
Incorporating prior knowledge with weighted margin support vector machines, Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '04, pp.326-333, 2004. ,
Show, attend and tell: Neural image caption generation with visual attention, Proceedings of the 32nd International Conference on Machine Learning, pp.2048-2057, 2015. ,
An efficient algorithm for minimizing a sum of p-norms, SIAM Journal on Optimization, vol.10, issue.2, pp.551-579, 2000. ,
Multi-task learning for classification with dirichlet process priors, Journal of Machine Learning Research, vol.8, pp.35-63, 2007. ,
Deep multi-task representation learning: A tensor factorisation approach, 2016. ,
Trace norm regularised deep multi-task learning, 2016. ,
Imaging body composition in cancer patients: visceral obesity, sarcopenia and sarcopenic obesity may impact on clinical outcome, Insights into Imaging, pp.489-497, 2015. ,
, How transferable are features in deep neural networks? In NIPS, pp.3320-3328, 2014.
Deep reinforcement learning for simulated autonomous vehicle control, Course Project Reports, pp.1-7, 2016. ,
Incorporating prior domain knowledge into inductive machine learning. Unpublished doctoral dissertation Computer Sciences, 2007. ,
VQSVM: A case study for incorporating prior domain knowledge into inductive machine learning, Neurocomputing, vol.73, pp.2614-2623, 2010. ,
ADADELTA: An Adaptive Learning Rate Method, 2012. ,
ADADELTA: an adaptive learning rate method, 2012. ,
Stochastic pooling for regularization of deep convolutional neural networks, International Conference on Learning Representations (ICLR2013), 2013. ,
Visualizing and understanding convolutional networks, ECCV (1), vol.8689, pp.818-833, 2014. ,
Statistical parametric speech synthesis, Speech Communication, vol.51, issue.11, pp.1039-1064, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00746106
Understanding deep learning requires rethinking generalization, 2016. ,
Coarse-to-Fine Auto-Encoder Networks (CFAN) for Real-Time Face Alignment, ECCV, Part II, pp.1-16, 2014. ,
Learning deep CNN denoiser prior for image restoration, CVPR, pp.2808-2817, 2017. ,
A design methodology for efficient implementation of deconvolutional neural networks on an FPGA, 2017. ,
Curriculum domain adaptation for semantic segmentation of urban scenes, 2017. ,
A survey on multi-task learning, 2017. ,
Facial landmark detection by deep multi-task learning, Computer Vision, ECCV 2014, 13th European Conference, pp.94-108, 2014. ,
1. bias-variance tradeoff (Sec.A.2.1), 2. feedforward networks (Sec.A.2.2), including (a) backpropagation, derivatives computation, and issues, IJCAI, pp.4119-4125, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01667782
, 3) including L p norm regularization (Sec.A.2.3.1), and early stopping (Sec.A.2.3.2). real life (Sec.A.1.1), while we provide some basic definitions (Sec.A.1.2) and different learning scenarios (Sec.A.1.3). of applications, including ? Text or document classification, and the impact of some regularization approaches on the obtained solution
, ? Speech recognition, speech synthesis, speaker verification; ? Computational biology applications, e.g., protein function or structural prediction; ? Computer vision tasks, e.g., image recognition, face detection; ? Fraud detection (credit card, telephone), and network intrusion
, Medical diagnosis; ? Recommendation systems, search engines, information extraction systems