A Latently Constrained Mixture Model for Audio Source Separation and Localization, proceedings of the 10th International Conference on Latent Variable Analysis and Signal Separation, volume LNCS 7191, pp.372-379, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00768660
Variational EM for binaural sound-source separation and localization, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013. ,
DOI : 10.1109/ICASSP.2013.6637612
URL : https://hal.archives-ouvertes.fr/hal-00823453
Hearing on Binaural Manifolds: Acoustic Space Learning for Sound-Source Separation and Localization, International Journal Submissions ? International Journal of Neural Systems, 2013. ,
Binaural Co-Localization of Audio Source Pairs, IEEE Signal Processing Letters, 2013. ,
Mapping Learning with Partially-Latent Output Statistics and Computing Learning the Direction of a Sound Source Using Head Motions and Spectral Features, APPENDIX . PUBLICATIONS Other Articles ?, 2011. ,
Sebstian Wrede & Radu Horaud P. Online Multimodal Speaker Detection for Humanoid Robots, IEEE International Conference on Humanoid Robotics (Humanoids), 2012. ,
Vojtech Franc RAVEL: An Annotated Corpus for Training Robots with Audiovisual Abilities, Journal on Multimodal User Interfaces, vol.7, issue.12, pp.79-91, 2013. ,
Mapping Learning with Partially Latent Output. arXiv preprint, 2013. ,
Active-Speaker Detection and Localization with Microphones and Cameras Embedded into a Robotic Head REFERENCES [Aarabi 02] P. Aarabi. Self-localizing dynamic microphone arrays, IEEE International Conference on Humanoid Robots, pp.474-484, 2002. ,
Sufficient dimension reduction and prediction in regression, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.68, issue.3, pp.4385-4405, 1906. ,
DOI : 10.1093/biomet/asm044
Learning to track 3D human motion from silhouettes, Twenty-first international conference on Machine learning , ICML '04, pp.9-16, 2004. ,
DOI : 10.1145/1015330.1015343
URL : https://hal.archives-ouvertes.fr/inria-00548549
Recovering 3D human pose from monocular images, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.1, pp.44-58, 2006. ,
DOI : 10.1109/TPAMI.2006.21
URL : https://hal.archives-ouvertes.fr/inria-00548619
Geometrically Constrained Robust Time Delay Estimation Using Non-coplanar Microphone Arrays, Proceeding of the 20th European Signal Processing Conference (EUSIPCO), pp.1309-1313, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00768763
RAVEL: an annotated corpus for training robots with audiovisual abilities, Journal on Multimodal User Interfaces, vol.24, issue.2, pp.79-91, 2013. ,
DOI : 10.1007/s12193-012-0111-y
URL : https://hal.archives-ouvertes.fr/hal-00720734
Boosted Mixture of Experts: An Ensemble Learning Scheme, Neural Computation, vol.5, issue.2, pp.483-497, 1999. ,
DOI : 10.1016/S0893-6080(05)80023-1
A Sensorimotor Approach to Sound Localization, REFERENCES [Bach 05, pp.603-635, 2005. ,
DOI : 10.1523/JNEUROSCI.0199-04.2004
The variational Bayesian EM Algorithm for incomplete data: with application to scoring graphical model structures, Bayesian Statistics, pp.453-464, 2003. ,
Laplacian Eigenmaps for Dimensionality Reduction and Data Representation, Neural Computation, vol.15, issue.6, pp.1373-1396, 2003. ,
DOI : 10.1126/science.290.5500.2319
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.131.3745
Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling, Latent Variable Analysis and Signal Separation, pp.106-113, 2010. ,
DOI : 10.1007/978-3-642-15995-4_14
Retrieval of Mars surface physical properties from OMEGA hyperspectral images using regularized sliced inverse regression, Journal of Geophysical Research, vol.20, issue.2, p.6005, 2009. ,
DOI : 10.1029/2008JE003171
URL : https://hal.archives-ouvertes.fr/inria-00276116
Observatoire pour la Minéralogie, l'Eau, les Glaces et l'Activité, in Mars Express: The Scientific Payload, p.3749, 2004. ,
GTM: The generative topographic mapping, Neural computation, vol.10, issue.1, pp.215-234, 1998. ,
Intrinsic dimension estimation by maximum likelihood in isotropic probabilistic PCA, Pattern Recognition Letters, vol.32, issue.14, pp.1706-1713, 2011. ,
DOI : 10.1016/j.patrec.2011.07.017
URL : https://hal.archives-ouvertes.fr/hal-00440372
A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics, IEEE Transactions on Speech and Audio Processing, vol.13, issue.1, pp.120-134, 2005. ,
DOI : 10.1109/TSA.2004.838775
The handbook of multisensory processes, 2004. ,
A Matlab simulation of " shoebox " room acoustics for use in research and teaching, Computing and Information Systems, vol.9, issue.3, p.48, 2005. ,
Blind beamforming for non-gaussian signals, IEE Proceedings F (Radar and Signal Processing, pp.362-370, 1993. ,
DOI : 10.1049/ip-f-2.1993.0054
Active-speaker detection and localization with microphones and cameras embedded into a robotic head, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2013. ,
DOI : 10.1109/HUMANOIDS.2013.7029977
Active-speaker detection and localization with microphones and cameras embedded into a robotic head, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2013. ,
DOI : 10.1109/HUMANOIDS.2013.7029977
A classification EM algorithm for clustering and two stochastic versions, Computational Statistics & Data Analysis, vol.14, issue.3, pp.315-332, 1992. ,
DOI : 10.1016/0167-9473(92)90042-E
URL : https://hal.archives-ouvertes.fr/inria-00075196
Some Experiments on the Recognition of Speech, with One and with Two Ears, The Journal of the Acoustical Society of America, vol.25, issue.5, pp.975-979, 1953. ,
DOI : 10.1121/1.1907229
Handbook of Blind Source Separation, Independent Component Analysis and Applications, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00460653
Fisher Lecture: Dimension Reduction in Regression, Statistical Science, vol.22, issue.1, pp.1-26, 2007. ,
DOI : 10.1214/088342306000000682
Mixtures of linear regressions, Computational Statistics & Data Analysis, vol.8, issue.3, pp.227-245, 1989. ,
DOI : 10.1016/0167-9473(89)90043-1
Learning the Direction of a Sound Source Using Head Motions and Spectral Features, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00564708
2D sound-source localization on the binaural manifold, 2012 IEEE International Workshop on Machine Learning for Signal Processing, pp.1-6, 2012. ,
DOI : 10.1109/MLSP.2012.6349784
The cocktail party robot, Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction, HRI '12, pp.431-438, 2012. ,
DOI : 10.1145/2157689.2157834
Mapping Learning with Partially Latent Output, 2013. ,
Variational EM for binaural sound-source separation and localization, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013. ,
DOI : 10.1109/ICASSP.2013.6637612
Nature and composition of the icy terrains of the south pole of Mars from MEX OMEGA observations, 36th Lunar and Planetary Science ConferenceLunar and Planetary Science XXXVI), p.1734, 2005. ,
A Comprehensive Numerical Package for the Modeling of Mars Hyperspectral Images, 38th Lunar and Planetary Science ConferenceLunar and Planetary Science XXXVIII), p.1836, 2007. ,
Spatial covariance models for under-determined reverberant audio source separation, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.129-132, 2009. ,
DOI : 10.1109/ASPAA.2009.5346503
URL : https://hal.archives-ouvertes.fr/hal-00481529
Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010. ,
DOI : 10.1109/TASL.2010.2050716
URL : https://hal.archives-ouvertes.fr/inria-00435807
Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., pp.78-81, 2005. ,
DOI : 10.1109/ASPAA.2005.1540173
Joint Modelling of Confounding Factors and Prominent Genetic Regulators Provides Increased Accuracy in Genetical Genomics Studies, PLoS Computational Biology, vol.464, issue.1, p.1002330, 2012. ,
DOI : 10.1371/journal.pcbi.1002330.s014
The EM algorithm for mixtures of factor analyzers, 1996. ,
Is neocortex essentially multisensory? Trends in cognitive sciences, pp.278-285, 2006. ,
Dynamic Classifier Selection, Multiple Classifier Systems, pp.177-189, 2000. ,
DOI : 10.1007/3-540-45014-9_17
A mixture of experts network structure for modelling Doppler ultrasound blood flow signals, Computers in Biology and Medicine, vol.35, issue.7, pp.565-582, 2005. ,
DOI : 10.1016/j.compbiomed.2004.04.001
Mask estimation for missing data speech recognition based on statistics of binaural interaction, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1 ,
DOI : 10.1109/TSA.2005.860354
The Cocktail Party Problem, Neural Computation, vol.31, issue.2, pp.1875-1902, 2005. ,
DOI : 10.1016/0378-5955(91)90148-3
Movement-produced stimulation in the development of visually guided behavior., Journal of Comparative and Physiological Psychology, vol.56, issue.5, pp.872-876, 1963. ,
DOI : 10.1037/h0040546
Spectro-temporal factors in two-dimensional human sound localization, The Journal of the Acoustical Society of America, vol.103, issue.5, pp.2634-2648, 1998. ,
DOI : 10.1121/1.422784
Relearning sound localization with new ears, The Journal of the Acoustical Society of America, vol.105, issue.2, pp.417-421, 1998. ,
DOI : 10.1121/1.424942
Sound localization for humanoid robots building audio-motor maps based on the HRTF. CONTACT project report, Proceedings of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp.1170-1176, 2006. ,
Time series modeling via hierarchical mixtures, Statistica Sinica, vol.13, issue.4, pp.1097-1118, 2003. ,
Hierarchical Mixtures of Experts and the EM Algorithm, proc. ICASSP, pp.181-214, 1994. ,
DOI : 10.1214/aos/1176346060
Residual component analysis: Generalising PCA for more flexible inference in linear-Gaussian models, ICML, 2012. ,
Robotic Localization and Separation of Concurrent Sound Sources using Self-Splitting Competitive Learning, 2007 IEEE Symposium on Computational Intelligence in Image and Signal Processing, pp.340-345, 2007. ,
DOI : 10.1109/CIISP.2007.369192
Video-Aided Model-Based Source Separation in Real Reverberant Rooms, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.9, pp.1900-1912, 2013. ,
DOI : 10.1109/TASL.2013.2261814
Pixels that Sound, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.88-95, 2005. ,
DOI : 10.1109/CVPR.2005.274
2D Binaural Sound Localization: for Urban Search and Rescue Robotics, Mobile Robotics, pp.423-435, 2009. ,
DOI : 10.1142/9789814291279_0053
Probabilistic non-linear principal component analysis with Gaussian process latent variable models Multiple Reverberant Sound Localization Based on Rigorous Zero-Crossing-Based ITD Selection, The Journal of Machine Learning Research IEEE Signal Process. Lett, vol.6, issue.17 7, pp.1783-1816, 2005. ,
Sliced Inverse Regression for Dimension Reduction, Journal of the American Statistical Association, vol.13, issue.414, pp.316-327, 1991. ,
DOI : 10.1214/aos/1176345514
Azimuthal source localization using interaural coherence in a robotic dog: modeling and application, Robotica, vol.4, issue.07, pp.1013-1020, 2010. ,
DOI : 10.1121/1.1791872
An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments, Proc. NIPS, pp.953-960, 2007. ,
Modelbased expectation-maximization source separation and localization, IEEE Trans. Acoust., Speech, Signal Process, vol.18, issue.2, pp.382-394, 2010. ,
Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.6, pp.1071-1086, 2009. ,
DOI : 10.1109/TASL.2009.2016395
Sound Localization by Human Listeners, Annual Review of Psychology, vol.42, issue.1, pp.135-159, 1991. ,
DOI : 10.1146/annurev.ps.42.020191.001031
A source localization/separation/respatialization system based on unsupervised classification of interaural cues Partial least squares estimator for singleindex models, Proceedings of the International Conference on Digital Audio Effects, pp.233-238, 2000. ,
A sensorimotor account of vision and visual consciousness, Behavioral and Brain Sciences, vol.24, pp.939-1031, 2001. ,
Numerical study on source-distance dependency of head-related transfer functions, The Journal of the Acoustical Society of America, vol.125, issue.5, pp.3253-61, 2009. ,
DOI : 10.1121/1.3111860
Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation. Audio, Speech, and Language Processing, IEEE Transactions onPan S. J. Pan & Q. Yang. A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering, vol.18, issue.22, pp.550-563, 2010. ,
Geometric source separation: Merging convolutive source separation with geometric beamforming. Speech and Audio Processing, IEEE Transactions on, vol.10, issue.6, pp.352-362, 2002. ,
Bayesian Inference in Mixtures-of-Experts and Hierarchical Mixtures-of-Experts Models with an Application to Speech Recognition, Journal of the American Statistical Association, vol.82, issue.435, pp.953-960, 1996. ,
DOI : 10.1080/01621459.1996.10476965
The foundations of science; science and hypothesis , the value of science, science and method, REFERENCES [Qiao 09] Y. Qiao & N. Minematsu. Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques. In proc. ICASSP, pp.3913-3916, 1905. ,
Single-channel speech separation using soft mask filtering. Audio, Speech, and Language Processing, IEEE Transactions on, vol.15, issue.8, pp.2299-2310, 2007. ,
Speech segregation based on sound localization, The Journal of the Acoustical Society of America, vol.114, issue.4, pp.2236-2252, 2003. ,
DOI : 10.1121/1.1610463
Overview and Recent Advances in Partial Least Squares, Subspace, Latent Structure and Feature Selection, pp.34-51 ,
DOI : 10.1002/(SICI)1097-0193(1997)5:4<254::AID-HBM9>3.0.CO;2-2
One Microphone Source Separation, Advances in Neural Information Processing Systems, pp.793-799, 2000. ,
Online multimodal speaker detection for humanoid robots, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), 2012. ,
DOI : 10.1109/HUMANOIDS.2012.6651509
Think globally, fit locally: unsupervised learning of low dimensional manifolds, Journal of Machine Learning Research, vol.4, pp.119-155, 2003. ,
A Two-Stage Frequency-Domain Blind Source Separation Method for Underdetermined Convolutive Mixtures, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2007. ,
DOI : 10.1109/ASPAA.2007.4393012
Single-channel speech separation using sparse non-negative matrix factorization, 2006. ,
Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998. ,
DOI : 10.1007/BF02281970
Crossmodal binding through neural coherence: implications for multisensory processing, Trends in Neurosciences, vol.31, issue.8, pp.401-409, 2008. ,
DOI : 10.1016/j.tins.2008.05.002
Efficient blind separation of convolved sound mixtures, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics, p.4, 1997. ,
DOI : 10.1109/ASPAA.1997.625609
A sparse non-parametric approach for single channel separation of known sounds, Advances in Neural Information Processing Systems, pp.1705-1713, 2009. ,
A tutorial on support vector regression, Statistics and Computing, vol.14, issue.3, pp.199-222, 2004. ,
DOI : 10.1023/B:STCO.0000035301.49549.88
Supervised source localization using diffusion kernels, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.245-248, 2011. ,
DOI : 10.1109/ASPAA.2011.6082267
A Global Geometric Framework for Nonlinear Dimensionality Reduction, Science, vol.290, issue.5500, pp.2319-2323, 2000. ,
DOI : 10.1126/science.290.5500.2319
Multivariate Relevance Vector Machines for Tracking, European Conference on Computer Vision, pp.124-138, 2006. ,
DOI : 10.1007/11744078_10
Mixtures of Probabilistic Principal Component Analyzers, Neural Computation, vol.2, issue.1, pp.443-482, 1999. ,
DOI : 10.1007/BF00162527
Probabilistic Principal Component Analysis, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.61, issue.3, pp.611-622, 1999. ,
DOI : 10.1111/1467-9868.00196
Support vector method for function approximation, regression estimation, and signal processing Performance measurement in blind audio source separation, Advances in Neural Information Processing Systems 9 ? Proceedings of the 1996 Neural Information Processing Systems Conference, pp.281-287, 1996. ,
On the Use of Spatial Cues to Improve Binaural Source Separation, proc. DAFX, pp.209-213, 2003. ,
Spline models for observational data. Numeéro 59. Siam, 1990. ,
Video Assisted Speech Source Separation, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., p.425, 2005. ,
DOI : 10.1109/ICASSP.2005.1416331
Computational auditory scene analysis: Principles, algorithms and applications, 2006. ,
DOI : 10.1109/9780470043387
Gaussian Process Regression with Heteroscedastic or Non-Gaussian Residuals, Computing Research Repository, vol.abs, 1212. ,
MAP-based underdetermined blind source separation of convolutive mixtures by hierarchical clustering and L1-norm minimization, EURASIP Journal on Advances in Signal Processing, vol.2007, pp.1-12, 2007. ,
Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.5, pp.1503-1512, 2012. ,
DOI : 10.1109/TASL.2012.2183869
A review of learning with normal and altered sound-localization cues in human adults, International Journal of Audiology, vol.117, issue.sup1, pp.92-98, 2006. ,
DOI : 10.3109/01050398509045933
An Alternative Model for Mixtures of Experts, proc. NIPS, pp.633-640, 1995. ,
Blind separation of speech mixtures via time-frequency masking, IEEE Transactions on Signal Processing, vol.52, pp.1830-1847, 2004. ,
Principal manifolds and nonlinear dimensionality reduction via tangent space alignment, SIAM Journal on Scientific Computing, vol.26, issue.1, 2004. ,
Stochastic Global Optimization ,
DOI : 10.1007/978-3-642-04898-2_570