H. Akaike, A new look at the statistical model identification problem, IEEE Trans. on Automatic Control, issue.19, pp.716-723, 1974.

H. Attias, A variational Bayesian framework for graphical models, Neural Information Processing Systems (NIPS) Conference, 1999.

S. A. Berrani, L. Amsaleg, and P. Gros, Robust content-based image searches for copyright protection, Proceedings of the first ACM international workshop on Multimedia databases , MMDB 2003, pp.70-77, 2003.
DOI : 10.1145/951676.951690

C. Biernacki, G. Celeux, and G. Govaert, Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models, Computational Statistics & Data Analysis, vol.41, issue.3-4, pp.561-575, 2003.
DOI : 10.1016/S0167-9473(02)00163-9

C. Bishop, Neural networks for Pattern Recognition, 1995.

S. Boyd, A. Ghosh, B. Prabhakar, and D. Shah, Randomized gossip algorithms, IEEE Transactions on Information Theory, vol.52, issue.6, 2006.
DOI : 10.1109/TIT.2006.874516

F. Cozman, M. Cirelo, T. S. Huang, I. Cohen, and N. Sebe, Semisupervised learning of classifiers : theory, algorithms and their applications to human-computer interaction, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.26, issue.12, pp.1553-1567, 2004.

P. T. Eugster, R. Guerraoui, A. Kermarrec, and L. Massoulié, From epidemics to distributed computing, IEEE Computer, vol.37, issue.5, 2003.

R. Fablet, P. Bouthemy, and P. Perez, Non parametric motion characterization using causal probabilistic models for video indexing and retrieval

J. Goldberger and S. Roweis, Hierarchical clustering of a mixture model, Proc. of Neural Information Processing Systems (NIPS'2004), pp.505-512, 2004.

R. Hammoud and R. Mohr, Gaussian mixture densities for video object recognition, Proc. on Int. Conf. on Pattern Recognition (ICPR'2000), pp.71-75, 2000.

D. Kempe, A. Dobra, and J. Gehrke, Gossip-based computation of aggregate information, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings., 2003.
DOI : 10.1109/SFCS.2003.1238221

D. Mackay, Bayesian Interpolation, Neural Computation, vol.49, issue.3, pp.415-447, 1992.
DOI : 10.1093/comjnl/11.2.185

D. Milojicic, V. Kalogeraki, R. Lukose, L. Nagaraja, J. Pruyne et al., Peer-to-peer computing, 2002.

W. T. Muller, M. Eisenhardt, and A. Henrich, <title>Efficient content-based P2P image retrieval using peer content descriptions</title>, Internet Imaging V, pp.57-68, 2003.
DOI : 10.1117/12.531184

R. Nowak, Distributed EM algorithms for density estimation and clustering in sensor networks, IEEE Transactions on Signal Processing, vol.51, issue.8, 2003.
DOI : 10.1109/TSP.2003.814623

J. Ponce, M. Hebert, C. Schmid, and A. Zisserman, Towards category-level object recognition, 2006.
DOI : 10.1007/11957959

URL : https://hal.archives-ouvertes.fr/inria-00548614

D. Reynolds, Speaker identification and verification using Gaussian mixture speaker models, Speech Communication, vol.17, issue.1-2, pp.91-108, 1995.
DOI : 10.1016/0167-6393(95)00009-D

C. Schmid, Weakly Supervised Learning of Visual Models and Its Application to Content-Based Retrieval, International Journal of Computer Vision, vol.56, issue.1/2, pp.7-16, 2004.
DOI : 10.1023/B:VISI.0000004829.38247.b0

URL : https://hal.archives-ouvertes.fr/inria-00548553

C. Tang, S. Dwarkadas, and Z. Xu, On scaling latent semantic indexing for large peer-to-peer systems, Proceedings of the 27th annual international conference on Research and development in information retrieval , SIGIR '04, pp.145-153, 2004.
DOI : 10.1145/1008992.1009014

J. J. Verbeek, J. R. Nunnink, and N. Vlassis, Accelerated EM-based clustering of large datasets, Data Mining and Knowledge Discovery, 2006.

L. Xie and P. Perez, Slightly supervised learning of part-based appearance models, Proc. of IEEE Workshop of learning in computer vision and pattern recognition, 2004.

M. Ben, G. Gravier, and F. Bimbot, Enhancing the robustness of bayesian methods for text-independent automatic speaker verication, Chapitre 4. Classification de données multimédia Odyssey'04 Speaker and Language Recognition Workshop, p.3439, 2004.

S. Berrani, L. Amsaleg, and P. Gros, Robust content-based image searches for copyright protection, Proceedings of the first ACM international workshop on Multimedia databases , MMDB 2003, p.7077, 2003.
DOI : 10.1145/951676.951690

F. Bimbot, J. Bonastre, C. Fredouille, G. Gravier, I. Magrin-chagnolleau et al., A tutorial on text-independent speaker verication, EURASIP Journal on Applied Signal Processing, vol.4, issue.4, p.430451, 2004.

C. Bishop, Neural networks for Pattern Recognition, 1995.

M. Chen, M. Shao, and J. Ibrahim, Monte Carlo Methods in Bayesian Computation, 2005.
DOI : 10.1007/978-1-4612-1276-8

J. Goldberger and S. Roweis, Hierarchical clustering of a mixture model, Proc. of Neural Information Processing Systems, pp.505-512, 2004.

S. Julier, A general method for approximating a non linear transformation of probability distributions, 1996.

Y. Mami, D. Charlet, and . Septembre, Speaker identication by location in an optimal space of anchor models, International Conferences on Spoken Language Processing, p.13331336, 2002.

M. Nishida and Y. Ariki, Real time speaker indexing based on subspace method -application to tv news articles and debate, International Conference on Spoken Language Processing, p.13471350, 1998.

G. Schwarz, Estimation the dimension of a model, Annals of statistics, vol.6, p.461464, 1978.

D. Sturim, D. Reynolds, D. Singer, and E. Campbell, Speaker indexing in large audio databases using anchor models, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), p.429432, 2001.
DOI : 10.1109/ICASSP.2001.940859

V. Upendra, J. Navratil, G. N. Ramaswamy, and S. Maes, Very large population text-independant speaker identication using transformation enhanced multi-grained models, IEEE International Conference on Acoustics , Speech, and Signal Processing (ICASSP '01). Salt Lake City, p.461464, 2001.

P. Zezula, G. Amato, V. Dohnal, and M. Batko, Similarity Search -The Metric Space Approach, Advances in Database Systems, vol.32, 2006.

B. Zhou and J. Hansen, Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation, International Conference on Spoken Language Processing, p.554564, 2002.

C. Bishop, Pattern recognition and machine learning, Information science and statistics, 2006.

C. Bishop and M. Svensen, Robust Bayesian mixture modelling, 'Proceedings Twelfth European Symposium on Artificial Neural Networks', pp.69-74, 2004.

F. Cozman, M. Cirelo, T. Huang, I. Cohen, and N. Sebe, Semisupervised learning of classifiers : theory, algorithms and their applications to human-computer interaction, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.26, issue.12, pp.1553-1567, 2004.

R. Donati and J. Le-cadre, Target motion analysis and track association with a network of proximity sensors, Information Fusion, vol.7, issue.3, pp.285-303, 2006.
DOI : 10.1016/j.inffus.2005.02.005

P. Eugster, R. Guerraoui, and A. Kermarrec, From epidemics to distributed computing, IEEE Computer, vol.37, issue.5, 2003.

R. Fablet, P. Bouthemy, and P. Perez, Nonparametric motion characterization using causal probabilistic models for video indexing and retrieval, IEEE Transactions on Image Processing, vol.11, issue.4, pp.393-407, 2001.
DOI : 10.1109/TIP.2002.999674

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.77.1078

M. Ferecatu, N. Boujemaa, and M. Crucianu, Hybrid visual and conceptual image representation within active relevance feedback context, Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval , MIR '05, pp.6-12, 2005.
DOI : 10.1145/1101826.1101860

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.161.1401

U. Gargi, Y. Deng, and D. R. Tretter, Managing and searching personal photo collections, Storage and Retrieval for Media Databases 2003, 2002.
DOI : 10.1117/12.476239

E. Giannopoulos, R. Streit, and P. Swaszek, Multi-target track segment bearings-only association and ranging, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136), 1997.
DOI : 10.1109/ACSSC.1997.679121

J. Goldberger and H. Aronowitz, A distance measure between gmms based on the unscented transform and its application to speaker recognition, Proc. of Interspeech'2005 conference, 2005.

A. Graham, H. Garcia-molina, A. Paepcke, and T. Winograd, Time as essence for photo browsing through personal digital libraries, Proceedings of the second ACM/IEEE-CS joint conference on Digital libraries , JCDL '02, pp.326-335, 2002.
DOI : 10.1145/544220.544301

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.118.5196

R. Hammoud and R. Mohr, Gaussian mixture densities for video object recognition, Proc. on Int. Conf. on Pattern Recognition, pp.71-75, 2000.

R. Hayek, G. Raschia, P. Valduriez, and N. Mouaddib, Design of PeerSum: A Summary Service for P2P Applications, Lecture Notes in Computer Science, vol.4459, pp.13-26, 2007.
DOI : 10.1007/978-3-540-72360-8_2

URL : https://hal.archives-ouvertes.fr/hal-00376958

C. Hsu and C. Lin, A comparison of methods for multiclass support vector machines, IEEE Transactions on Neural Networks, vol.13, issue.2, pp.415-425, 2002.

D. Kempe, A. Dobra, and J. Gehrke, Gossip-based computation of aggregate information , in 'IEEE symp. on foundations of computer science, 2003.

L. Kennedy, M. Naaman, S. Ahern, R. Nair, and T. Rattenbury, How flickr helps us make sense of the world, Proceedings of the 15th international conference on Multimedia , MULTIMEDIA '07, 2007.
DOI : 10.1145/1291233.1291384

A. Kokaram, N. Rea, R. Dahyot, M. Tekalp, P. Bouthemy et al., Browsing sports video: trends in sports-related indexing and retrieval work, IEEE Signal Processing Magazine, vol.23, issue.2, pp.47-58, 2006.
DOI : 10.1109/MSP.2006.1621448

URL : https://hal.archives-ouvertes.fr/inria-00568189

J. A. Lasserre, C. M. Bishop, and T. P. Minka, Principled Hybrids of Generative and Discriminative Models, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.17-22, 2006.
DOI : 10.1109/CVPR.2006.227

Y. Mami and D. Charlet, Speaker identification by location in an optimal space of anchor models, 'International Conferences on Spoken Language Processing, pp.1333-1336, 2002.

J. Manjarrez, J. Martinez, and P. Valduriez, A Data Allocation Method for Efficient Content-Based Retrieval in Parallel Multimedia Databases, 'International Symposium on Parallel and Distributed Processing and Applications', 2007.
DOI : 10.1007/978-3-540-74767-3_30

URL : https://hal.archives-ouvertes.fr/hal-00429428

D. Milojicic, V. Kalogeraki, R. Lukose, L. Nagaraja, J. Pruyne et al., Peer-to-peer computing, 2002.

R. Neal and G. Hinton, Learning in graphical models Kluwer academic publishers, chapter A view of the EM algorithm that justifies incremental, sparse and other variants, pp.355-368, 1998.

M. Nilsback and A. Zisserman, A Visual Vocabulary for Flower Classification, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.42

R. Nowak, Distributed EM algorithms for density estimation and clustering in sensor networks, IEEE Transactions on Signal Processing, vol.51, issue.8, 2003.
DOI : 10.1109/TSP.2003.814623

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

J. C. Platt and B. A. Czerwinski, PhotoTOC: automatic clustering for browsing personal photographs, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, 2002.
DOI : 10.1109/ICICS.2003.1292402

J. Ponce, M. Hebert, C. Schmid, and A. Zisserman, Towards category-level object recognition, 2006.
DOI : 10.1007/11957959

URL : https://hal.archives-ouvertes.fr/inria-00548614

S. Poullot, O. Buisson, and M. Crucianu, Z-grid-based probabilistic retrieval for scaling up content-based copy detection, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, 2007.
DOI : 10.1145/1282280.1282334

URL : https://hal.archives-ouvertes.fr/hal-01125285

D. Ramanan, D. Forsyth, and K. Barnard, Building models of animals from video, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.8, pp.1319-1334, 2006.
DOI : 10.1109/TPAMI.2006.155

D. Reynolds, Speaker identification and verification using gaussian speaker models', Speech communication 17, pp.91-108, 1995.
DOI : 10.1016/0167-6393(95)00009-d

K. Rodden, How do people manage their digital photographs?, Proceedings of the conference on Human factors in computing systems , CHI '03, pp.409-416, 2003.
DOI : 10.1145/642611.642682

H. Rowley, S. Baluja, and T. Kanade, Neural network-based face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.1, pp.23-38, 1998.
DOI : 10.1109/34.655647

N. Santoro, Design and Analysis of Distributed Algorithms, Wiley Series on Parallel and Distributed Computing), 2006.
DOI : 10.1002/0470072644

C. Schmid, Weakly supervised learning of visual models and its application to contentbased retrieval', Int, Journal of Computer Vision, vol.1, issue.56, pp.7-16, 2004.

S. Sclaroff, L. Cascia, M. Sethi, S. Taycher, and L. , Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web, Computer Vision and Image Understanding, vol.75, issue.1-2, pp.86-98, 1999.
DOI : 10.1006/cviu.1999.0765

D. Sturim, D. Reynolds, D. Singer, and E. Campbell, Speaker indexing in large audio databases using anchor models, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.429-432, 2001.
DOI : 10.1109/ICASSP.2001.940859

M. Tipping, Sparse bayesian learning and the relevance vector machine, Journal of Machine Learning Research, vol.1, pp.211-244, 2002.

M. Tipping and C. Bishop, Mixtures of Probabilistic Principal Component Analyzers, Neural Computation, vol.2, issue.1, pp.443-482, 1999.
DOI : 10.1007/BF00162527

M. E. Tipping, Sparse bayesian learning and the relevance vector machine, Journal of Machine Learning Research, vol.1, pp.211-244, 2001.

N. Vasconcelos and A. Lippman, Learning mixture hierarchies, 'Neural Information Processing Systems (NIPS) Conference', 1998.

H. Wactlar, T. Kanade, M. Smith, &. S. Stevens, and . Smith, Intelligent access to digital video: Informedia project, Computer, vol.29, issue.5, 1996.
DOI : 10.1109/2.493456

P. Wellner, M. Flynn, and M. Guillemot, Browsing recordings of multi-party interactions in ambient intelligent environments, 'ACM CHI'2004 Conference (Computer-Human Interaction, pp.120-126, 2004.

O. Williams, A. Blake, and R. Cipolla, Sparse Bayesian learning for efficient visual tracking, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.8, pp.1292-1304, 2005.
DOI : 10.1109/TPAMI.2005.167