.. Analyse-qualitative-des-déformations, 105 3.2.5.1 Description de l'approche, p.107

]. Bordag, G. Heyer, and U. Quasthoff, Small Worlds of Concepts and Other Principles of Semantic Search, IICS 2003 -Proceedings of the 3rd international workshop in innovative internet community systems, 2003.
DOI : 10.1007/978-3-540-39884-4_2

A. Borodin, G. O. Roberts, J. S. Rosenthal, and P. Tsaparas, Link analysis ranking: algorithms, theory, and experiments, ACM Transactions on Internet Technology, vol.5, issue.1, pp.231-297, 2005.
DOI : 10.1145/1052934.1052942

L. Brin and . Page, The anatomy of a large-scale hypertextual Web search engine, Computer Networks and ISDN Systems, vol.30, issue.1-7, pp.107-117, 1998.
DOI : 10.1016/S0169-7552(98)00110-X

L. Sergey-brin and . Page, The anatomy of a large-scale hypertextual web search engine Computer networks and ISDN systems, pp.107-117, 1998.

A. Broder, S. Glassman, M. Manasse, and G. Zweig, Syntactic clustering of the Web, Computer Networks and ISDN Systems, vol.29, issue.8-13, pp.1157-1166, 1997.
DOI : 10.1016/S0169-7552(97)00031-7

A. Broder, R. Kumar, F. Maghoul, P. Raghavan, R. Sridhar-rajagopalan et al., Graph structure in the Web, Computer networks : the international journal of computer and telecommunications networking, pp.309-320, 2000.
DOI : 10.1016/S1389-1286(00)00083-9

]. Brunet, Qui lemmatise dilemme attise, Lexicometrica, vol.2, issue.19, 2001.

]. Brunet, Peut-on mesurer la distance entre deux textes ? Corpus, pp.1-19, 2003.

]. Brunet, Les séquences (suite), JADT 2008 -Proceedings of the 9th international conference on the statistical analysis of textual data, pp.253-266, 2008.

]. Caillet, J. Pessiot, and P. Gallinari, Unsupervised learning with term clustering for thematic segmentation of texts, Actes de la 7è conférence en recherche d'information assistée par ordinateur, pp.1-11, 2004.

]. S. Carrière and R. Kazman, WebQuery: searching and visualizing the Web through connectivity, Computer Networks and ISDN Systems, vol.29, issue.8-13, pp.1257-1267, 1997.
DOI : 10.1016/S0169-7552(97)00062-7

]. S. Carrière and R. Kazman, WebQuery: searching and visualizing the Web through connectivity, Computer Networks and ISDN Systems, vol.29, issue.8-13, pp.1257-1267, 1997.
DOI : 10.1016/S0169-7552(97)00062-7

A. Meeyoung-cha, K. Mislove, and . Gummadi, A measurementdriven analysis of information propagation in the Flickr social network, WWW'09 -Proceedings of the 18th international conference on world wide web, pp.721-730, 2009.

M. Cha, H. Haddadi, F. Benevenuto, and K. Gummadi, Measuring user influence in twitter : the million follower fallacy, ICWSM 2010 -Proceedings of the 4th international AAAI conference on weblogs and social media, pp.44-46, 2010.

B. Soumen-chakrabarti, D. Dom, R. Gibson, P. Kumar, . Raghavan et al., Experiments in topic distillation, 1998.

B. Soumen-chakrabarti, P. Dom, and . Indyk, Enhanced hypertext categorization using hyperlinks, Proceedings of the 1998 ACM SIGMOD international conference on management of data, 1998.

]. Clauset, M. Newman, and C. Moore, Finding community structure in very large networks, Physical Review E, vol.70, issue.6, 2004.
DOI : 10.1103/PhysRevE.70.066111

G. Jack, J. Conrad, F. Leidner, and . Schilder, Professional credibility : authority on the web, WICOW'08 -Proceeding of the 2nd ACM workshop on information credibility on the web, 2008.

D. Cutting, D. K. Pedersen, and J. Tukey, Scatter/Gather: a cluster-based approach to browsing large document collections, Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '92, pp.318-329, 1992.
DOI : 10.1145/133160.133214

N. Dai and B. D. Davison, Freshness matters, Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, SIGIR '10, 2010.
DOI : 10.1145/1835449.1835471

F. Damak, K. Pinel-sauvagnat, G. Cabanac, and M. Boughanem, Recherche de microblogs : quelles sources d'évidences pour raffiner les résultats des moteurs usuels de RI ? In CORIA 2012 -Actes de la 9ème conférence en recherche d'information et applications, p.2012, 2012.

D. Brian, A. Davison, K. Gerasoulis, Y. Kleisouris, H. Lu et al., DiscoWeb : applying link analysis to web search, Proceedings of the 8th World Wide Web Conference, 1999.

D. Brian and . Davison, Topical locality in the Web, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval -SIGIR '00, pp.272-279, 2000.

D. Brian and . Davison, Topical locality in the web, SIGIR'00 -Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval, pp.272-279, 2000.

D. Brian and . Davison, Unifying text and link analysis, IJCAI'03 - Workshop on text mining and link analysis, 2003.

S. Scott-deerwester, G. Dumais, T. Furnas, R. Landauer, and . Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, pp.391-407, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

S. Dias, J. Guilloré, . Gabriel-pereira, and . Lopes, Extraction automatique d'associations textuelles à partir de corpora non traités, JADT 2000 -Proceedings of the 5th international conference on the statistical analysis of textual data, 2000.

F. Michelangelo-diligenti, S. Coetzee, L. Lawrence, M. Giles, and . Gori, Focused crawling using context graphs, VLDB'00 -Proceedings of the 26th international conference on very large data bases, pp.527-534, 2000.

]. Donetti and M. Muñoz, Improved spectral algorithm for the detection of network communities, AIP Conference Proceedings, pp.1-2, 2005.
DOI : 10.1063/1.2008598

]. Dorow, D. Widdows, K. Ling, J. Eckmann, D. Sergi et al., Using curvature and Markov clustering in graphs for lexical acquisition and word sense discrimination, 2005.

Y. Duan, L. Jiang, T. Qin, M. Zhou, and H. Shum, An empirical study on learning to rank of tweets, COLING'10 -Proceedings of the 23rd international conference on computational linguistics, pp.295-303, 2010.

N. Dugué and A. Perez, Les capitalistes sociaux sur Twitter : détection via des mesures de similarité, Actes de EGC'2013 (Extraction et Gestion des Connaissances), pp.329-334, 2012.

]. Easley and J. Kleinberg, Networks, crowds, and markets : reasoning about a highly connected world, pp.36-43, 2010.
DOI : 10.1017/CBO9780511761942

M. Levent-ertoz, V. Steinbach, and . Kumar, Finding topics in collections of documents : a shared nearest neighbor approach, Clustering and information retrieval, pp.83-104, 2003.

G. Abel, T. Palla, and . Vicsek, Weighted network modules, New journal of physics, vol.9, issue.6, 2007.

]. Ferret, B. Grau, and N. Masson, Thematic segmentation of texts : two methods for two kinds of texts, Actes de ACL- COLING'98, pp.392-396, 1998.

S. Fortunato, V. Latora, and M. Marchiori, Method to find community structures based on information centrality, Physical Review E, vol.70, issue.5, 2004.
DOI : 10.1103/PhysRevE.70.056104

E. William-frawley, S. Eschenroeder, T. Mills, and . Nguyen, The expression of modality, 2006.

F. Fukumoto and Y. Suzuki, Event tracking based on domain dependency, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '00, p.57, 2000.
DOI : 10.1145/345508.345548

G. Furnas, S. Deerwester, S. Dumais, T. Landauer, R. Harshman et al., Information retrieval using a singular value decomposition model of latent semantic structure, Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '88, pp.465-480, 1988.
DOI : 10.1145/62437.62487

]. Gale, K. W. Church, and D. Yarowsky, A method for disambiguating word senses in a large corpus, Computers and the Humanities, vol.9, issue.5-6, pp.415-439, 1992.
DOI : 10.1007/BF00136984

]. Garfield, Citation Analysis as a Tool in Journal Evaluation: Journals can be ranked by frequency and impact of citations for science policy studies, Science, vol.178, issue.4060, pp.471-479, 1972.
DOI : 10.1126/science.178.4060.471

A. Geffroy, P. Lafon, and M. Tournier, L'indexation minimale -Plaidoyer pour une non-lemmatisationProblèmes et méthodes de l'indexation maximale, Communication au colloque sur l'analyse des corpus linguistiques, 1973.

F. Geraci, M. Pellegrini, M. Maggini, and F. Sebastiani, Cluster Generation and Cluster Labelling for Web Snippets: A Fast and Accurate Hierarchical Solution, SPIRE'06 -Proceedings of the 13th international conference on string processing and information retrieval, pp.25-36, 2006.
DOI : 10.1007/11880561_3

J. David-gibson, P. Kleinberg, and . Raghavan, Inferring web communities from link topology, HYPERTEXT'98 -Proceedings of the 9th ACM conference on hypertext and hypermedia, pp.225-234, 1998.

M. Girvan and M. Newman, Community structure in social and biological networks, Proceedings of the national academy of sciences, pp.7821-7826, 2002.
DOI : 10.1073/pnas.122653799

B. Saptarshi-gosh, F. Viswanath, N. Kooti, G. Kumar-sharma, F. Korlam et al., Understanding and combating link farming in the Twitter social network, Proceedings of the 21st international conference on World Wide Web (WWW'12), pp.61-70, 2012.

A. Goyal, F. Bonchi, and L. Lakshmanan, Learning influence probabilities in social networks, Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pp.241-250, 2010.
DOI : 10.1145/1718487.1718518

G. Grefenstette, Corpus derived first, second and third-order word affinities, Proceedings of EURALEX'94, pp.279-290, 1994.

]. Gregory, Finding Overlapping Communities Using Disjoint Community Detection Algorithms, Complex networks, vol.207, pp.47-61, 2009.
DOI : 10.1007/978-3-642-01206-8_5

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.5557

]. Gregory, Finding overlapping communities in networks by label propagation, New Journal of Physics, vol.12, issue.10, 2010.
DOI : 10.1088/1367-2630/12/10/103018

R. Guimerà and L. Amaral, Functional cartography of complex metabolic networks, Nature, vol.220, issue.7028, pp.895-900, 2005.
DOI : 10.1038/35075138

H. Zoltan-gyongyi, J. Garcia-molina, and . Pedersen, Combating web spam with TrustRank, VLDB'04 -Proceedings of the 30th international conference on very large data bases, pp.576-587

]. Habert, A. Nazarenko, and A. Salem, Les linguistiques de corpus, 1997.
URL : https://hal.archives-ouvertes.fr/hal-00619268

]. Habert, Des corpus représentatifs : de quoi, pour quoi, comment ? Cahiers de l'université de Perpignan, pp.11-58, 2000.

]. Habert and P. Zweigenbaum, 8. Contextual acquisition of information categories, pp.203-231, 2002.
DOI : 10.1075/cilt.229.14hab

D. Harel and Y. Koren, On clustering using random walks. Lecture notes in computer science, pp.18-41, 2001.
DOI : 10.1007/3-540-45294-x_3

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.19.6463

]. Hearst, TextTiling : segmenting text into multi-paragraph subtopic passages, Computational linguistics, vol.23, issue.1, 1997.
DOI : 10.3115/981732.981734

URL : http://arxiv.org/abs/cmp-lg/9406037

M. Henzinger, Hyperlink analysis on the world wide web, Proceedings of the sixteenth ACM conference on Hypertext and hypermedia , HYPERTEXT '05, pp.1-3, 2005.
DOI : 10.1145/1083356.1083357

M. Henzinger, Hyperlink analysis on the world wide web, Proceedings of the sixteenth ACM conference on Hypertext and hypermedia , HYPERTEXT '05, pp.1-3, 2005.
DOI : 10.1145/1083356.1083357

]. , H. , and P. Sharp, CLUSTAL : a package for performing multiple sequence alignment on a micro computer, Gene, vol.73, issue.1, pp.237-244, 1988.

A. Hotho, S. Staab, and G. Stumme, Wordnet improves text document clustering, SIGIR'03 -Semantic web workshop, 2003.

P. Bernardo-huberman, J. Pirolli, R. Pitkow, and . Lukose, Strong Regularities in World Wide Web Surfing, Science, vol.280, issue.5360, pp.95-97, 1998.
DOI : 10.1126/science.280.5360.95

]. Huberman, The laws of the web : patterns in the ecology of information, 2001.

N. Ide, J. Ide, and . Véronis, Word sense disambiguation : the state of the art, Computational linguistics, vol.24, pp.1-40, 1998.

A. Jackoway, H. Samet, and J. Sankaranarayanan, Identification of live news events using Twitter, Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN '11, pp.25-32, 2011.
DOI : 10.1145/2063212.2063224

F. Jacquenet, C. Largeron, and S. Chapaux, Veille technologique assistée par la fouille de textes. Revue des nouvelles technologies de l'information, pp.429-440, 2004.

]. R. Jarvis and E. A. Patrick, Clustering Using a Similarity Measure Based on Shared Near Neighbors, IEEE Transactions on Computers, vol.22, issue.11, pp.1025-1034, 1973.
DOI : 10.1109/T-C.1973.223640

A. Java, X. Song, T. Finin, and B. Tseng, Why we twitter, Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis , WebKDD/SNA-KDD '07, pp.56-65, 2007.
DOI : 10.1145/1348549.1348556

]. Katz, A new status index derived from sociometric analysis, Psychometrika, vol.13, issue.1, pp.39-43, 1953.
DOI : 10.1007/BF02289026

]. Katz, P. Lazarsfeld, and E. Roper, Personal influence : the part played by people in the flow of mass communications, 1955.

J. David-kempe, E. Kleinberg, and . Tardos, Maximizing the spread of influence through a social network, KDD'03 -Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, pp.137-146, 2003.

M. Jon and . Kleinberg, Authoritative sources in a hyperlinked environment, Journal of the ACM, vol.46, issue.36, pp.604-632, 1999.

M. Jon and . Kleinberg, Authoritative sources in a hyperlinked environment, Journal of the ACM, vol.46, issue.5, pp.604-632, 1999.

M. Jon and . Kleinberg, Hubs, authorities, and communities, ACM computing surveys, vol.31, issue.4es, p.5, 1999.

M. Jon, R. Kleinberg, P. Kumar, S. Raghavan, A. S. Rajagopalan et al., The web as a graph : measurements, models, and methods, COCOON'99 -Proceedings of the 5th annual international conference on computing and combinatorics, pp.1-17, 1999.

]. Kleinberg, The small-world phenomenon, Proceedings of the thirty-second annual ACM symposium on Theory of computing , STOC '00, pp.163-170, 2000.
DOI : 10.1145/335305.335325

T. Kovach and . Rosenstiel, Warp Speed : America in the Age of Mixed Media. The Century Foundation, 1999.

A. Kritikopoulos, M. Sideri, and I. Varlamis, BlogRank, Proceedings of the 2nd international workshop on Advanced architectures and algorithms for internet delivery and applications , AAA-IDEA '06, 2006.
DOI : 10.1145/1190183.1190193

]. Krovetz, Viewing morphology as an inference process, SIGIR'93 -Proceedings of the 16th annual international ACM SIGIR conference on research and development in information retrieval, pp.191-202, 1993.

]. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Trawling the Web for emerging cyber-communities, WWW'99 -Proceedings of the 8th international conference on world wide web, pp.1481-1493, 1999.
DOI : 10.1016/S1389-1286(99)00040-7

]. Kumar, P. Raghavan, . Sridhar-rajagopalan, A. Sivakumar, E. Tomkins et al., The Web as a graph, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '00, pp.1-10, 2000.
DOI : 10.1145/335168.335170

]. Kumar and J. Novak, On the Bursty Evolution of Blogspace, World Wide Web, vol.8, issue.2, pp.159-178, 2005.
DOI : 10.1007/s11280-004-4872-4

]. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Core algorithms in the CLEVER system, ACM Transactions on Internet Technology, vol.6, issue.2, pp.131-152, 2006.
DOI : 10.1145/1149121.1149123

]. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Core algorithms in the CLEVER system, ACM Transactions on Internet Technology, vol.6, issue.2, pp.131-152, 2006.
DOI : 10.1145/1149121.1149123

]. Kumpula, M. Kivelä, K. Kaski, and J. Saramäki, Sequential algorithm for fast clique percolation, Physical Review E, vol.78, issue.2, 2008.
DOI : 10.1103/PhysRevE.78.026109

O. Kurland and L. Lee, PageRank without hyperlinks, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '05, p.306, 2005.
DOI : 10.1145/1076034.1076087

O. Kurland and L. Lee, Respect my authority ! HITS without hyperlinks, utilizing cluster-based language models, SIGIR'06 -Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, p.83, 2006.

. Labbe, D. Labbe, and . Labbe, Que mesure la spécificité du vocabulaire ? Lexicometrica, 2001.

. Labbe, D. Labbe, and . Labbe, La distance intertextuelle, Corpus, vol.2, 2003.
URL : https://hal.archives-ouvertes.fr/halshs-00290974

]. Lafon, Analyse lexicom??trique et recherche des cooccurrences, Mots, vol.3, issue.1, pp.95-148, 1981.
DOI : 10.3406/mots.1981.1041

]. Lafon, Statistiques des localisations des formes d'un texte, Mots, vol.2, issue.1, pp.157-188, 1981.
DOI : 10.3406/mots.1981.1026

A. Lancichinetti, S. Fortunato, and F. Radicchi, Benchmark graphs for testing community detection algorithms, Physical Review E, vol.78, issue.4, 2008.
DOI : 10.1103/PhysRevE.78.046110

A. Lancichinetti, S. Fortunato, and J. Kertész, Detecting the overlapping and hierarchical community structure in complex networks, New Journal of Physics, vol.11, issue.3, 2009.
DOI : 10.1088/1367-2630/11/3/033015

A. Lauf, L. Khouas, and M. Valette, Calcul de l'autorité des pages Web au sein de leurs communautés respectives ? Propositions pour une contextualisation de l'information, IC 2011 -Atelier ExCoCo, 2011.

A. Lauf, M. Valette, and L. Khouas, Analyse du graphe des cooccurrents de deuxième ordre pour la classification non-supervisée de documents, JADT 2012 -Proceedings of the 12th international conference on the statistical analysis of textual data, pp.577-589, 2012.

A. Lauf, M. Valette, and L. Khouas, Analyzing variation patterns in quotes over time, CICLing'13 -Proceedings of the 14th International Conference on Computational Linguistics and interlligent text processing, p.2013, 2013.

L. Deuff, Olivier Le Deuff Autorité et pertinence vs popularité et influence : réseaux sociaux sur Internet et mutations institutionnelles, Séminaire des doctorants du Cersic, pp.21-26, 2006.

A. Leavitt, E. Burchard, D. Fisher, and S. Gilbert, The influentials : new approaches for analyzing influence on Twitter, pp.44-45, 2009.

]. Leblanc and W. Martinez, L'analyse contrastive des réseaux de cooccurrence, JADT 2006 -Proceedings of the 8th international conference on the statistical analysis of textual data, 2006.

L. Daniel and S. Seung, Learning the parts of objects by nonnegative matrix factorization, Nature, vol.401, issue.6755, pp.788-791, 1999.

R. Lee, D. Kitayama, and K. Sumiya, Web-based evidence excavation to explore the authenticity of local events, Proceeding of the 2nd ACM workshop on Information credibility on the web, WICOW '08, 2008.
DOI : 10.1145/1458527.1458543

]. R. Lempel and S. Moran, The stochastic approach for link-structure analysis (SALSA) and the TKC effect, Computer Networks, vol.33, issue.1-6, pp.387-401, 2000.
DOI : 10.1016/S1389-1286(00)00034-7

]. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins, Microscopic evolution of social networks, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD 08, 2008.
DOI : 10.1145/1401890.1401948

J. Leskovec, L. Backstrom, and J. Kleinberg, Meme-tracking and the dynamics of the news cycle, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '09, pp.497-506, 2009.
DOI : 10.1145/1557019.1557077

]. Leung, P. Hui, P. Lio, and J. Crowcroft, Towards real-time community detection in large networks, Physical Review E, vol.79, issue.6, 2009.
DOI : 10.1103/PhysRevE.79.066107

Y. Lin, H. Sundaram, Y. C. Tatemura, and B. Tseng, Discovery of blog communities based on mutual awareness, WWW'06 -Proceedings of the 3rd annual workshop on the weblogging ecosystems : aggregation, analysis and dynamics, 2006.

]. Luong, E. Brunet, D. Longrée, D. Mayaffre, S. Mellet et al., La cooccurrence, une relation asymétrique ?, JADT 2010 -Proceedings of the 10th international conference on the statistical analysis of textual data, pp.2010-79, 2010.

S. Macskassy, A. Banerjee, B. D. Davison, and H. Hirsh, Human performance on clustering web pages : a preliminary study, KDD'98 -Proceedings of the 4th ACM SIGKDD international conference on knowledge discovery and data mining, 1998.

]. Martinez, Mise en évidence de rapports synonymiques par la méthode des cooccurrences, JADT 2000 -Proceedings of the 5th international conference on the statistical analysis of textual data, 2000.

]. Martinez, Contribution à une méthodologie de l'analyse des cooccurrences lexicales multiples dans les corpus textuels, 2003.

]. Martinez, Répulsions lexicales : expériences autour de la cooccurrence négative, JADT 2008 -Proceedings of the 9th international conference on the statistical analysis of textual data, 2008.

]. Mayaffre, De l'occurrence à l'isotopie -Les co-occurrences en lexicométrie. Syntaxe et sémantique -Textes, documents numériques, corpus. Pour une science des textes instrumentée, pp.53-72, 2008.

]. Mayaffre, patrie" cooccurrent dans le discours de Nicolas Sarkozy Etude de cas et réflexion théorique sur la co-occurrence, JADT 2008 -Proceedings of the 9th international conference on the statistical analysis of textual data, pp.79-82, 2008.

]. , M. , and C. Zhai, Discovering evolutionary theme patterns from text : an exploration of temporal text mining, KDD'05 -Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery and data mining, pp.198-207, 2005.

]. Mei, X. Shen, and C. Zhai, Automatic labeling of multinomial topic models, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '07, pp.490-499, 2007.
DOI : 10.1145/1281192.1281246

E. Meyer, Sven Meyer zu Eissen. The suffix tree document model revisited, I-KNOW'05 -Proceedings of the 5th international conference on knowledge management, 2005.

]. Meyer, R. Schaneveldt, and M. Ruddy, Loci of contextual effects on visual word-recognition, Attention and performance, pp.98-118, 1975.

]. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

]. Mori, T. Miura, and I. Shioya, Topic Detection and Tracking for News Web Pages, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06), 2006.
DOI : 10.1109/WI.2006.171

]. , M. , and G. Hirst, Lexical cohesion computed by thesaural relations as an indicator of the structure of text, Computational linguistics, vol.17, issue.1, pp.21-48, 1991.

]. , M. , and G. Hirst, Non-classical lexical semantic relations, HLT-NAACL 2004 -Proceedings of the computational lexical semantics workshop, pp.46-51, 2004.

A. Marc, H. Najork, M. J. Zaragoza, and . Taylor, Hits on the web : how does it compare ?, SIGIR'07 -Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, p.471, 2007.

M. Najork, S. Gollapudi, and R. Panigrahy, Less is more, Proceedings of the Second ACM International Conference on Web Search and Data Mining, WSDM '09, p.242, 2009.
DOI : 10.1145/1498759.1498832

Y. Andrew, A. X. Ng, . Zheng, I. Michael, and . Jordan, Stable algorithms for link analysis, SIGIR'01 -Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval, pp.258-266, 2001.

L. Nie, B. D. Davison, and X. Qi, Topical link analysis for web search, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '06, pp.91-118, 2006.
DOI : 10.1145/1148170.1148189

B. D. Lan-nie, B. Davison, and . Wu, From whence does your authority come ? : utilizing community relevance in ranking, pp.1421-1426, 2007.

B. D. Lan-nie, B. Davison, and . Wu, From whence does your authority come ? Utilizing community relevance in ranking, AAAI'07 - Proceedings of the 22nd national conference on artificial intelligence, pp.1421-1426, 2007.

B. Lan-nie, B. D. Wu, and . Davison, Incorporating trust into web search. Rapport technique, 2007.

L. Nie and B. D. Davison, Separate and inequal, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, p.443, 2008.
DOI : 10.1145/1390334.1390411

O. Brendan, R. Connor, B. Balasubramanyan, N. Routledge, and . Smith, From tweets to polls : linking text sentiment to public opinion time series, ICWSM 2010 -Proceedings of the 4th international AAAI conference on weblogs and social media, 2010.

N. Ohsawa, M. Benson, and . Yachida, KeyGraph: automatic indexing by co-occurrence graph based on building construction metaphor, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-, 1998.
DOI : 10.1109/ADL.1998.670375

E. Omodei, T. Poibeau, and J. Cointet, Multilevel modeling of quotation families morphogenesis, Proceedings of the 2012 ASE/IEEE international conference on social computing, pp.101-106, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00783699

]. Page, S. Brin, M. Rajeev, and T. Winograd, The PageRank citation ranking : bringing order to the web, 1999.

A. Pal, D. Singh-tomar, and S. Shrivastava, Effective focused crawling based on content and link structure analysis. International journal of computer science and information security (IJCSIS), 2009.

]. Palermo and J. Jenkins, Word associaiton norms, 1964.

I. Gergely-palla and . Derényi, Uncovering the overlapping community structure of complex networks in nature and society, Nature, vol.387, issue.7043, pp.814-818, 2005.
DOI : 10.1038/nature03248

A. Gergely-palla, T. Barabasi, and . Vicsek, Quantifying social group evolution, Nature, vol.21, issue.7136, pp.664-667, 2007.
DOI : 10.1038/nature05670

]. Palmer, P. Gibbons, and C. Faloutsos, ANF, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '02, pp.81-90, 2002.
DOI : 10.1145/775047.775059

E. Pariser, The filter bubble : what the internet is hiding from you. Penguin édition, pp.2012-2037, 2012.
DOI : 10.3139/9783446431164

P. Ronan-pichon and . Sébillot, Différencier les sens des mots à l'aide du thème et du contexte de leurs occurrences : une expérience, TALN 1999 -Actes de la 6ème conférence sur le Traitement Automatique des Langues Naturelles, Cargèse, 1999.

P. Ronan-pichon and . Sébillot, From corpus to lexicon : from contexts to semantic features, Practical applications in language corpora (PALC'99), pp.375-389, 2000.

A. Pons-porrata, R. Berlanga-llavori, and J. Ruiz-shulcloper, Building a Hierarchy of Events and Topics for Newspaper Digital Libraries, Proceedings of the 25th European conference on IR research, pp.588-596, 2003.
DOI : 10.1007/3-540-36618-0_46

]. Poulard, T. Waszak, N. Hernandez, and P. Bellot, Repérage de citations, classification des styles de discours rapporté et identification des constituants citationnels en écrits journalistiques, TALN 2008 -Actes de la 15ème conférence sur le traitement automatique des langues naturelles, 2008.

F. Poulard, S. Afantenos, and N. Hernandez, Nouvelles considérations pour la détection de réutilisation de texte, TALN 2009 -Actes de la 16ème conférence sur le traitement automatique des langues naturelles, 2009.

M. Claverie, T. Beigbeder, and . Lafouge, Clusterisation du web en vue d'extraction de corpus homogènes, IN- FORSID 2002 -Actes du 20ème congrès INFORSID, pp.229-242, 2002.

]. Qi and B. D. Davison, Knowing a web page by the company it keeps, Proceedings of the 15th ACM international conference on Information and knowledge management , CIKM '06, 2006.
DOI : 10.1145/1183614.1183650

L. Xiaoguang-qi, B. D. Nie, and . Davison, Measuring similarity to detect qualified links, AIRWeb'07 -Proceedings of the 3rd international workshop on adversarial information retrieval on the web, p.49, 2007.

X. Qi and B. D. Davison, Classifiers without borders, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, p.643, 2008.
DOI : 10.1145/1390334.1390443

]. Qi and B. D. Davison, Web page classification, ACM Computing Surveys, vol.41, issue.2, 2009.
DOI : 10.1145/1459352.1459357

C. Filippo-radicchi, F. Castellano, V. Cecconi, D. Loreto, and . Parisi, Defining and identifying communities in networks, Proceedings of the national academy of sciences, 2004.

U. Nandini-raghavan, R. Albert, and S. Kumara, Near linear time algorithm to detect community structures in large-scale networks, Physical review E, vol.76, issue.3, pp.55-57, 2007.

S. Daniel-ramage, D. Dumais, and . Liebling, Characterizing microblogs with topic models, ICWSM 2010 -Proceedings of the 4th international AAAI conference on weblogs and social media, 2010.

K. Reddy and M. Kitsuregawa, An approach to relate the Web communities through bipartite graphs, Proceedings of the Second International Conference on Web Information Systems Engineering, 2001.
DOI : 10.1109/WISE.2001.996491

C. Reutenauer, E. Jacquey, M. Lecolle, and M. Valette, Sémème au macroscope : genèse et variation sémiques d'une unité lexicale, JADT 2010 -Proceedings of the 10th international conference on the statistical analysis of textual data, p.2010, 2010.

]. Richardson and P. Domingos, The intelligent surfer : probabilistic combination of link and content information in PageRank, NIPS, pp.1441-1448, 2001.

M. Rizoiu, J. Velcin, and J. Chauchat, Regrouper les données textuelles et nommer les groupes à l'aide des classes recouvrantes, EGC 2010 -Actes de la 10ème conférence extraction et gestion des connaissances, pp.561-572, 2010.

R. and S. Romaine, The evolution of linguistic complexity in pidgin and creole languages. The evolution of human languages, pp.213-238, 1992.

]. Saitou and M. Nei, The neighbor-joining method : a new method for reconstructing phylogenetic trees, Molecular biology and evolution, vol.4, issue.4, pp.406-425, 1987.

A. Gerard-salton, C. Wong, and . Yang, A vector space model for automatic indexing, Communications of the ACM, vol.18, issue.11, pp.613-620, 1975.
DOI : 10.1145/361219.361220

H. Jagan-sankaranarayanan, B. Samet, M. Teitler, J. Lieberman, and . Sperling, TwitterStand, Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS '09, pp.42-51, 2009.
DOI : 10.1145/1653771.1653781

F. Saussure, Cours de linguistique générale, 1916.

]. Savoy and J. Picard, Recherche documentaire sur le Web : les hyperliens sont-ils vraiment utiles ?, JADT 2000 -Proceedings of the 5th international conference on the statistical analysis of textual data, 2000.

H. Sayyadi, M. Hurst, and A. Maykov, Event detection and tracking in social streams, ICWSM 2009 -Proceedings of the 3rd international AAAI conference on weblogs and social media, 2009.

P. Froissart and G. Soulez, Rumeurs et emballements -Comment les décrire, comment leur résister ? Médiamorphoses, 2004.

S. Mark and J. Tenenbaum, The large-scale structure of semantic networks : statistical analysis and a model of semantic growth, Cognitive science, vol.29, pp.41-78, 2005.

J. Swan and . Allan, Automatic generation of overview timelines, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '00, pp.49-56, 2000.
DOI : 10.1145/345508.345546

Y. Sing-hoi-sze, Q. Lu, and . Yang, A polynomial time solvable formulation of multiple sequence alignment, Journal of computational biology, vol.13, issue.2, pp.309-319, 2006.

X. Bin-tan, C. Shen, and . Zhai, Mining long-term search history to improve search accuracy, KDD'06 -Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, pp.718-723, 2006.

D. Jaime-teevan, . Ramage, and . Morris, #TwitterSearch : a comparison of microblog search and web search, WSDM'11 -Proceedings of the 4th ACM international conference on web search and data mining, pp.35-44, 2011.

J. Thompson, D. Higgins, and T. Gibson, CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Research, vol.22, issue.22, pp.4673-4680, 1994.
DOI : 10.1093/nar/22.22.4673

]. Toyoda and M. Kitsuregawa, Creating a Web community chart for navigating related communities, Proceedings of the twelfth ACM conference on Hypertext and Hypermedia , HYPERTEXT '01, pp.103-112, 2001.
DOI : 10.1145/504216.504244

M. Shuji-tsukiyama, H. Ide, I. Ariyoshi, and . Shirakawa, A New Algorithm for Generating All the Maximal Independent Sets, SIAM Journal on Computing, vol.6, issue.3, pp.505-517, 1977.
DOI : 10.1137/0206036

M. Valette, A. Estacio-moreno, É. Petitjean, and E. Jacquey, Éléments pour la génération de classes sémantiques à partir de définitions lexicographiques. Pour une approche sémique du sens, Actes de TALN2006, pp.357-366, 2006.

]. Velcin and J. Ganascia, Topic Extraction with AGAPE, ADMA'07 -Proceedings of the 3rd international conference on advanced data mining and applications, pp.377-388, 2007.
DOI : 10.1007/978-3-540-73871-8_35

URL : https://hal.archives-ouvertes.fr/hal-01336130

]. Viprey, Dynamique du vocabulaire des Fleurs du Mal, 1997.

]. Viprey, Structure non-s??quentielle des textes, Langages, vol.163, issue.3, pp.71-85, 2006.
DOI : 10.3917/lang.163.0071

URL : http://www.cairn.info/load_pdf.php?ID_ARTICLE=LANG_163_0071

]. , W. , and B. D. Davison, Counting ancestors to estimate authority, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval -SIGIR '09, 2009.

]. , W. , and B. D. Davison, Counting ancestors to estimate authority, SIGIR'09 -Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, 2009.

Y. Wang, X. Ni, J. Sun, Y. Tong, and Z. Chen, Representing document as dependency graph for document clustering, Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, pp.2177-2180, 2011.
DOI : 10.1145/2063576.2063920

]. Wasserman and K. Faust, Social network analysis -Methods and applications, 1994.

D. Watts and S. Strogatz, Collective dynamics of 'small-world' networks, Nature, vol.393, issue.6684, pp.440-442, 1998.
DOI : 10.1038/30918

]. Weiss, B. Velez, M. Sheldon, C. Namprempre, P. Szilagyi et al., HyPursuit, Proceedings of the the seventh ACM conference on Hypertext , HYPERTEXT '96, 1996.
DOI : 10.1145/234828.234846

J. Weng, E. Lim, J. Jiang, and Q. He, TwitterRank, Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pp.261-305, 2010.
DOI : 10.1145/1718487.1718520

]. White, K. Mccain, and . Bibliometrics, Annual review of information science and technology, pp.119-186, 1989.

]. Wilson, Second-hand knowledge. An inquiry into cognitive authority, 1983.

]. Wu and B. D. Davison, Detecting semantic cloaking on the web, Proceedings of the 15th international conference on World Wide Web , WWW '06, 2006.
DOI : 10.1145/1135777.1135901

]. Wu and B. D. Davison, Detecting semantic cloaking on the web, Proceedings of the 15th international conference on World Wide Web , WWW '06, 2006.
DOI : 10.1145/1135777.1135901

]. Wu and B. D. Davison, Undue influence, Proceedings of the 2006 ACM symposium on Applied computing , SAC '06, p.1099, 2006.
DOI : 10.1145/1141277.1141535

D. Sarita-yardi, G. Romero, D. Schoenebeck, and . Boyd, Detecting spam in a Twitter network, 2010.

Y. David, Unsupervised word sense disambiguation rivaling supervised methods, ACL'95 -Proceedings of the 33rd annual meeting on association for computational linguistics, pp.189-196, 1995.

Y. David, Unsupervised word sense disambiguation rivaling supervised methods, Proceedings of the 33rd annual meeting on Association for Computational Linguistics (ACL '95), pp.189-196, 1995.

O. Zamir and O. Etzioni, Web document clustering, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '98, pp.46-54, 1998.
DOI : 10.1145/290941.290956

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.98.2279

]. Zhong, Efficient online spherical k-means clustering, IJCNN'05 -Proceedings of the IEEE international joint conference on neural networks, pp.3180-3185, 2005.