Bibliography

R. Agrawal, T. Imieliński, and A. Swami, Mining association rules between sets of items in large databases, ACM SIGMOD Record, vol.22, pp.207-216, 1993.

R. Al-rfou, V. Kulkarni, B. Perozzi, and S. Skiena, Polyglot-NER : Massive multilingual named entity recognition, Proceedings of the 2015 SIAM International Conference on Data Mining, pp.586-594, 2015.

R. Al-rfou, B. Perozzi, and S. Skiena, Polyglot : Distributed word representations for multilingual NLP, Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp.183-192, 2013.

R. Al-shalabi, G. Kanaan, and M. Gharaibeh, Arabic text categorization using knn algorithm, the Proc. of Int. multi conf. on computer science and information technology CSIT06, 2006.

R. Alfred, L. C. Leong, C. K. On, and P. Anthony, Malay named entity recognition based on rule-based approach, International Journal of Machine Learning and Computing, vol.4, pp.300-306, 2014.

A. Allauzen and H. Bonneau-maynard, Training and evaluation of POS Taggers on the French multitag corpus, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), pp.3373-3377, 2008.

J. Anis, Parlez-vous texto ?, Guide des nouveaux langages du réseau. Éditions du Cherche Midi, 2001.

R. Arulanandam, B. T. Savarimuthu, and M. Purvis, Extracting crime information from online newspaper articles, Second Australasian Web Conference, AWC 2014, pp.31-38, 2014.

V. V. Asch, Macro-and micro-averaged evaluation measures, 2013.

R. Baeza-yates and B. Ribeiro-neto, Modern information retrieval, 1999.

S. Bannour, L. Audibert, and A. Nazarenko, Mesures de similarité distributionnelle entre termes, IC2011, pp.523-538, 2011.

F. Béchet and E. Charton, Unsupervised knowledge acquisition for extracting named entities from speech, Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp.5338-5341, 2010.

S. M. Beitzel, E. C. Jensen, A. Chowdhury, and O. Frieder, Varying approaches to topical web query classification, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '07, pp.783-784, 2007.

A. L. Berger and V. O. Mittal, Ocelot : A system for summarizing web pages, Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '00, pp.144-151, 2000.

N. Bhatia and Vandana, Survey of nearest neighbor techniques, 2010.

D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel, Nymble : a high-performance learning name-finder, Proceedings of the fifth conference on Applied natural language processing, pp.194-201, 1997.

M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, and S. Fienberg, Adaptive name matching in information integration, IEEE Intelligent Systems, vol.18, issue.5, pp.16-23, 2003.

F. Bilhaut, F. Dumoncel, P. Enjalbert, and N. Hernandez, Indexation sémantique et recherche d'information interactive, CORIA, vol.7, pp.65-76, 2007.

A. Blessing and H. Schütze, Self-annotation for fine-grained geospatial relation extraction, COLING 2010, 23rd International Conference on Computational Linguistics, Proceedings of the Conference, pp.80-88, 2010.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, 2016.

K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor, Freebase : a collaboratively created graph database for structuring human knowledge, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp.1247-1250, 2008.

K. Bollacker, P. Tufts, T. Pierce, and R. Cook, A platform for scalable, collaborative, structured information integration, Intl. Workshop on Information Integration on the Web (IIWeb'07), pp.22-27, 2007.

D. Bollegala, Y. Matsuo, and M. Ishizuka, Measuring semantic similarity between words using web search engines, pp.757-766, 2007.

D. Bollegala, Y. Matsuo, and M. Ishizuka, A relational model of semantic similarity between words using automatically extracted lexical pattern clusters from the web, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol.2, pp.803-812, 2009.

A. Bordes, N. Usunier, A. Garcia-Duran, J. Weston, and O. Yakhnenko, Translating embeddings for modeling multi-relational data, Advances in neural information processing systems, pp.2787-2795, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00920777

S. Borhaninejad, F. Hakimpour, and E. Hamzei, Tags extarction from spatial documents in search engines. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol.40, pp.111-113, 2015.

E. Borra and B. Rieder, Programmed method : developing a toolset for capturing and analyzing tweets, Aslib Journal of Information Management, vol.66, issue.3, pp.262-278, 2014.

A. Borthwick, J. Sterling, E. Agichtein, and R. Grishman, NYU : Description of the MENE named entity system as used in muc-7, Proceedings of the Seventh Message Understanding Conference, p.7, 1998.

S. Brin and L. Page, The anatomy of a large-scale hypertextual web search engine, Comput. Netw. ISDN Syst, vol.30, issue.1-7, pp.107-117, 1998.

S. Brin and L. Page, Reprint of : The anatomy of a large-scale hypertextual web search engine, Computer networks, vol.56, issue.18, pp.3825-3833, 2012.

L. Bruce, Processing electronic medical records : Ontology-driven information extraction and structuring in the clinical domain, 2012.

C. Brun and M. Ehrmann, Un système de détection d'entités nommées adapté pour la campagne d'évaluation ester 2, Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles (TALN'10), 2010.

M. H. Btoush, A. Alarabeyyat, and I. Olab, Rule based approach for arabic part of speech tagging and name entity recognition, International Journal of Advanced Computer Science and Applications, vol.7, issue.6, pp.331-335, 2016.

H. Chen, M. Lin, and Y. Wei, Novel association measures using web search with double checking, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, pp.1009-1016, 2006.

J. Chen, A. G. Cohn, D. Liu, S. Wang, J. Ouyang et al., A survey of qualitative spatial representations, The Knowledge Engineering Review, vol.30, pp.106-136, 2015.

N. Chinchor and E. Marsh, Muc-7 information extraction task definition, Proceeding of the seventh message understanding conference (MUC-7), pp.359-367, 1998.

M. Choudhury, R. Saraf, V. Jain, A. Mukherjee, S. Sarkar et al., Investigation and modeling of the structure of texting language, IJDAR, vol.10, issue.3-4, pp.157-174, 2007.

K. W. Church and P. Hanks, Word association norms, mutual information, and lexicography, Computational Linguistics, vol.16, pp.22-29, 1990.

R. L. Cilibrasi and P. M. Vitanyi, The google similarity distance, IEEE Transactions on knowledge and data engineering, vol.19, issue.3, pp.370-383, 2007.

F. Ciravegna, (LP)², an adaptive algorithm for information extraction from web-related texts, Proceedings of the IJCAI-2001 Workshop on Adaptive Text Extraction and Mining, 2001.

O. Collin, A. Guerraz, Y. Hiou, and N. Voisine, , p.65, 2013.

M. Collins and Y. Singer, Unsupervised models for named entity classification, Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp.100-110, 1999.

R. Cooper, S. Ali, and C. Bi, Extracting information from short messages, Natural Language Processing and Information Systems, 10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005, Proceedings, pp.388-391, 2005.

R. Cooper and S. Manson, Extracting temporal information from short messages, Data Management. Data, Data Everywhere, 24th British National Conference on Databases, BNCOD 24, Proceedings, pp.224-234, 2007.

L. Cougnon and G. Ledegen, c'est écrire comme je parle. une étude comparatiste de variétés de français dans l'écrit SMS, 2008.

B. Daille, Approche mixte pour l'extraction de terminologie : statistique lexicale et filtres linguistiques, 1994.

S. Petrović, J. Šnajder, B. Dalbelo Bašić, and M. Kolar, Comparison of collocation extraction measures for document indexing, CIT. Journal of Computing and Information Technology, vol.14, issue.4, pp.321-327, 2006.

L. Derczynski, A. Ritter, S. Clark, and K. Bontcheva, Twitter part-of-speech tagging for all : Overcoming sparse and noisy data, RANLP, pp.198-206, 2013.

M. Dinarelli and S. Rosset, Models cascade for tree-structured named entity detection, IJCNLP, pp.1269-1278, 2011.

L. Dini, A. Bittar, and M. Ruhlmann, Approches hybrides pour l'analyse de recettes de cuisine deft, taln-recital 2013, Actes du neuvième DÉfi Fouille de Textes, p.51, 2013.

A. Dittrich, D. Richter, and C. Lucas, Analysing the usage of spatial prepositions in short messages, Progress in Location-Based Services, pp.153-169, 2014.

G. R. Doddington, A. Mitchell, M. A. Przybocki, L. A. Ramshaw, S. Strassel et al., The automatic content extraction (ace) program-tasks, data, and evaluation, LREC, vol.2, pp.837-840, 2004.

J. D'Souza and V. Ng, UTD : Ensemble-based spatial relation extraction, SemEval@NAACL-HLT, pp.862-869, 2015.

F. Duchateau, Z. Bellahsene, and M. Roche, Improving quality and performance of schema matching in large scale, vol.13, pp.59-82, 2008.
URL : https://hal.archives-ouvertes.fr/lirmm-00343491

M. J. Egenhofer, Reasoning about binary topological relations, Proceedings of the Second International Symposium on Advances in Spatial Databases, SSD '91, pp.143-160, 1991.

M. J. Egenhofer and R. D. Franzosa, Point-set topological spatial relations, International Journal of Geographical Information System, vol.5, issue.2, pp.161-174, 1991.

O. Etzioni, M. Cafarella, D. Downey, A. Popescu, T. Shaked et al., Unsupervised named-entity extraction from the web : An experimental study, Artif. Intell, vol.165, issue.1, pp.91-134, 2005.

F. Even, Extraction d'Information et modélisation de connaissances à partir de Notes de Communication Orale, 2005.

M. Ezzat, Acquisition de relations entre entités nommées à partir de corpus, 2014.

C. Fairon, J. R. Klein, and S. Paumier, Le langage SMS. Étude d'un corpus informatisé à partir de l'enquête "Faites don de vos SMS à la science", 2006.
URL : https://hal.archives-ouvertes.fr/hal-00621422

B. Fallery and F. Rodhain, Quatre approches pour l'analyse de données textuelles : lexicale, linguistique, cognitive, thématique, XVI ème Conférence de l'Association Internationale de Management Stratégique AIMS, pp.1-16, 2007.

C. G. Figuerola, A. F. Zazo Rodríguez, and J. L. Alonso Berrocal, Automatic vs manual categorisation of documents in Spanish, Journal of Documentation, vol.57, issue.6, pp.763-773, 2001.

J. R. Finkel, T. Grenager, and C. D. Manning, Incorporating non-local information into information extraction systems by gibbs sampling, ACL 2005, 43rd Annual Meeting of the Association for Computational Linguistics, pp.363-370, 2005.

R. Florian, A. Ittycheriah, H. Jing, and T. Zhang, Named entity recognition through classifier combination, pp.168-171, 2003.

K. T. Frantzi, S. Ananiadou, and H. Mima, Automatic recognition of multi-word terms : the c-value/nc-value method, Int. J. on Digital Libraries, vol.3, issue.2, pp.115-130, 2000.

N. Friburger, Reconnaissance automatique des noms propres : application à la classification automatique de textes journalistiques, 2002.

N. Friburger and D. Maurel, Finite-state transducer cascades to extract named entities in texts, Theoretical Computer Science, vol.313, issue.1, pp.93-104, 2004.

M. Gaio, Traitements de l'information géographique : Représentations et structures. Habilitation à diriger des recherches, 2001.

S. Galliano, G. Gravier, and L. Chaubard, The ester 2 evaluation campaign for the rich transcription of french radio broadcasts, Tenth Annual Conference of the International Speech Communication Association, 2009.

G. Geleijnse and J. Korst, Creating a dead poets society : Extracting a social network of historical persons from the web, Proceedings of the 6th International The Semantic Web and 2Nd Asian Conference on Asian Semantic Web Conference, ISWC'07/ASWC'07, pp.156-168, 2007.

G. Geleijnse, J. Korst, and V. De-boer, Instance classification using co-occurrences on the web, Proceedings of the ISWC 2006 workshop on Web Content Mining (WebConMine), 2006.

S. Geman and D. Geman, Stochastic relaxation, gibbs distributions, and the bayesian restoration of images, IEEE Trans. Pattern Anal. Mach. Intell, vol.6, issue.6, pp.721-741, 1984.

K. Gimpel, N. Schneider, B. O'connor, D. Das, D. Mills et al., Part-of-speech tagging for twitter : Annotation, features, and experiments, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies : Short Papers, vol.2, pp.42-47, 2011.

F. Gotti and G. Lapalme, Zodiac : Insertion automatique des signes diacritiques du français, Traitement Automatique des Langues Naturelles, pp.19-20, 2014.

R. Grishman and B. Sundheim, Message understanding conference-6 : A brief history, Proceedings of the 16th conference on Computational linguistics, vol.1, pp.466-471, 1996.

D. Guthrie, B. Allison, W. Liu, L. Guthrie, and Y. Wilks, A closer look at skip-gram modelling, Proceedings of the 5th international Conference on Language Resources and Evaluation (LREC-2006), pp.1-4, 2006.

U. Hahn, E. Buyko, R. Landefeld, M. Mühlhausen, M. Poprat et al., An overview of jcore, the julie lab uima component repository, Proceedings of the LREC, vol.8, pp.1-7, 2008.

B. Han and T. Baldwin, Lexical normalisation of short text messages : Makn sens a #twitter, The 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies, Proceedings of the Conference, pp.19-24, 2011.

B. Han, P. Cook, and T. Baldwin, Lexical normalization for social media text, ACM Trans. Intell. Syst. Technol, vol.4, issue.1, p.27, 2013.

M. Hatmi, C. Jacquin, E. Morin, and S. Meignier, Named entity recognition in speech transcripts following an extended taxonomy, SLAM@INTERSPEECH, pp.61-65, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00843606

Z. He, J. Hong, and D. Bell, Schema matching across query interfaces on the deep web, British National Conference on Databases, pp.51-62, 2008.

M. A. Hearst, Automatic acquisition of hyponyms from large text corpora, Proceedings of the 14th conference on Computational linguistics, vol.2, pp.539-545, 1992.

I. Hendrickx, S. N. Kim, Z. Kozareva, P. Nakov, D. Ó Séaghdha et al., SemEval-2010 task 8 : Multi-way classification of semantic relations between pairs of nominals, Proceedings of the Workshop on Semantic Evaluations : Recent Achievements and Future Directions, DEW '09, pp.94-99, 2009.

L. L. Hill, Core elements of digital gazetteers : placenames, categories, and footprints, International Conference on Theory and Practice of Digital Libraries, pp.280-290, 2000.

E. Hovy, M. Marcus, M. Palmer, L. Ramshaw, and R. Weischedel, Ontonotes : the 90% solution, Proceedings of the human language technology conference of the NAACL, Companion Volume : Short Papers, pp.57-60, 2006.

H. Isozaki and H. Kazawa, Efficient support vector classifiers for named entity recognition, Proceedings of the 19th international conference on Computational linguistics, vol.1, pp.1-7, 2002.

P. Jaccard, Distribution de la flore alpine dans le bassin des dranses et dans quelques régions voisines, Bulletin de la Société Vaudoise des Sciences Naturelles, vol.37, pp.241-272, 1901.

M. A. Jaro, Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida, Journal of the American Statistical Association, vol.84, issue.406, pp.414-420, 1989.

J. J. Jiang and D. W. Conrath, Semantic similarity based on corpus statistics and lexical taxonomy, 1997.

T. Joachims, Svmlight : Support vector machine, vol.19, 1999.

C. B. Jones, R. Purves, A. Ruas, M. Sanderson, M. Sester et al., Spatial information retrieval and geographical ontologies an overview of the spirit project, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp.387-388, 2002.

K. Kageura and B. Umino, Methods of automatic term recognition : A review, Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication, vol.3, issue.2, pp.259-289, 1996.

A. Kent, M. M. Berry, F. U. Luehrs, and J. W. Perry, Machine literature searching viii. operational criteria for designing information retrieval systems, Journal of the Association for Information Science and Technology, vol.6, issue.2, pp.93-101, 1955.

E. Kergosien, C. Sallaberry, M. Bessagnet, A. Le Parc-Lacayrelle et al., Using a GIR tool in a Business Intelligence Context : the case of EGC conferences, Proceedings of the 7th International Conference on Information Systems and Economic Intelligence, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01590207

J. Kim, I. Kang, and K. Choi, Unsupervised named entity classification models and their ensembles, Proceedings of the 19th International Conference on Computational Linguistics, vol.1, pp.1-7, 2002.

C. Kobus, F. Yvon, and G. Damnati, Normalizing SMS : are two metaphors better than one ?, COLING 2008, 22nd International Conference on Computational Linguistics, Proceedings of the Conference, pp.441-448, 2008.

P. Kordjamshidi, M. Van-otterlo, and M. Moens, Spatial role labeling : Towards extraction of spatial relations from natural language, ACM Transactions on Speech and Language Processing (TSLP), vol.8, issue.3, p.4, 2011.

S. E. Kramdi, O. Haemmerlé, and N. Hernandez, Approche générique pour l'extraction de relations à partir de textes, Journées Francophones d'Ingénierie des Connaissances, pp.97-108, 2009.

G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer, Neural architectures for named entity recognition, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, pp.260-270, 2016.

J. Lesbegueries, Plate-forme pour l'indexation spatiale multi-niveaux d'un corpus territorialisé, 2007.

J. Lesbegueries and P. Loustau, Structuration d'information spatiale qualitative pour la recherche d'information. Représentation et raisonnement sur le temps et l'espace, vol.1, p.4, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00408640

J. Lesbegueries, C. Sallaberry, and M. Gaio, Associating spatial patterns to text-units for summarizing geographic information, Proceedings of ACM SIGIR 2006. GIR, Geographic Information Retrieval, Workshop, pp.40-43, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00325289

V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals, Soviet physics doklady, vol.10, pp.707-710, 1966.

C. Li and A. Sun, Fine-grained location extraction from tweets with temporal awareness, The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '14, pp.43-52, 2014.

Z. Liao and H. Wu, Biomedical named entity recognition based on skip-chain CRFs, Industrial Control and Electronics Engineering (ICICEE), 2012 International Conference on, pp.1495-1498, 2012.

D. Lin, An information-theoretic definition of similarity, Proc. of the Fifteenth Int. Conf. on Machine Learning (ICML), pp.296-304, 1998.

X. Ling and D. S. Weld, Fine-grained entity recognition, Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, pp.94-100, 2012.

J. Lingad, S. Karimi, and J. Yin, Location extraction from disaster-related microblogs, Proceedings of the 22nd international conference on world wide web, pp.1017-1020, 2013.

P. Lison and A. Kutuzov, Redefining context windows for word embedding models : An experimental study, 2017.

F. Liu, M. Vasardani, and T. Baldwin, Automatic identification of locative expressions from social media text : A comparative analysis, Proceedings of the 4th International Workshop on Location and the Web, LocWeb '14, pp.9-16, 2014.

X. Liu, F. Wei, S. Zhang, and M. Zhou, Named entity recognition for tweets, vol.4, p.3, 2013.

Y. Liu, Q. Guo, and M. Kelly, A framework of region-based spatial relations for non-overlapping features and its application in object based image analysis, ISPRS Journal of Photogrammetry and Remote Sensing, vol.63, issue.4, pp.461-475, 2008.

C. Loglisci, D. Ienco, M. Roche, M. Teisseire, and D. Malerba, Toward geographic information harvesting : Extraction of spatial relational facts from web documents, 12th IEEE International Conference on Data Mining Workshops, ICDM Workshops, pp.789-796, 2012.
URL : https://hal.archives-ouvertes.fr/lirmm-00816292

C. Lopez, I. Partalas, G. Balikas, N. Derbas, A. Martin et al., CAp 2017 challenge : Twitter named entity recognition, CoRR, 2017.

J. A. Lossio-ventura, C. Jonquet, M. Roche, and M. Teisseire, Biomedical term extraction : overview and a new methodology, Information Retrieval Journal, vol.19, issue.1-2, pp.59-99, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01274539

G. Luo, X. Huang, C. Lin, and Z. Nie, Joint named entity recognition and disambiguation, Proc. EMNLP, pp.879-880, 2015.

A. Maedche and S. Staab, Measuring similarity between ontologies, Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web, Int. Conf. EKAW, pp.251-263, 2002.

R. Malouf, Markov models for language-independent named entity recognition, Proceedings of the 6th Conference on Natural Language Learning, 2002.

I. Mani, J. Hitzeman, J. Richer, D. Harris, R. Quimby et al., Spatialml : Annotation scheme, corpora, and tools, LREC, 2008.

A. Mansouri, L. S. Affendey, and A. Mamat, Named entity recognition approaches, pp.339-344, 2008.

C. Martineau, E. Tolone, and S. Voyatzi, Les Entités Nommées : usage et degrés de précision et de désambiguïsation, 26ème Colloque international sur le Lexique et la Grammaire (LGC'07), pp.105-112, 2007.

D. Maurel, N. Friburger, J. Antoine, I. Eshkol, and D. Nouvel, Cascades de transducteurs autour de la reconnaissance des entités nommées, vol.52, pp.69-96, 2011.

D. Maurel, N. Friburger, and I. Eshkol, Who are you, you who speak ? Transducer cascades for information retrieval, 4th Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics, pp.220-223, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01174643

D. Maynard, Multi-source and multilingual information extraction, Expert Update, vol.6, issue.3, pp.11-16, 2003.

A. Mikheev, M. Moens, and C. Grover, Named entity recognition without gazetteers, Proceedings of the Ninth Conference on European Chapter of the Association for Computational Linguistics, EACL '99, pp.1-8, 1999.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

L. Moncla, M. Gaio, and S. Mustiere, Automatic itinerary reconstruction from texts, International Conference on Geographic Information Science, pp.253-267, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01082082

D. M. Monteiro, A proposal for an architecture to extract information from SMS messages during emergency situations, 2015.

R. Munro and C. D. Manning, Accurate unsupervised joint named-entity extraction from unaligned parallel text, Proceedings of the 4th Named Entity Workshop, NEWS '12, pp.21-29, 2012.

D. Nadeau, P. D. Turney, and S. Matwin, Unsupervised named-entity recognition : Generating gazetteers and resolving ambiguity, Proceedings of the 19th International Conference on Advances in Artificial Intelligence : Canadian Society for Computational Studies of Intelligence, AI'06, pp.266-277, 2006.

A. Nagesh, G. Ramakrishnan, L. Chiticariu, R. Krishnamurthy, A. Dharkar et al., Towards efficient named-entity rule induction for customizability, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, vol.12, pp.128-138, 2012.

G. Navarro, A guided tour to approximate string matching, ACM Comp. Surv, pp.31-88, 2001.

C. Neudecker, An open corpus for named entity recognition in historic newspapers, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2016.

V. T. Nguyen, Méthode d'extraction d'informations géographiques à des fins d'enrichissement d'une ontologie de domaine, 2012.

V. T. Nguyen, M. Gaio, and C. Sallaberry, Recherche de relations spatio-temporelles : une méthode basée sur l'analyse de corpus textuels, 2010.

I. Niles and A. Pease, Towards a standard upper ontology, Proceedings of the international conference on Formal Ontology in Information Systems, pp.2-9, 2001.

P. V. Ogren, P. G. Wetzler, and S. Bethard, Cleartk : A uima toolkit for statistical natural language processing, Towards Enhanced Interoperability for Large HLT Systems : UIMA for NLP, p.32, 2008.

J. Oliva, J. I. Serrano, M. D. Castillo, and Á. Iglesias, SMS normalization : combining phonetics, morphology and semantics, Conference of the Spanish Association for Artificial Intelligence, pp.273-282, 2011.

R. Panckhurst, Short Message Service (SMS) : typologie et problématiques futures, Polyphonies, pour Michelle Lanvin, pp.33-52, 2009.

R. Panckhurst, C. Détrie, C. Lopez, C. Moïse, M. Roche et al., Sud4science, de l'acquisition d'un grand corpus de SMS en français à l'analyse de l'écriture SMS, Epistémé, vol.9, pp.107-138, 2013.

R. Panckhurst, C. Détrie, C. Lopez, C. Moïse, M. Roche et al., 88milSMS. A corpus of authentic text messages in French. Produced by Université Paul-Valéry Montpellier III and CNRS, in collaboration with Université catholique de Louvain, funded with the support of MSH-M and the Ministère de la Culture (Délégation générale à la langue française et aux langues de France), with the participation of Praxiling, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01068727

Y. Park and R. J. Byrd, Hybrid text mining for finding abbreviations and their definitions, Proceedings of the 2001 conference on empirical methods in natural language processing, pp.126-133, 2001.

N. Patel, P. Accorsi, D. Inkpen, C. Lopez, and M. Roche, Approaches of anonymisation of an SMS corpus, CICLing : Conference on Intelligent Text Processing and Computational Linguistics, vol.7816, pp.77-88, 2013.
URL : https://hal.archives-ouvertes.fr/lirmm-00816285

S. Paumier, De la reconnaissance des formes linguistiques à l'analyse syntaxique, 2003.

J. Plu, G. Rizzo, and R. Troncy, A hybrid approach for entity recognition and linking, Semantic Web Evaluation Challenge, pp.28-39, 2015.

P. Pandey, D. Amin, and S. Govilkar, Rule based stemmer using marathi wordnet for marathi language, International Journal of Advanced Research in Computer and Communication Engineering, vol.5, pp.278-282, 2016.

J. Pustejovsky, P. Kordjamshidi, M. Moens, A. Levine, S. Dworman et al., SemEval-2015 task 8 : SpaceEval, Proceedings of the 9th international workshop on semantic evaluation, pp.884-894, 2015.

D. Ramage, D. Hall, R. Nallapati, and C. D. Manning, Labeled lda : A supervised topic model for credit attribution in multi-labeled corpora, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol.1, pp.248-256, 2009.

L. Ratinov and D. Roth, Design challenges and misconceptions in named entity recognition, Proceedings of the Thirteenth Conference on Computational Natural Language Learning, CoNLL '09, pp.147-155, 2009.

C. Raymond, Robust tree-structured named entities recognition from speech, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp.8475-8479, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00830142

C. Raymond and J. Fayolle, Reconnaissance robuste d'entités nommées sur de la parole transcrite automatiquement, Conférence Traitement automatique des langues naturelles, TALN'10, 2010.

C. Reul, P. Köberle, N. Üçeyler, and F. Puppe, Expectation-driven text extraction from medical ultrasound images, MIE, pp.712-716, 2016.

A. Rikitianskii, M. Harvey, and F. Crestani, A personalised recommendation system for context-aware suggestions, ECIR, pp.63-74, 2014.

A. Ritter, S. Clark, . Mausam, and O. Etzioni, Named entity recognition in tweets : An experimental study, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, vol.2011, pp.1524-1534, 2011.

G. Rizzo and R. Troncy, NERD : evaluating named entity recognition tools in the web of data, ISWC 2011, Workshop on Web Scale Knowledge Extraction (WEKEX'11), 2011.

K. Roberts, M. A. Skinner, and S. M. Harabagiu, Recognizing spatial containment relations between event mentions, IWCS, pp.216-227, 2013.

M. Roche, Fouille de Textes : de l'extraction des descripteurs linguistiques à leur induction, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00816263

M. Roche, Fonctions de rang et fouille du web pour l'identification et la catégorisation d'entités nommées, JADT'2012 : 11ièmes Journées internationales d'analyse statistique des données textuelles, pp.859-870, 2012.

M. Roche and Y. Kodratoff, Text and web mining approaches in order to build specialized ontologies, Journal of Digital Information, vol.10, issue.4, pp.1-6, 2009.
URL : https://hal.archives-ouvertes.fr/lirmm-00424463

M. Roche and V. Prince, Acrodef : A quality measure for discriminating expansions of ambiguous acronyms, International and Interdisciplinary Conference on Modeling and Using Context, pp.411-424, 2007.
URL : https://hal.archives-ouvertes.fr/lirmm-00168945

M. Sahami and T. D. Heilman, A web-based kernel function for measuring the similarity of short text snippets, Proceedings of the 15th international conference on World Wide Web, pp.377-386, 2006.

H. A. Salas, E. Kergosien, M. Roche, and M. Teisseire, Animitex project : Image analysis based on textual information, SIMBig : Symposium on Information Management and Big Data, pp.49-52, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01144395

C. Sallaberry, Geographical Information Retrieval in Textual Corpora, FOCUS - Geographical Information Systems Series, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01094358

C. Sallaberry, M. Baziz, J. Lesbegueries, and M. Gaio, Une approche d'extraction et de recherche d'information spatiale dans les documents textuels - évaluation, CORIA, pp.53-64, 2007.

C. Sallaberry, A. Royer, P. Loustau, M. Gaio, and T. Joliveau, Geostream : Spatial information indexing within textual documents supported by a dynamically parameterized web service, OGRS 2009 : International Opensource Geospatial Research Symposium, p.14, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00451949

G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, vol.24, pp.513-523, 1988.

H. Saneifar, S. Bonniol, A. Laurent, P. Poncelet, and M. Roche, Processus d'extraction et de validation de la terminologie issue de logs, JFO'09 : 3èmes Journées Francophones sur les Ontologies, pp.1-10, 2009.
URL : https://hal.archives-ouvertes.fr/lirmm-00423951

A. Savary and J. Piskorski, Lexicons and grammars for named entity annotation in the national corpus of polish, Intelligent Information Systems, pp.141-154, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01024162

J. Savoy, A stemming procedure and stopword list for general french corpora, JASIS, vol.50, issue.10, pp.944-952, 1999.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of the International Conference on New Methods in Language Processing, 1994.

V. Sehgal, L. Getoor, and P. D. Viechnicki, Entity resolution in geospatial data integration, Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems, pp.83-90, 2006.

L. Serrano, Vers une capitalisation des connaissances orientée utilisateur : extraction et structuration automatiques de l'information issue de sources ouvertes, 2014.

M. Severo, T. Giraud, and H. Pecout, Twitter data for urban policy making : an analysis on four european cities, Handbook of Twitter for Research, pp.132-155, 2015.

C. E. Shannon, A mathematical theory of communication, Bell System Technical Journal, vol.27, pp.623-656, 1948.

D. Shen, J. Sun, Q. Yang, and Z. Chen, Building bridges for web query classification, Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '06, pp.131-138, 2006.

D. Sileo, C. Pradel, P. Muller, and T. Van de Cruys, Synapse at CAp 2017 NER challenge : Fasttext CRF, CoRR, 2017.

M. Simard and A. Deslauriers, Real-time automatic insertion of accents in French text, Natural Language Engineering, vol.7, issue.2, pp.143-165, 2001.

G. G. Simpson, Mammals and the nature of continents, American Journal of Science, vol.241, issue.1, pp.1-31, 1943.

F. Smadja, K. R. Mckeown, and V. Hatzivassiloglou, Translating collocations for bilingual lexicons : A statistical approach, Computational linguistics, vol.22, issue.1, pp.1-38, 1996.

N. Smail, Contribution à l'analyse et à la recherche d'information en texte intégral : application de la transformée en ondelettes pour la recherche et l'analyse de textes, 2009.

D. A. Smith and G. S. Mann, Bootstrapping toponym classifiers, Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references, vol.1, pp.45-49, 2003.

Y. Song, J. Huang, I. G. Councill, J. Li, and C. L. Giles, Efficient topic-based unsupervised name disambiguation, Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries, pp.342-351, 2007.

R. Stern and B. Sagot, Détection et résolution d'entités nommées dans des dépêches d'agence, Traitement Automatique des Langues Naturelles : TALN 2010, 2010.

G. Stoilos, G. Stamou, and S. Kollias, A string metric for ontology alignment, The Semantic Web-ISWC 2005, pp.624-637, 2005.

S. Tahrat, E. Kergosien, S. Bringay, M. Roche, and M. Teisseire, Text2geo : des données textuelles aux informations géospatiales, vol.13, pp.407-412, 2013.

L. Tarrade and C. Lopez, Corpus de tweets et de SMS annotés pour l'observation de phénomènes linguistiques en français "non standard", Actes TALN'2017, pp.27-34, 2017.

E. F. Tjong Kim Sang and F. De Meulder, Introduction to the CoNLL-2003 shared task : Language-independent named entity recognition, Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, vol.4, pp.142-147, 2003.

M. Tkachenko and A. Simanovsky, Named entity recognition : Exploring features, KONVENS 2012, Empirical Methods in Natural Language Processing, pp.118-127, 2012.

K. Toutanova and C. D. Manning, Enriching the knowledge sources used in a maximum entropy part-of-speech tagger, Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora : held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, vol.13, pp.63-70, 2000.

P. D. Turney, Mining the web for synonyms : PMI-IR versus LSA on TOEFL, Proceedings of the 12th European Conference on Machine Learning, ECML '01, pp.491-502, 2001.

A. Turpin, Y. Tsegay, D. Hawking, and H. E. Williams, Fast generation of result snippets in web search, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '07, pp.127-134, 2007.

E. L. Usery, A feature-based geographic information system model, Photogrammetric Engineering and Remote Sensing, vol.62, pp.833-838, 1996.

M. P. van der Loo, The stringdist package for approximate string matching, The R Journal, vol.6, pp.111-122, 2014.

R. Varadarajan and V. Hristidis, A system for query-specific document summarization, Proceedings of the 15th ACM International Conference on Information and Knowledge Management, CIKM '06, pp.622-631, 2006.

J. Vergne, Découverte locale des mots vides dans des corpus bruts de langues inconnues, sans aucune ressource, vol.2, pp.1158-1164, 2004.

U. Visser, T. Vögele, and C. Schlieder, Spatio-terminological information retrieval using the BUSTER system, pp.93-100, 2002.

T. Wakao, R. Gaizauskas, and Y. Wilks, Evaluation of an algorithm for the recognition and classification of proper names, Proceedings of the 16th Conference on Computational Linguistics, vol.1, pp.418-423, 1996.

A. Widlöcher and F. Bilhaut, La plate-forme LinguaStream : un outil d'exploration linguistique sur corpus, Actes de la 12e Conférence Traitement Automatique du Langage Naturel (TALN'05), pp.517-522, 2005.

W. E. Winkler, The state of record linkage and current research problems, 1999.

H. Wu, D. R. Radev, and W. Fan, Towards answer-focused summarization using search engines, 2004.

X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang et al., Top 10 algorithms in data mining, Knowledge and information systems, vol.14, issue.1, pp.1-37, 2008.

Y. Wu, T. Fan, Y. Lee, and S. Yen, Extracting named entities using support vector machines, Proceedings of the 2006 International Conference on Knowledge Discovery in Life Science Literature, KDLL'06, pp.91-103, 2006.

Y. Xu, R. Jia, L. Mou, G. Li, Y. Chen et al., Improved relation classification by deep recurrent neural networks with data augmentation, 2016.

Z. Xu, J. Xuan, Y. Liu, K. R. Choo, L. Mei et al., Building spatial temporal relation graph of concepts pair using web repository, Information Systems Frontiers, pp.1-10, 2016.

T. Yang, A. J. Torget, and R. Mihalcea, Topic modeling on historical newspapers, Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pp.96-104, 2011.

Y. Yang and X. Liu, A re-examination of text categorization methods, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pp.42-49, 1999.

H. Yu, G. Hripcsak, and C. Friedman, Mapping abbreviations to full forms in biomedical articles, JAMIA, vol.9, issue.3, pp.262-272, 2002.

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, Discovering types of spatial relations with a text mining approach, Foundations of Intelligent Systems - 22nd International Symposium, ISMIS Proceedings, pp.442-451, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01358479

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, Découverte de nouvelles entités et relations spatiales à partir d'un corpus de SMS, 23ème Conférence sur le Traitement Automatique des Langues Naturelles TALN 2016, pp.403-410, 2016.

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, Extracting new spatial entities and relations from short messages, Proceedings of the 8th International Conference on Management of Digital EcoSystems MEDES'16, pp.189-196, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01400032

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, A corpus of 1000 authentic SMS in French with spatial labels, CIRAD Dataverse, 2017.

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, Dic-ES : Liste d'entités spatiales en français, CIRAD Dataverse, 2017.

S. Zenasni, E. Kergosien, M. Roche, and M. Teisseire, Spatial information extraction from short messages, Expert Systems With Applications, vol.95, pp.351-367, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01706936

T. Zhang and D. Johnson, A robust risk minimization based named entity recognition system, Proceedings of the Seventh Conference on Natural Language Learning, CoNLL, pp.204-207, 2003.
