J. M. Adam, La linguistique textuelle : Introduction à l'analyse textuelle des discours, 2nd ed., 2008.

J. M. Adam, Les textes : types et prototypes, 2nd ed., 2008.

J. L. Austin, How to do Things with Words, 1962.
DOI : 10.1093/acprof:oso/9780198245537.001.0001

F. Bavaud, Generalized Factor Analyses for Contingency Tables, Classification, Clustering, and Data Mining Applications, Studies in Classification, Data Analysis, and Knowledge Organisation, pp.597-606, 2004.
DOI : 10.1007/978-3-642-17103-1_56

F. Bavaud, Aggregation invariance in general clustering approaches, Advances in Data Analysis and Classification, pp.205-225, 2009.

F. Bavaud, Euclidean Distances, Soft and Spectral Clustering on Weighted Graphs, 2010.
DOI : 10.1007/978-3-642-15880-3_13

F. Bavaud, On the Schoenberg Transformations in Data Analysis: Theory and Illustrations, Journal of Classification, vol.28, issue.3, pp.297-314, 2011.
DOI : 10.1007/s00357-011-9092-x

F. Bavaud, Testing spatial autocorrelation in weighted networks: the modes permutation test, Journal of Geographical Systems, vol.15, issue.3, pp.233-247, 2013.
DOI : 10.1007/s10109-013-0179-2

F. Bavaud and C. Cocco, Factor Analysis of Local Formalism (accepted for publication), Data Analysis, Learning by Latent Structures, and Knowledge Discovery, Studies in Classification, Data Analysis, and Knowledge Organization

F. Bavaud, C. Cocco, and A. Xanthos, Textual autocorrelation : formalism and illustrations, JADT 2012 : 11èmes Journées internationales d'Analyse statistique des Données Textuelles, pp.109-120, 2012.

F. Bavaud, C. Cocco, and A. Xanthos, Textual navigation and autocorrelation (accepted for publication), Sequences in Language and Text, Quantitative Linguistics
DOI : 10.1515/9783110362879-004

URL : http://my.unil.ch/serval/document/BIB_5F0E654B2C33.pdf

D. Biber, Variation across Speech and Writing, 1988.
DOI : 10.1017/CBO9780511621024

G. E. Box and G. M. Jenkins, Time series analysis : forecasting and control, 1976.
DOI : 10.1002/9781118619193

K. Boyer, E. Y. Ha, R. Phillips, M. Wallis, M. Vouk et al., Dialogue Act Modeling in a Complex Task-Oriented Domain, Proceedings of the SIGDIAL 2010 Conference, pp.297-305, 2010.

J. P. Bronckart, Activité langagière, textes et discours : Pour un interactionisme sociodiscursif, 1996.

S. Camiz, The Guttman Effect : its Interpretation and a New Redressing Method, Tetradia Analushsq Dedomenwn (Data Analysis Bulletin), vol.5, pp.7-34, 2005.

G. Celeux and G. Govaert, A classification EM algorithm for clustering and two stochastic versions, Computational Statistics & Data Analysis, vol.14, issue.3, pp.315-332, 1992.
DOI : 10.1016/0167-9473(92)90042-E

URL : https://hal.archives-ouvertes.fr/inria-00075196

P. Charaudeau, Grammaire du sens et de l'expression, 1992.

A. D. Cliff and J. K. Ord, Spatial Processes : Models and Applications, 1981.

C. Cocco, Catégorisation automatique de propositions textuelles en types de discours, in Lire demain : des manuscrits antiques à l'ère digitale = Reading tomorrow : from ancient manuscripts to the digital era, pp.689-707, 2012.

C. Cocco, Discourse Type Clustering using POS n-gram Profiles and High-Dimensional Embeddings, Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp.55-63, 2012.

C. Cocco, Classification supervisée multi-étiquette en actes de dialogue : analyse discriminante et transformations de Schoenberg, JADT 2014 : 12èmes Journées internationales d'Analyse statistique des Données Textuelles, pp.147-160, 2014.

C. Cocco and F. Bavaud, Correspondence Analysis, Cross-Autocorrelation and Clustering in Polyphonic Music (accepted for publication), Data Analysis, Learning by Latent Structures, and Knowledge Discovery, Studies in Classification, Data Analysis, and Knowledge Organization

C. Cocco, R. Pittier, F. Bavaud, and A. Xanthos, Segmentation and Clustering of Textual Sequences : a Typological Approach, Proceedings of the International Conference Recent Advances in Natural Language Processing, pp.427-433, 2011.

W. W. Cohen, V. R. Carvalho, and T. M. Mitchell, Learning to Classify Email into "Speech Acts", Proceedings of EMNLP 2004, pp.309-316, 2004.

N. Colineau and J. Caelen, Étude de marqueurs dans les actes de dialogue dans un corpus de conception, 01Design'95 : Aspects communicatifs en conception, 4ème table ronde francophone sur la conception, pp.127-139, 1995.

F. Critchley and B. Fichet, The partial order by inclusion of the principal classes of dissimilarity on a finite set, and some of their basic properties, Classification and Dissimilarity Analysis, no. 93 in Lecture Notes in Statistics, pp.5-65, 1994.
DOI : 10.1007/978-1-4612-2686-4_2

C. M. Cuadras and J. Fortiana, Weighted continuous metric scaling, Multidimensional Statistical Analysis and Theory of Random Matrices, pp.27-40, 1996.
DOI : 10.1016/b978-0-444-81531-6.50009-x

F. Daoust, Y. Marcoux, and J. M. Viprey, L'annotation structurelle, JADT 2010 : 10th International Conference on Statistical Analysis of Textual Data, 2010.

G. de Maupassant, Le voleur, Gil Blas, 1882.

G. de Maupassant, L'orient, Le Gaulois, 1883.

G. de Maupassant, Un fou ?, Le Figaro, 1884.

G. de Maupassant, Un fou, Le Gaulois, 1885.

C. Dejean, M. Fortun, C. Massot, V. Pottier, F. Poulard et al., Un étiqueteur de rôles grammaticaux libre pour le français intégré à Apache UIMA, Actes de la 17e Conférence sur le Traitement Automatique des Langues Naturelles, 2010.

L. Denoeud and A. Guénoche, Comparison of Distance Indices Between Partitions, Data Science and Classification, Studies in Classification, Data Analysis, and Knowledge Organization, pp.21-28, 2006.
DOI : 10.1007/3-540-34416-0_3

F. Dupuis and L. Lebart, Visualization, validation and seriation, Historical Linguistics 2007, no. 308 in Current Issues in Linguistic Theory, pp.269-284, 2009.
DOI : 10.1075/cilt.308.22dup

B. Efron and R. J. Tibshirani, An Introduction to the Bootstrap, no. 57 in Monographs on Statistics and Applied Probability, 1993.

D. Ellis and G. E. Poliner, Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.1429-1432, 2007.
DOI : 10.1109/ICASSP.2007.367348

V. Estivill-Castro, Why so many clustering algorithms, ACM SIGKDD Explorations Newsletter, vol.4, issue.1, pp.65-75, 2002.
DOI : 10.1145/568574.568575

Z. Faget, Un modèle pour la gestion des séquences temporelles synchronisées. Application aux données musicales symboliques, Doctoral thesis, 2011.

C. Fellbaum, WordNet : An Electronic Lexical Database, 1998.

O. Ferschke, I. Gurevych, and Y. Chebotar, Behind the Article : Recognizing Dialog Acts in Wikipedia Talk Pages, Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp.777-786, 2012.

L. Filliettaz, Les types de discours, Círculo de lingüística aplicada a la comunicación, 2001.

R. A. Fisher, The Use of Multiple Measurements in Taxonomic Problems, Annals of Eugenics, vol.7, issue.2, pp.179-188, 1936.
DOI : 10.1111/j.1469-1809.1936.tb02137.x

W. N. Francis and H. Kučera, Computational analysis of present-day American English, 1967.

W. N. Francis and H. Kučera, Frequency analysis of English usage : lexicon and grammar, 1982.

H. G. Gauch, R. H. Whittaker, and T. R. Wentworth, A Comparative Study of Reciprocal Averaging and Other Ordination Techniques, Journal of Ecology, vol.65, issue.1, pp.157-174, 1977.

R. C. Geary, The Contiguity Ratio and Statistical Mapping, The Incorporated Statistician, pp.115-145, 1954.

J. Goldstein and R. Sabin, Using Speech Acts to Categorize Email and Identify Email Genres, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06), p.50, 2006.
DOI : 10.1109/HICSS.2006.528

M. J. Greenacre, Theory and Applications of Correspondence Analysis, 1984.

M. Halkidi, Y. Batistakis, and M. Vazirgiannis, Cluster validity methods, ACM SIGMOD Record, vol.31, issue.2, pp.31-40, 2002.
DOI : 10.1145/565117.565124

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann et al., The WEKA data mining software, ACM SIGKDD Explorations Newsletter, vol.11, issue.1, pp.10-18, 2009.
DOI : 10.1145/1656274.1656278

T. Hawker and M. Honnibal, Improved default sense selection for word sense disambiguation, Proceedings of the Australasian Language Technology Workshop, pp.11-17, 2006.

G. H. Hildebrand and A. Mace, The Employment Multiplier in an Expanding Industrial Market : Los Angeles County, 1940-47, The Review of Economics and Statistics, vol.32, issue.3, pp.241-249, 1950.

M. Houle, H. P. Kriegel, P. Kröger, E. Schubert, and A. Zimek, Can Shared-Neighbor Distances Defeat the Curse of Dimensionality?, Scientific and Statistical Database Management, vol. 6187 of Lecture Notes in Computer Science, pp.482-500, 2010.
DOI : 10.1007/978-3-642-13818-8_34

L. Hubert and P. Arabie, Comparing partitions, Journal of Classification, vol.2, issue.1, pp.193-218, 1985.
DOI : 10.1007/BF01908075

D. Huron, The Humdrum Toolkit : Reference Manual, 1994.

D. Huron, Humdrum User's Guide, 1998.

F. Husson, J. Josse, S. Lê, and J. Mazet, FactoMineR : Multivariate Exploratory Data Analysis and Data Mining with R, R package version 1.25, 2013.

A. K. Jain, M. N. Murty, and P. J. Flynn, Data clustering: a review, ACM Computing Surveys, vol.31, issue.3, pp.264-323, 1999.
DOI : 10.1145/331499.331504

J. Karlgren and D. Cutting, Recognizing text genres with simple metrics using discriminant analysis, Proceedings of the 15th conference on Computational linguistics, pp.1071-1075, 1994.
DOI : 10.3115/991250.991324

URL : http://arxiv.org/abs/cmp-lg/9410008

S. N. Kim, L. Cavedon, and T. Baldwin, Classifying Dialogue Acts in One-on-One Live Chats, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp.862-871, 2010.

M. Koppel and J. Schler, Exploiting Stylistic Idiosyncrasies for Authorship Attribution, IJCAI'03 Workshop on Computational Approaches to Style Analysis and Synthesis, pp.69-72, 2003.

V. Kriesel, Music Synchronization, Audio Matching, Pattern Detection, and User Interfaces for a Digital Music Library System, Doctoral thesis, 2013.

V. Lavrenko and J. Pickens, Polyphonic music modeling with random fields, Proceedings of the eleventh ACM international conference on Multimedia, MULTIMEDIA '03, pp.120-129, 2003.
DOI : 10.1145/957013.957041

S. Lê, J. Josse, and F. Husson, FactoMineR : An R Package for Multivariate Analysis, Journal of Statistical Software, vol.25, issue.1, pp.1-18, 2008.

B. Le Roux and H. Rouanet, Geometric Data Analysis : From Correspondence Analysis to Structured Data Analysis, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00269083

B. Le Roux and H. Rouanet, Multiple Correspondence Analysis, no. 163 in Quantitative Applications in the Social Sciences, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00171885

L. Lebart and A. Salem, Statistique textuelle, 1994.

L. Lebart, Analyse Statistique de la Contiguïté. Publications de l, pp.81-112, 1969.

L. Lebart, Which Bootstrap for Principal Axes Methods?, Selected Contributions in Data Analysis and Classification, Studies in Classification, Data Analysis, and Knowledge Organization, pp.581-588, 2007.
DOI : 10.1007/978-3-540-73560-1_55

L. Lebart, A. Morineau, and M. Piron, Statistique exploratoire multidimensionnelle, 1995.

Y. Li, C. Luo, and S. M. Chung, Text Clustering with Feature Selection by Using Statistical Data, IEEE Transactions on Knowledge and Data Engineering, vol.20, pp.641-652, 2008.

O. Luaces, J. Díez, J. Barranquero, J. J. del Coz, and A. Bahamonde, Binary relevance efficacy for multilabel classification, Progress in Artificial Intelligence, vol.1, issue.4, pp.303-313, 2012.
DOI : 10.1007/s13748-012-0030-x

URL : http://digibuo.uniovi.es/dspace/bitstream/10651/30616/2/Binary%20Relevance%20Efficacy%20for%20Multilabel%20Classification.pdf

J. MacQueen, Some methods for classification and analysis of multivariate observations, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp.281-297, 1967.

D. Malrieu and F. Rastier, Genres et variations morphosyntaxiques, pp.547-577, 2001.
URL : https://hal.archives-ouvertes.fr/halshs-00161624

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, 1st ed., 1999.

K. V. Mardia, J. T. Kent, and J. M. Bibby, Multivariate analysis, 1979.

G. Matheron, Les variables régionalisées et leur estimation : une application de la théorie des fonctions aléatoires aux sciences de la nature, 1965.

G. J. McLachlan and T. Krishnan, The EM algorithm and extensions, 1997.

M. Meilă, Comparing Clusterings by the Variation of Information, Learning Theory and Kernel Machines, vol. 2777 of Lecture Notes in Computer Science, pp.173-187, 2003.

G. A. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

G. W. Milligan and M. C. Cooper, A Study of the Comparability of External Criteria for Hierarchical Cluster Analysis, Multivariate Behavioral Research, vol.21, issue.4, pp.441-458, 1986.
DOI : 10.1207/s15327906mbr2104_5

P. A. Moran, Notes on Continuous Stochastic Phenomena, Biometrika, vol.37, issue.1-2, pp.17-23, 1950.
DOI : 10.1093/biomet/37.1-2.17

B. Morando, L'analyse statistique des partitions de musique, Pratique de l'analyse des données, tome 3 : Linguistique et lexicologie, pp.507-522, 1981.

M. Müller and S. Ewert, Chroma Toolbox : Matlab Implementations for Extracting Variants of Chroma-based Audio Features, Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR), pp.215-220, 2011.

F. Murtagh and P. Legendre, Ward's hierarchical clustering method : Clustering criterion and agglomerative algorithm, 2011.

O. Nenadic and M. Greenacre, Correspondence Analysis in R, with Two- and Three-dimensional Graphics : The ca Package, Journal of Statistical Software, vol.20, issue.3, pp.1-13, 2007.

A. Palmer, E. Ponvert, J. Baldridge, and C. Smith, A Sequencing Model for Situation Entity Classification, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.896-903, 2007.

C. H. Park and M. Lee, On applying linear discriminant analysis for multi-labeled problems, Pattern Recognition Letters, vol.29, issue.7, pp.878-887, 2008.
DOI : 10.1016/j.patrec.2008.01.003

T. Pedersen, S. Patwardhan, and J. Michelizzi, WordNet::Similarity - Measuring the Relatedness of Concepts, HLT-NAACL 2004 : Demonstration Papers, pp.38-41, 2004.

D. Pfitzner, R. Leibbrandt, and D. Powers, Characterization and evaluation of similarity measures for pairs of clusterings, Knowledge and Information Systems, vol.19, issue.3, pp.361-394, 2009.
DOI : 10.1007/s10115-008-0150-6

A. Qadir and E. Riloff, Classifying Sentences as Speech Acts in Message Board Posts, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp.748-758, 2011.

R Core Team, R : A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, 2013.

J. Read, B. Pfahringer, G. Holmes, and E. Frank, Classifier chains for multi-label classification, Machine Learning, pp.333-359, 2011.
DOI : 10.1007/978-3-642-04174-7_17

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.148.1174

J. Rennie, WordNet::QueryData : a Perl module for accessing the WordNet database, 2000.

P. Resnik, Using Information Content to Evaluate Semantic Similarity in a Taxonomy, Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95), pp.448-453, 1995.

P. Resnik, Semantic Similarity in a Taxonomy : An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language, Journal of Artificial Intelligence Research, vol.11, pp.95-130, 1999.

P. Robert and Y. Escoufier, A Unifying Tool for Linear Multivariate Statistical Methods: The RV-Coefficient, Applied Statistics, vol.25, issue.3, pp.257-265, 1976.
DOI : 10.2307/2347233

K. Rose, E. Gurewitz, and G. C. Fox, Statistical mechanics and phase transitions in clustering, Physical Review Letters, vol.65, issue.8, pp.945-948, 1990.
DOI : 10.1103/PhysRevLett.65.945

A. L. Rukhin and R. Vallejos, Codispersion coefficients for spatial and temporal series, Statistics & Probability Letters, vol.78, issue.11, pp.1290-1300, 2008.
DOI : 10.1016/j.spl.2007.11.017

URL : http://hdl.handle.net/10533/85873

G. Salton and M. J. McGill, Introduction to Modern Information Retrieval, McGraw-Hill computer science series, 1983.

A. V. Samsonovich, Semantic cross-correlation as a measure of social interaction, Biologically Inspired Cognitive Architectures, vol.7, pp.1-8, 2014.
DOI : 10.1016/j.bica.2013.12.001

G. Saporta, Probabilités, analyse des données et statistique, 2006.

C. S. Sapp, Online Database of Scores in the Humdrum File Format, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR), pp.664-665, 2005.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of the International Conference on New Methods in Language Processing, pp.44-49, 1994.

I. J. Schoenberg, On Certain Metric Spaces Arising From Euclidean Spaces by a Change of Metric and Their Imbedding in Hilbert Space, The Annals of Mathematics, vol.38, issue.4, pp.787-793, 1937.
DOI : 10.2307/1968835

I. J. Schoenberg, Metric Spaces and Positive Definite Functions, Transactions of the American Mathematical Society, pp.522-536, 1938.
DOI : 10.1090/s0002-9947-1938-1501980-0

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.377.3750

J. R. Searle, Speech Acts : An Essay in the Philosophy of Language, 1969.
DOI : 10.1017/CBO9781139173438

F. Sebastiani, Machine learning in automated text categorization, ACM Computing Surveys, vol.34, issue.1, pp.1-47, 2002.
DOI : 10.1145/505282.505283

C. S. Smith, Modes of Discourse : The Local Structure of Texts, no. 103 in Cambridge Studies in Linguistics, 2003.
DOI : 10.1017/CBO9780511615108

M. Sokolova and G. Lapalme, A systematic analysis of performance measures for classification tasks, Information Processing & Management, vol.45, issue.4, pp.427-437, 2009.
DOI : 10.1016/j.ipm.2009.03.002

A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates et al., Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech, Computational Linguistics, vol.26, issue.3, pp.339-373, 2000.
DOI : 10.1162/089120100561737

G. Tsoumakas, I. Katakis, and I. Vlahavas, Mining Multi-label Data, Data Mining and Knowledge Discovery Handbook, pp.667-685, 2010.
DOI : 10.1007/978-0-387-09823-4_34

V. Van Asch, Macro- and micro-averaged evaluation measures, 2012.

C. J. van Rijsbergen, Information retrieval, 2nd ed., 1979.

I. Vatolkin, Improving Supervised Music Classification by Means of Multi-Objective Evolutionary Feature Selection, Doctoral thesis, 2013.

M. J. Warrens, On Association Coefficients for 2×2 Tables and Properties That Do Not Depend on the Marginal Distributions, Psychometrika, vol.73, issue.4, pp.777-789, 2008.
DOI : 10.1007/s11336-008-9070-3

C. Weihs, U. Ligges, F. Mörchen, and D. Müllensiefen, Classification in Music Research, Advances in Data Analysis and Classification, pp.255-291, 2007.

Y. Yang, An Evaluation of Statistical Approaches to Text Categorization, Information Retrieval, vol.1, issue.1/2, pp.69-90, 1999.
DOI : 10.1023/A:1009982220290

Y. Yang and J. O. Pedersen, A Comparative Study on Feature Selection in Text Categorization, Proceedings of the 14th International Conference on Machine Learning, pp.412-420, 1997.

G. Youness and G. Saporta, Une Méthodologie pour la Comparaison de Partitions, pp.97-120, 2004.

G. Young and A. Householder, Discussion of a set of points in terms of their mutual distances, Psychometrika, vol.3, issue.1, pp.19-22, 1938.
DOI : 10.1007/BF02287916

G. U. Yule, On the Association of Attributes in Statistics: With Illustrations from the Material of the Childhood Society, &c, Philosophical Transactions of the Royal Society of London, Series A, Containing Papers of a Mathematical or Physical Character, vol.194, pp.257-319, 1900.
DOI : 10.1098/rsta.1900.0019

G. U. Yule, On the Methods of Measuring Association Between Two Attributes, Journal of the Royal Statistical Society, vol.75, issue.6, pp.579-652, 1912.
DOI : 10.2307/2340126