C. Sommaire,

, Description Mixture of logistic regressions parameters (H)estimation with (U)spectral methods. The main methods take d-dimensional inputs and a vector of binary outputs, and return parameters according to the GLMs mixture model (General Linear Model), the PhD thesis of Mor-Absa Loum

. Author-benjamin-auder, Mor-Absa Loum "Mor-Absa.Loum@u-psud.fr

M. Benjamin, Auder@u-psud.fr" Depends R(>= 3.0.0), Imports MASS, jointDiag, methods

. Collate-'utils.r'-'a-namespace,

. Date,

, C.1 R topics documented

, Details The package devtools should be useful in development stage, since we rely on testthat for unit tests, and roxygen2 for documentation. knitr is used to generate the package vignette. Concerning the other suggested packages : ? tensor is used for comparing to some reference functions initially coded in R, Description Mixture of logistic regressions parameters (H)estimation with (U)spectral methods

, ? jointDiag allows to solve a joint diagonalization problem, providing a more robust solution compared to a single diagonalization

. Bibliographie,

. Manoj-kumar-mohapatra, R. Patra, and . Agrawala, Manifestation and outcome of concurrent malaria and dengue infection, Journal of Vector Borne Diseases, vol.49, issue.4, p.2012

M. B. Mushtaq, M. Qadri, and A. Rashid, Concurrent infection with dengue and malaria : An unusual presentation, Medecine, issue.2, 2013.

B. Carme, S. Matheus, G. Donutil, O. Raulin, M. Nacher et al., Concurent dengue and malaria in cayenne hospital, french guiana. Emerging Infections Diseases, vol.15, pp.668-671, 2009.

C. Subhash-arya, L. K. Mehta, N. Agarwal, K. Bharat, G. Mathai et al., Episodes of concurrent dengue and malaria, Dengue Bulletin, p.29, 2005.

S. Deresinski, Concurrent plasmodium vivax malaria and dengue, Emerging Infectious Diseases, vol.12, issue.11, p.1802, 2006.

N. Ali, A. Nadeem, M. Anwar, W. Uz-zaman, R. Tariq et al., Dengue fever in malaria endemic areas, Journal of theCollege of Physicians and Surgeons-Pakistan, vol.16, issue.5, pp.340-342, 2006.

N. Senn, D. Suarkia, D. Manong, P. M. Siba, and W. Mcbride, Contribution of dengue fever to the burden of acute febrile illnesses in papua new guinea : An age-specific prospective study, The American Journal of Tropical Medecine and Hygiene, vol.85, issue.1, pp.132-137, 2011.

N. Remi, P. Charrel, C. Brouqui, X. Foucault, and . De-lamballerie, Concurrent dengue and malaria. Emerging Infections Diseases, vol.11, pp.1153-1154, 2005.

. Vitor-r-mendonça, . Bruno-b-andrade, M. Belisa, . Souza, C. Ligia et al., Unravelling the patterns of host immune responses in plasmodium vivax malaria and dengue co-infection, vol.14, p.315, 2015.

M. Baba, C. H. Logue, B. Oderinde, H. Abdulmaleek, J. Williams et al., Evidence of arbovirus co-infection in suspected febrile malaria and typhoid patients in nigeria, The Journal of Infection in Developing Countries, vol.7, issue.1, pp.51-59, 2013.

S. Thiam, M. Thior, B. Faye, M. B. Médoune-n-diop,-mamadou-lamine-diouf, I. Diouf et al., Major reduction in anti-malarial drug consumption in senegal after nation-wide introduction of malaria rapid diagnostic tests, PloS One, vol.6, issue.4, pp.1-7, 2011.

A. Sow, C. Loucoubar, D. Diallo, O. Faye, Y. Ndiaye et al., Denis Malvy, and Amadou Alpha Sall. Concurrent malaria and arbovirus infections in kedougou, southeastern senegal, Malaria Journal, vol.15, p.47, 2016.

J. A. Nelder and R. W. Wedderburn, Generalized linear models, Journal of the Royal Statistical Society. Series A (General), vol.135, issue.3, pp.370-384, 1972.

P. Mccullagh and J. A. Nelder, Generalized linear models, second edition, 1989.

A. Agresti, Categorical Data Analysis, 2013.

A. Antoniadis, J. Berruyer, and R. Carmona, Régression non linéaire et applications, vol.01, 1992.

W. David, S. Hosmer, and . Lemeshow, Applied Logistic Regression, 2001.

S. Newcomb, A generalized theory of the combination of observations so as to obtain the best result, American Journal of Mathematics, vol.8, issue.4, p.1886

K. Pearson, Iii. contributions to the mathematical theory of evolution, Philosophical Transactions of the Royal Society of London A : Mathematical, Physical and Engineering Sciences, vol.185, pp.71-110, 1894.
URL : https://hal.archives-ouvertes.fr/in2p3-00803630

S. Frühwirth and -. Schnatter, Finite mixture and Markov switching models

B. Grün and F. Leisch, Finite mixtures of generalized linear regression models, Recent Advances in Linear Models and Related Areas : Essays in Honour of Helge Toutenburg, pp.205-230, 2008.

M. Aitkin, A general maximum likelihood analysis of overdispersion in generalized linear models, Statistics and Computing, vol.6, issue.3, pp.251-262, 1996.

A. Dean, D. Follmann, and . Lambert, Generalizing logistic regression by nonparametric mixing, Journal of the American Statistical Association, vol.84, issue.405, pp.295-300, 1989.

P. Wang, M. L. Puterman, I. Cockburn, and N. Le, Mixed poisson regression models with covariate dependent rates, Biometrics, vol.52, issue.2, pp.381-400, 1996.

P. Wang and M. L. Puterman, Mixed logistic regression models, Journal of Agricultural, Biological, and Environmental Statistics, vol.3, issue.2, pp.175-200, 1998.

G. Boiteau, M. Singh, R. P. Singh, G. C. Tai, and T. R. Turner, Rate of spread of pvyn by alatemyzus persicae (sulzer) from infected to healthy plants under laboratory conditions, Potato Research, vol.41, issue.4, pp.335-344, 1998.

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the em algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, issue.1, pp.1-38, 1977.

M. Christopher, M. Bishop, and . Svensén, Bayesian hierarchical mixtures of experts, 2012.

M. I. Jordan and R. A. Jacobs, Hierarchical mixtures of experts and the em algorithm, Neural Comput, vol.6, issue.2, pp.181-214, 1994.

L. Xu, M. I. Jordan, and G. E. Hinton, An alternative model for mixtures of experts, Advances in Neural Information Processing Systems, vol.7, pp.633-640, 1995.

B. Grün, Fitting finite mixtures of linear mixed models with the EM algorithm, Compstat 2008-Proceedings in Computational Statistics, vol.II, pp.165-173, 2008.

P. Mccullagh, Tensor methods in statistics. Monographs on statistics and applied probability (Series), 1987.

H. Sedghi, M. Janzamin, and A. Anandkumar, Provable tensor methods for learning mixtures of generalized linear models, Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, vol.51, pp.9-11, 2016.

A. Anandkumar, D. Hsu, and S. M. Kakade, A method of moments for mixture models and hidden markov models, 2012.

A. Anandkumar, R. Ge, D. Hsu, M. Sham, M. Kakade et al., Tensor decompositions for learning latent variables models, Journal of machine learning, 2014.

A. Anandkumar, M. Janzamin, and H. Sedghi, Provable tensor methods for learning mixtures of generalized linear model, 2015.

A. Podosinnikova, F. Bach, and S. Lacoste-julien, Rethinking LDA : Moment Matching for Discrete ICA, NIPS 2015-Advances in Neural Information Processing Systems, vol.28, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01225271

A. Souloumiac, Nonorthogonal joint diagonalization by combining givens and hyperbolic rotations, IEEE Transactions on Signal Processing, vol.57, issue.6, pp.2222-2231, 2009.

B. Afsari, Sensitivity analysis for the problem of matrix joint diagonalization, SIAM Journal on Matrix Analysis and Applications, vol.30, issue.3, pp.1148-1171, 2008.

S. Coles, An introduction to Statistical Modeling of Extreme Values, 2001.

M. R. Leadbetter, G. Lindgren, and H. Rootzen, Extremes and Related Properties of Random Sequences and Processes, 1983.

J. Pickands, Statistical inference using extreme order statistics, Annals of Statistics, vol.3, issue.1, pp.119-131, 1975.

G. Toulemonde, Estimation et tests en théorie des valeurs extrêmes, 2008.

P. Ndao, Modélisation de valeurs extrêmes conditionnelles en présence de censure, 2015.

M. Garrido, Modélisation desévènementsdesévènements rares et estimation des quantiles extrêmes, méthodes de sélection de modèles pour les queues de distribution, 2002.

A. Lekina, Estimation non-paramétrique des quantiles extrêmes conditionnels, 2010.

G. Stupfler, Un modèle de Markov caché en assurance et Estimation defrontì ere et de point terminal, 2011.

L. Gardes, Estimation d'une fonction quantile extrême, 2003.

J. El-methni, ContributionsàContributionsà l'estimation de quantiles extrêmes, 2013.

Y. Hu, Extreme Value Mixture Modelling with Simulation Study and Applications in Finance and Insurance, 2013.

A. E. Macdonald, Extreme Value Mixture Modelling with Medical and Industrial Applications, 2012.

M. A. Mor-absa-loum, A. Poursat, and . Sow, Multinomial logistic model for coinfection diagnosis between arbovirus and paludisme in kedougou, The International Journal of Biostatistics, 2017.

B. Mor-absa-loum, E. Auder, and . Gassiat, Mixture of generalized linear models : identifiability and applications, 2018.

A. Mor, B. Loum, and . Auder, Package r : Morpheus, mixture of generalized linear models, 2018.

G. Mor-absa-loum, A. Celeux, and . Diop, Extreme value mixture under random censoring, 2018.

A. Macdonald, C. J. Scarrott, D. Lee, B. Darlow, M. Reale et al., A flexible extreme value mixture model, Computational Statistics and Data Analysis, vol.55, issue.6, pp.2137-2157, 2011.

R. Genuer, J. Poggi, and C. Tuleau-malot, Variable selection using random forests, Pattern Recognition Letters, vol.31, issue.14, pp.2225-2236, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00755489

H. Akaike, A new look at the statistical model identification, IEEE transactions on automatic control, vol.19, issue.6, pp.716-722, 1974.

G. James, D. Witten, T. Hastie, and R. Tibshirani, An Introduction to Statistical Learning : With Applications in R, 2014.

P. Branco, L. Torgo, and R. P. Ribeiro, A survey of predictive modeling on imbalanced domains, ACM Computing Surveys (CSUR), vol.49, issue.31, 2016.

L. Breiman, Random forest, Machine Learning, vol.45, pp.5-32, 2001.

J. Leo-breiman, R. Friedman, C. Olshen, and . Stone, Classification And Regression Trees, 1984.

R. Genuer, Forêts aléatoires : aspects théoriques, sélection de variables et applications, 2010.

R. Genuer and J. Poggi, Arbres cart et forêts aléatoires, importance et sélection de variables, 2017.

R. Genuer, J. Poggi, and C. Tuleau-malot, Vsurf : An r package for variable selection using random forests, The R Journal, vol.7, issue.2, p.2015
URL : https://hal.archives-ouvertes.fr/hal-01251924

C. Chen, A. Liaw, and L. Breiman, Using random forests to learn imbalanced data, 2006.

V. Nitesh and . Chawla, Data mining for imbalanced datasets : An overview, Data Mining and Knowledge Discovery Hand-book, SPRINGER, pp.875-886, 2010.

B. Krawczyk, Learning from imbalanced data : open challenges and future directions, Progress in Artificial Intelligence, vol.5, issue.4, pp.221-232, 2016.

S. Kakade, A. T. Kalai, V. Kanade, and O. Shamir, Efficient learning of generalized linear and single index models with isotonic regression, 2011.

A. Quattoni, M. Collins, and T. Darrell, Conditional random fields for object recognition, Advances in Neural Information Processing Systems, vol.17, pp.1097-1104, 2005.

Y. Wang and G. Mori, Max-margin hidden conditional random fields for human action recognition, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.872-879, 2009.

S. Petrov and D. Klein, Discriminative log-linear grammars with latent variables, Advances in Neural Information Processing Systems, vol.20, pp.1153-1160, 2008.

P. Liang, A. Bouchard-côté, D. Klein, and B. Taskar, An end-toend discriminative approach to machine translation, Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pp.761-768, 2006.

H. Sedghi, M. Janzamin, and A. Anandkumar, Provable tensor methods for learning mixtures of generalized linear models, Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, vol.51, pp.9-11, 2016.

C. Stein, A bound for the error in the normal approximation to the distribution of a sum of dependent random variables, Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, vol.2, pp.583-602, 1972.

A. W. Van-der and . Vaart, Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics, 1998.

C. Gouy-pailler, Vers une modélisation dynamique de l'activité cérébrale pour la conception d'interfaces cerveau-machines asynchrones, 2009.

P. Tichavsky and A. Yeredor, Fast approximate joint diagonalization incorporating weight matrices, IEEE Transactions on Signal Processing, vol.57, issue.3, pp.878-891, 2009.

A. Frigessi, O. Haug, and H. Rue, A dynamic mixture model for unsupervised tail estimation without threshold selection, Extremes, vol.5, issue.3, pp.219-235, 2002.

N. Cible, . Behrens, F. Hedibert, D. Lopes, and . Gamerman, Bayesian analysis of extreme events with threshold estimation, Statistical Modelling, vol.4, pp.227-244, 2004.

B. Vaz-de, M. Mendes, and H. Lopes, Data driven estimates for mixtures, Computational Statistics and Data Analysis, vol.47, issue.3, pp.583-598, 2004.

A. Tancredi, C. Anderson, and A. Hagan, Accounting for threshold uncertainty in extreme value estimation, Extremes, vol.9, issue.2, p.87, 2006.
DOI : 10.1007/s10687-006-0009-8

URL : http://paduaresearch.cab.unipd.it/7320/1/WP_2002_12.pdf

X. Zhao, C. Scarrott, L. Oxley, and M. Reale, Extreme value modelling for forecasting market crisis impacts, Applied Financial Economics, vol.20, issue.1-2, pp.63-72, 2010.
DOI : 10.1080/09603100903262947

X. Zhao, C. J. Scarrott, L. Oxley, and M. Reale, Garch dependence in extreme value models with bayesian inference, Selected Papers of the Combined IMACS World Congress and MSSANZ 18th Biennial Conference on Modelling and Simulation, vol.81, pp.13-17, 2009.
DOI : 10.1016/j.matcom.2010.08.002

URL : http://www.math.canterbury.ac.nz/%7Em.reale/pub/Zhaoetal2011.pdf

D. Lee, W. K. Li, and T. Wong, Modeling insurance claims via a mixture exponential model combined with peaks-over-threshold approach, Insurance : Mathematics and Economics, vol.51, issue.3, pp.538-550, 2012.
DOI : 10.1016/j.insmatheco.2012.07.008

F. Ferraz-do-nascimento, D. Gamerman, and H. Lopes, A semiparametric bayesian approach to extreme value estimation, Statistics and Computing, vol.22, issue.2, pp.661-675, 2012.

J. Carreau and Y. Bengio, A hybrid pareto mixture for conditional asymmetric fat-tailed distributions, IEEE Transactions on Neural Networks, vol.20, issue.7, pp.1087-1101, 2009.
DOI : 10.1109/tnn.2009.2016339

J. Carreau and Y. Bengio, A hybrid pareto model for asymmetric fat-tailed data : the univariate case, Extremes, vol.12, issue.1, pp.53-76, 2009.
DOI : 10.1007/s10687-008-0068-0

G. Stuart, J. A. Coles, and . Tawn, A bayesian analysis of extreme rainfall data, Journal of the Royal Statistical Society. Series C (Applied Statistics), vol.45, issue.4, pp.463-478, 1996.

G. Stuart, E. A. Coles, and . Powell, Bayesian methods in extreme value modelling : A review and new developments, International Statistical Review / Revue Internationale de Statistique, vol.64, issue.1, pp.119-136, 1996.

D. Babichev and F. Bach, Slice inverse regression with score functions. working paper or preprint, 2016.
DOI : 10.1214/18-ejs1428

URL : https://hal.archives-ouvertes.fr/hal-01388498

S. Wang, W. Yao, and M. Huang, A note on the identifiability of nonparametric and semiparametric mixtures of glms, Statistics and Probability Letters, vol.93, pp.41-45, 2014.

B. Grün and F. Leisch, Identifiability of finite mixtures of multinomial logit models with varying and fixed effects, Journal of Classification, vol.25, issue.2, pp.225-247, 2008.

A. Dean, D. Follmann, and . Lambert, Identifiability of finite mixtures of logistic regression models, Journal of Statistical Planning and Inference, vol.27, issue.3, pp.375-381, 1991.

P. Xu, H. Peng, and T. Huang, Unsupervised learning of mixture regression models for longitudinal data, Computational Statistics and Data Analysis, vol.125, pp.44-56, 2018.
DOI : 10.1016/j.csda.2018.03.012

URL : http://arxiv.org/pdf/1703.06277

A. Komarek and L. Komarkova, Clustering for multivariate continuous and discrete longitudinal data, The Annals of Applied Statistics, vol.7, issue.1, pp.177-200, 2013.

H. Kasahara and K. Shimotsu, Nonparametric identification of finite mixture models of dynamic discrete choices, Econometrica, vol.77, issue.1, pp.135-175, 2009.

M. Mondelli and A. Montanari, On the connection between learning twolayers neural networks and tensor decomposition, 2018.

S. Frühwirth and -. Schnatter, Finite mixture and Markov switching models, Springer Series in Statistics, 2006.

B. Auder and M. Loum, Morpheus : An R package to estimate parameters of logistic regressions mixtures, CRAN, 2018.

B. Grün and F. Leisch, Flexmix : An R package for finite mixture modelling. R News, vol.7, pp.8-13, 2007.

A. Dean, D. Follmann, and . Lambert, Identifiability of finite mixtures of logistic regression models, J. Statist. Plann. Inference, vol.27, issue.3, pp.375-381, 1991.

S. Wang, W. Yao, and M. Huang, A note on the identifiability of nonparametric and semiparametric mixtures of GLMs, Statist. Probab. Lett, vol.93, pp.41-45, 2014.

B. Grün and F. Leisch, Identifiability of finite mixtures of multinomial logit models with varying and fixed effects, J. Classification, vol.25, issue.2, pp.225-247, 2008.

A. Komárek and L. Komárková, Clustering for multivariate continuous and discrete longitudinal data, Ann. Appl. Stat, vol.7, issue.1, pp.177-200, 2013.