[. Bibliography and . Abdallah, Fraud detection system: A survey, Journal of Network and Computer Applications, 2016.

R. Acuna, E. Acuna, and C. Rodriguez, The treatment of missing values and its effect on classifier accuracy. Classification, clustering and data mining applications, 2004.

J. Alaiz-rodriguez, R. Japkowicz, and N. , Assessing the impact of changing environments on classifier performance. Conference of the Canadian Society for Computational Studies of Intelligence, 2008.

A. , Consumer-facing technology fraud: Economics, attack methods and potential solutions, Future Generation Computer Systems, vol.100, 2019.

. Altmann, Permutation importance: a corrected feature importance measure, Bioinformatics, vol.26, issue.10, 2010.

. Baesens, Fraud analytics using descriptive, predictive, and social network techniques: A guide to data science for fraud detection, 2015.

[. Bahnsen, Detecting credit card fraud using periodic features, IEEE 14th International Conference on Machine Learning and Applications, 2015.

[. Bahnsen, Feature engineering strategies for credit card fraud detection, Expert Systems with Applications, 2016.

J. P. Barddal and F. Enembreck, Learning regularized hoeffding trees from data streams, 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019), 2019.

[. Batista, Applying one-sided selection to unbalanced datasets, MICAI 2000: Advances in Artificial Intelligence, 2000.

[. Batista, A study of the behavior of several methods for balancing machine learning training data, ACM SIGKDD Explorations Newsletter, 2004.

K. Bauer, E. Bauer, and R. Kohavi, An empirical comparison of voting classification algorithms: Bagging, boosting, and variants, Machine learning, vol.36, pp.1-2, 1999.

L. E. Baum, An inequality and associated maximization technique. Statistical estimation for probabilistic functions of Markov processes, 1972.

J. S. Bayer-;-bayer, Learning sequence representations, 2015.

F. Bengio, Y. Bengio, and P. Frasconi, An input output hmm architecture, Advances in neural information processing systems, 1995.

[. Bengio, Learning long-term dependencies with gradient descent is difficult, 1994.

. Best, D. J. Fisher-;-best, and N. I. Fisher, Efficient simulation of the von mises distribution, Journal of the Royal Statistical Society, 1979.

[. Bhattacharyya, Data mining for credit card fraud: A comparative study, 2011.

. Bickel, Discriminative learning for differing training and test distributions, Proceedings of the 24th International conference on Machine learning, 2007.

G. Bifet, A. Bifet, and R. Gavalda, Adaptive learning from evolving data streams, International Symposium on Intelligent Data Analysis, 2009.

[. Blattberg, Database marketing: analyzing and managing customers, 2008.

. Bolton, R. J. Hand-;-bolton, and D. J. Hand, Unsupervised profiling methods for fraud detection, Credit Scoring and Credit Control, 2001.

. Bolton, R. J. Hand-;-bolton, and D. J. Hand, Statistical fraud detection: a review, 2002.

. Bontemps, Collective anomaly detection based on long short-term memory recurrent neural networks, International Conference on Future Data and Security Engineering, 2016.

L. Breiman, Bagging predictors, Machine learning, vol.24, issue.2, 1996.

L. Breiman, Random forests, Machine learning, vol.45, issue.1, 2001.

[. Carcillo, Scarff; a scalable framework for streaming credit card fraud detection with spark, 2018.

[. Chandola, Anomaly detection for discrete sequences: A survey, IEEE Transactions on Knowledge and Data Engineering, 2012.

N. V. Chawla-;-chawla, Data mining for imbalance datasets: an overview. Data mining and knowledge discovery handbook, 2005.

[. Chawla, Smote: Synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, vol.16, 2002.

[. Chawla, Editorial: special issue on learning from imbalanced data sets, ACM SIGKDD Explorations Newsletter, 2004.

[. Chawla, Smoteboost: Improving prediction of the minority class in boosting, 2003.

. De-fortuny, Corporate residence fraud detection, Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, 2014.

[. Delamaire, Credit card fraud and detection techniques: a review. Banks and Bank systems, International Journal of Soft Computing and Engineering, 2009.

T. G. Dietterich, Machine learning for sequential data: A review, Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), 2002.

[. Dorj, A bayesian hidden markov model-based approach for anomaly detection in electronic systems, IEEE Aerospace conference, 2013.

[. Drummond, C. Holte-;-drummond, and R. Holte, C4.5, class imbalance, and cost sensitivity: why undersampling beats over-sampling, Workshop on Learning from Imbalanced Datasets II, 2003.

C. Elkan-;-elkan, The foundations of cost-sensitive learning, ternational Joint Conference on Artificial Intelligence, 2001.

[. Ergen, Unsupervised and semi-supervised anomaly detection with lstm neural networks, 2017.

. Estabrooks, A multiple resampling method for learning from imbalanced data sets, Computational Intelligence, 2004.

, Situation report: payment card fraud in the european union. perspective of law enforcement agencies, 2012.

R. Fabry, equensworldline fraud risk management. Credit Card Fraud Detection Workshop, 2019.

[. Fan, Adacost: misclassification cost-sensitive boosting, 1999.

P. Fawcett, T. Fawcett, and F. Provost, Adaptive fraud detection. Data mining and knowledge discovery, 1997.

[. Forrest, Detecting intrusions using system calls: Alternate data models, Proceedings of the 1999 IEEE ISRSP, 1999.

G. Fossi, L. Fossi, and G. Gianini, Managing a pool of rules for credit card fraud detection by a game theory based approach, Future Generations Computer Systems, 2019.

[. Fu, Credit card fraud detection using convolutional neural networks. International Conference on Neural Information Processing, 2016.

[. Gao, A general framework for mining concept-drifting data streams with skewed distributions, Proceedings of the 2007 SIAM International Conference, 2007.

R. ;. Ghosh, S. Ghosh, and D. L. Reilly, Credit card fraud detection with a neural-network, IEEE Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences, 1994.

[. Gomes, Adaptive random forest for evolving data stream classification, Machine Learning, vol.106, issue.9, 2017.

[. Gornitz, Hidden markov anomaly detection, 32nd International Conference on Machine Learning, 2015.

A. Graves-;-graves, Supervised sequence labelling with recurrent neural networks, vol.385, 2012.

J. Graves, A. Graves, and N. Jaitly, Towards end-to-end speech recognition with recurrent neural networks, Proceedings of the 31st International Conference on Machine Learning, 2014.

[. Gretton, Covariate shift by kernel mean matching, Dataset shift in machine learning, vol.3, issue.4, 2009.

[. Guo, C. Berkhahn-;-guo, and F. Berkhahn, Entity embeddings of categorical variables, 2016.

[. Han, Borderlinesmote: a new oversampling method in imbalanced data sets learning, Advances in intelligent computing, 2005.

[. He, Adasyn: Adaptive synthetic sampling approach for imbalanced learning, IEEE International Joint Conference on Neural Networks, 2008.

J. Hoare-;-hoare, What is a decision tree? www.displayr.com/whatis-a-decision-tree, 2019.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation, vol.9, 1997.

. Hoens, Heuristic updatable weighted random subspaces for non-stationary environments, IEEE 11th International Conference on Data Mining, 2011.

[. Hofmeyr, Intrusion detection using sequences of system calls, Journal of Computer Security, 1998.

[. Hyunsoo, Missing value estimation for dna microarray gene expression data: local least squares imputation, Bioinfomatics, vol.21, issue.2, 2005.

S. ;. Japkowicz, N. Japkowicz, and S. Stephen, The class imbalance problem: a systematic study, 2002.

[. Jha, Employing transaction aggregatin strategy to detect credit card fraud, Expert Systems with Applications, 2012.

[. Jurgovsky, Sequence classification for credit-card fraud detection, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01699528

[. Karax, Decision tree-based feature ranking in concept drifting data streams, 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019), 2019.

. Kelly, The impact of changing populations on classifier performance, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining ACM, 1999.

C. Kingsford and S. L. Salzberg, What are decision trees?, Nat Biotechnol, vol.26, issue.9, 2008.

R. Klinkenberg, Learning drifting concepts: Example selection vs. example weighting, Intelligent Data Analysis, vol.8, 2003.

[. Kolter, J. Z. Maloof-;-kolter, and M. A. Maloof, Dynamic weighted majority: A new ensemble method for tracking concept drift, Third IEEE International conference on data mining, 2003.

S. Kotsiantis, Supervised machine learning: A review of classification techniques, 2007.

M. Kubat and S. Matwin, Addressing the curse of imbalanced training sets: one-sided selection, 1997.

K. Kukar, M. Kukar, and I. Kononenko, Costsensitive learning with neural networks, 1998.

[. Lafferty, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proceedings of the 18th International Conference on Machine Learning, 2001.

N. Laleh and M. A. Azgomi, A taxonomy of frauds and fraud detection techniques. International Conference on Information Systems, Technology and Management, 2009.

T. Brodley-;-lane and C. E. Brodley, Approaches to online learing and concept drift for user identification in computer security, 1998.

R. N. Lichtenwalter and N. V. Chawla, Adaptive methods for classification in arbitrarily imbalanced and drifting data streams, Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2009.

[. Liu, Exploratory undersampling for class-imbalance learning, Systems, Man, and Cybernetics, part B: Cybernetics, 2009.

[. Lucas, Multiple perspectives hmm-based feature engineering for credit card fraud detection, 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019), 2019.
URL : https://hal.archives-ouvertes.fr/hal-02178012

[. Lucas, Towards automated feature engineering for credit card fraud detection using multi-perspective hmms, Future Generations Computer Systems Special, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02278223

[. Lucas, Dataset shift quantification for credit card fraud detection, Artificial intelligence and knowledge engineering, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02178042

[. Malhotra, Long short term memory networks for anomaly detection in time series, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, 2015.

[. Mani, I. Zhang-;-mani, and I. Zhang, knn approach to unbalanced data distributions: a case study involving information extraction, Proceedings of Workshop on Learning from Imbalanced Datasets, 2003.

[. Mccallum, Maximum entropy markov models for information extraction and segmentation, 17th International Conference on Machine Learning, 2000.

[. Mikolov, Empirical evaluation and combination of advanced language modeling techniques, 2011.

. Moreno-torres, A unifying view on dataset shift in classification, 2012.

. Moreno-torres, Repairing fractures between data using genetic programming-based feature extraction: A case study in cancer diagnosis, 2013.

F. F. Noghani and M. H. Moattar, Ensemble classification and extended feature selection for credit card fraud detection, Journal of AI and Data Mining, 2015.

[. Oba, A bayesian missing value estimation method for gene expression profile data, Bioinfomatics, vol.19, issue.16, 2003.

E. Pastor and E. Baralis, Explaining black box models by means of local rules, 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019), 2019.

[. Patidar, Credit card fraud detection using neural network, International Journal of Soft Computing and Engineering, 2011.

[. Pedregosa, Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

D. Power-;-power, Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation, Journal of machine learning technology, issue.1, 2011.

A. D. Pozzolo, Adaptive machine learning for credit card fraud detection, 2015.

[. Pozzolo, Credit card fraud detection: a realistic modeling and a novel learning strategy, 2017.

[. Pozzolo, Learned lessons in credit card fraud detection from a practitioner perspective, 2014.

R. J. Quinlan-;-quinlan, Bagging, boosting and c4.5. AAAI/IAAI, vol.1, 1996.

L. R. Rabiner and B. H. Juang, Hidden markov models for speech recognition, 1991.

[. Rashid, A new take on detecting insider threats: exploring the use of hidden markov models, ACM CCS International Workshop on Managing Insider Security Threats (MIST), 2016.

R. , A. Robinson, W. N. Aria, and A. , Sequential fraud detection for prepaid cards using hidden markov model divergence. Expert Systems with Applications, 2018.

[. Rumelhart, Learning representations by back-propagating errors. Cognitive modeling, 1986.

[. Russac, Embeddings of categorical variables for sequential data in fraud context, International Conference on Advanced Machine Learning Technologies and Applications, 2018.

L. S. Shapley, A value for n-person games, Annals of Mathematics Study, 1953.

H. Shimodaira, Improving predictive inference under covariate shift by weighting the log-likelihood function, Journal of statistical planning and inference, vol.90, issue.2, 2000.

H. M. Shirazi and N. Vasconcelos, Risk minimization, probability elicitation and cost-sensitive svms, 2010.

[. Srivastava, Credit card fraud detection using hidden markov model, IEEE Transactions on dependable and secure computing, 2008.

D. J. Stekhoven and P. Buhlmann, , 2011.

, Missforest-non-parametric missing value imputation for mixed-type data, Bioinfomatics, vol.28, issue.1

A. Storkey, When training and test sets are different: characterizing learning transfer. Dataset shift in machine learning, 2009.

[. Sugiyama, , 2007.

, Covariate shift adaptation by importance weighted cross validation, Journal of Machine Learning Research, vol.8

. Sun, , 2007.

, Cost-sensitive boosting for classification of imbalanced data. Pattern Recognition

[. Troyanskaya, Missing value estimation methods for dna microarrays, Bioinfomatics, vol.17, issue.6, 2001.

S. Visa and A. Ralescu, Issues in mining imbalanced data sets: a review paper, Proceedings of the sixteen midwest artificial intelligence and cognitive science conference, 2005.

A. J. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions Information Theory, p.13, 1967.

[. Vlasselaer, Apate: A novel approach for automated credit card transactions fraud detection using network-based extensions, 2015.

[. Wang, Mining changes of classification by correspondance tracing, Proceedings of the 2003 SIAM International COnference on Data Mining, 2003.

[. Wang, Diversity exploration and negative correlation learning on imbalanced data sets, International Joint Conference on Neural Networks, 2009.

T. Webb, G. I. Webb, and K. M. Ting, On the application of roc analysis to predict classification performance under varying class distributions, Machine Learning, vol.58, 2005.

P. Weiss, G. M. Weiss, and F. Provost, The effect of class distribution on classifier learning: an empirical study, 2001.

[. Whitrow, Transaction aggregatin as a strategy for credit card fraud detection, Data Mining and Knowledge Discovery, vol.18, issue.1, 2008.

K. Widmer, G. Widmer, and M. Kubat, Learning in the presence of context drift and hidden contexts, Machine Learning, vol.23, 1996.

C. ;. Wu, G. Wu, and E. Y. Chang, Class-boundary alignment for imbalanced dataset learning, ICML workshop on learning from imbalanced data sets II, 2003.

[. Yan, On predicting rare classes with svm ensembles in scene classification, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003.

[. Zadrozny, Costsensitive learning by cost-proportionate example weighting, Data Mining ICDM, 2003.

[. Zintgraf, Visualising deep neural network decisions: prediction difference analysis, 2017.