Consensus Modeling for HTS Assays Using In silico Descriptors Calculates the Best Balanced Accuracy in Tox21 Challenge, Frontiers in Environmental Science, vol.4, issue.2, 2016. ,
Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data, Toxicology Research, vol.5, issue.3, pp.883-894, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01907207
Introduction to machine learning, 2009. ,
,
A map of human genome variation from population-scale sequencing, Nature, vol.467, pp.1061-1073, 2010. ,
In Silico Prediction of Chemical-Induced Hepatocellular Hypertrophy Using Molecular Descriptors, Toxicological Sciences, vol.162, issue.2, pp.667-675, 2018. ,
Toxicity Testing in the 21st Century: Bringing the Vision to Life, Toxicological Sciences, vol.107, issue.2, pp.324-330, 2009. ,
The Vision of Toxicity Testing in the 21st Century: Moving from Discussion to Action, Toxicological Sciences, vol.117, issue.1, pp.17-24, 2010. ,
,
Adverse outcome pathways: A conceptual framework to support ecotoxicology research and risk assessment, Environmental Toxicology and Chemistry, vol.29, issue.3, pp.730-741, 2010. ,
Building a developmental toxicity ontology, Birth Defects Research, vol.110, pp.502-518, 2018. ,
Computational methods for prediction of in vitro effects of new chemical structures, Journal of Cheminformatics, vol.8, issue.1, p.51, 2016. ,
, Machine Learning Methods in Computational Toxicology, pp.119-139
An analysis of four missing data treatment methods for supervised learning, Applied Artificial Intelligence, vol.17, issue.5-6, pp.519-533, 2003. ,
How well can carcinogenicity be predicted by high throughput "characteristics of carcinogens" mechanistic data?, Regulatory Toxicology and Pharmacology, vol.90, pp.185-196, 2017. ,
,
In vitro to in vivo extrapolation for high throughput prioritization and decision making, Toxicology in Vitro, vol.47, pp.213-227, 2018. ,
The CAESAR project for in silico models for the REACH legislation, Chemistry Central journal, 4, 2010. ,
Predictive Models for Carcinogenicity and Mutagenicity: Frameworks, Stateof-the-Art, and Perspectives, Journal of Environmental Science and Health, vol.27, issue.2, pp.57-90, 2009. ,
VEGA-QSAR: AI inside a platform for predictive toxicology, CEUR Workshop Proceedings, vol.1107, p.2013 ,
Structure-Activity Relationship Studies of Chemical Mutagens and Carcinogens: Mechanistic Investigations and Prediction Approaches, Chemical Reviews, vol.105, issue.5, pp.1767-1800, 2005. ,
Endocrine Disruptors: Data-based survey of in vivo tests, predictive models and the Adverse Outcome Pathway, Regulatory Toxicology and Pharmacology, vol.86, pp.18-24, 2017. ,
Evaluation of the applicability of existing (Q)SAR models for predicting the genotoxicity of pesticides and similarity 160 ,
, analysis related with genotoxicity of pesticides for facilitating of grouping and read across, vol.16, p.1598, 2019.
ChEMBL: a large-scale bioactivity database for drug discovery, Nucleic Acids Research, vol.40, issue.D1, pp.1100-1107, 2011. ,
KNIME: The Konstanz Information Miner, Studies in Classification, Data Analysis, and Knowledge Organization, 2007. ,
RepDose and FeDTex: Two databases focusing on systemic toxicity: First examples from analyses of repeated dose toxicity and reprotoxicity studies, Toxicology Letters, vol.180, pp.202-210, 2008. ,
REP-DOSE: A database on repeated dose toxicity studies of commercial chemicals-A multifunctional tool, Regulatory Toxicology and Pharmacology, vol.46, issue.3, pp.202-210, 2006. ,
A Guide to Recurrent Neural Networks and Backpropagation, The Dallas project, 2002. ,
Network-based Approaches in Pharmacology, Molecular Informatics, vol.36, issue.10, p.1700048, 2017. ,
IPCS framework for analyzing the relevance of a noncancer mode of action for humans, Critical reviews in toxicology, vol.38, issue.2, pp.87-96, 2008. ,
, The Human Toxome Project. Alternatives to animal experimentation, vol.32, pp.112-124, 2015.
A comparison of ranking methods for classification algorithm selection, European Conference on Machine Learning, pp.63-75, 2000. ,
Bagging predictors, Machine Learning, vol.24, pp.123-140, 1996. ,
Random Forests, Machine Learning, vol.45, pp.5-32, 2001. ,
Algorithm 457: finding all cliques of an undirected graph, Communications of the ACM, vol.16, issue.9, pp.575-577, 1973. ,
Screening Chemicals for Estrogen Receptor Bioactivity Using a Computational Model, Environmental Science & Technology, vol.49, issue.14, pp.8804-8814, 2015. ,
Toxicity Testing in the 21st Century: A View from the Chemical Industry, Toxicological Sciences, vol.112, issue.2, pp.297-302, 2009. ,
Assessing compound carcinogenicity in vitro using connectivity mapping, Carcinogenesis, vol.35, issue.1, pp.201-207, 2014. ,
QSAR Modeling of Tox21 Challenge Stress Response and Nuclear Receptor Signaling Toxicity Assays, Frontiers in Environmental Science, vol.4, issue.43, pp.3389-3392, 2016. ,
Molecular fingerprint similarity search in virtual screening, Methods, vol.71, pp.58-63, 2015. ,
Evaluation of 309 Environmental Chemicals Using a Mouse Embryonic Stem Cell Adherent Cell Differentiation and Cytotoxicity Assay, PLoS ONE, vol.6, issue.6, p.18540, 2011. ,
LIBSVM: a library for support vector machines, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, p.27, 2011. ,
Application of Reverse Dosimetry to, Compare In Vitro and In Vivo Estrogen Receptor Activity. Applied In Vitro Toxicology, vol.1, issue.1, pp.33-44, 2015. ,
Semi-supervised learning, IEEE Transactions on Neural Networks, vol.20, issue.3, pp.542-542, 2009. ,
SMOTE: Synthetic Minority Over-sampling Technique, Journal of Artificial Intelligence Research, vol.16, pp.321-357, 2002. ,
Special issue on learning from imbalanced data sets, ACM Sigkdd Explorations Newsletter, vol.6, issue.1, pp.1-6, 2004. ,
An optimal convex hull algorithm in any fixed dimension. Discrete & Computational Geometry, vol.10, pp.377-409, 1993. ,
Decision threshold adjustment in class prediction, SAR and QSAR in environmental research, vol.17, pp.337-52, 2006. ,
Xgboost: A scalable tree boosting system, Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp.785-794, 2016. ,
Computational models to predict endocrine-disrupting chemical binding with androgen or oestrogen receptors, Ecotoxicology and Environmental Safety, vol.110, pp.280-287, 2014. ,
, QSAR Modeling: Where Have You Been? Where Are You Going To, vol.57, pp.4977-5010, 2014.
, , 2015.
The standard for the exchange of nonclinical data (SEND): Challenges and promises, Toxicologic Pathology, vol.46, issue.8, pp.1006-1012, 2018. ,
Advancing Computational Toxicology in the Big Data Era by Artificial Intelligence: Data-Driven and Mechanism-Driven Modeling for Chemical Toxicity, Chemical Research in Toxicology, 2019. ,
Chemical carcinogenicity revisited 3: Risk assessment of carcinogenic potential based on the current state of knowledge of carcinogenesis in humans, Regulatory Toxicology and Pharmacology, vol.103, pp.100-105, 2019. ,
Scientific opinion on the hazard assessment of endocrine disruptors: scientific criteria for identification of endocrine disruptors and appropriateness of existing test methods for assessing effects mediated by these substances on human health and the environment, EFSA Journal, vol.11, issue.3, p.3132, 2013. ,
, Risk Assessment in the Federal Government: Managing the Process, 1983.
, Toxicity Testing in the 21st Century: A Vision and a Strategy, 2007.
, National Research Council et al. Science and judgment in risk assessment, 1994.
Chapter 1 an introduction to chemical grouping, categories and read-across to predict toxicity, Chemical Toxicity Prediction: Category Formation and Read-Across, pp.1-29, 2013. ,
Highlight report: Launch of a large integrated European in vitro toxicology project: EU-ToxRisk, Archives of Toxicology, vol.90, issue.5, pp.1021-1024, 2016. ,
,
The mahalanobis distance. Chemometrics and intelligent laboratory systems, vol.50, pp.1-18, 2000. ,
How not to develop a quantitative structure-activity or structure-property relationship (QSAR/QSPR), SAR and QSAR in Environmental Research, vol.20, issue.3-4, pp.241-266, 2009. ,
Thiazopyr and Thyroid Disruption: Case Study Within the Context of the 2006 IPCS Human Relevance Framework for Analysis of a Cancer Mode of Action, Critical Reviews in Toxicology, vol.36, issue.10, pp.793-801, 2006. ,
Machine learning algorithms: a review, International Journal of Computer Science and Information Technologies, vol.7, issue.3, pp.1174-1179, 2016. ,
,
Endocrine-Disrupting Chemicals: An Endocrine Society Scientific Statement, Endocrine Reviews, vol.30, issue.4, pp.293-342, 2009. ,
Ensemble methods in machine learning, Multiple Classifier Systems, pp.1-15, 2000. ,
QSAR Toolbox -workflow and major functionalities, SAR and QSAR in Environmental Research, vol.27, issue.3, pp.203-219, 2016. ,
The EDKB: an established knowledge base for endocrine disrupting chemicals, BMC Bioinformatics, vol.11, issue.6, p.5, 2010. ,
The ToxCast Program for Prioritizing Toxicity Testing of Environmental Chemicals, Toxicological Sciences, vol.95, issue.1, pp.5-12, 2007. ,
,
Chemical carcinogenicity revisited 2: Current knowledge of carcinogenesis shows that categorization as a carcinogen or non-carcinogen is not scientifically credible, Regulatory Toxicology and Pharmacology, vol.103, pp.124-129, 2019. ,
Applied regression analysis, vol.326, 2014. ,
Molecular similarity-based predictions of the Tox21 screening outcome, Frontiers in Environmental Science, vol.3, p.54, 2015. ,
In Silico Prediction of Chemicals Binding to Aromatase with, Machine Learning Methods. Chemical Research in Toxicology, vol.30, pp.1209-1218, 2017. ,
, Computational Methods in Developing Quantitative Structure-Activity Relationships (QSAR): A Review. Combinatorial Chemistry & High Throughput Screening, vol.9, pp.213-228, 2006.
Reoptimization of MDL Keys for Use in Drug Discovery, Journal of Chemical Information and Computer Sciences, vol.42, issue.6, pp.1273-1280, 2002. ,
, Guidance for the identification of endocrine disruptors in the context of, European Chemical Agency (ECHA) and European Food Safety Authority (EFSA) with the technical support of the Joint Research Centre (JRC), p.16
Prediction of human population responses to toxic compounds by a collaborative competition, Nature Biotechnology, vol.33, issue.9, pp.933-940, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01246684
, Guidance on the establishment of the residue definition for dietary risk assessment, EFSA Journal, vol.14, 2016.
, A Computational Systems Biology Software Platform for Multiscale Modeling and Simulation: Integrating Whole-Body Physiology, Disease Biology, and Molecular Reaction Networks, vol.2, 2011.
The Foundations of Cost-Sensitive Learning, Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, pp.973-978, 2001. ,
Chemical Category Formation and Read-Across for the Prediction of Toxicity, pp.209-219, 2010. ,
, Guidance for the setting of an acute reference dose (ARfD), 2001.
, concerning the Registration, Evaluation, Authorization, and Restriction of Chemicals (REACH), establishing a European Chemicals Agency, amending Directive 1999/45/EC and repealing Council Regulation (EEC) No 793/93 and Commission Regulation (EC) No 1488/94 as well as Council Directive 76/769/EEC and Commission Directives 91/155/EC, European Commission. Regulation, 1907.
, on classification, labelling and packaging of substances and mixtures, amending and repealing Directives 67/548/EEC and 1999/45/EC, and amending Regulation (EC) No, vol.16, 1907.
, Regulation (EC) No 1107/2009 of the European Parliament and of the Council of 21 October 2009 concerning the placing of plant protection products on the market and repealing Council Directives, European Commission
In vitro-to-in vivo extrapolation (IVIVE) by PBTK modeling for animal-free risk assessment approaches of potential endocrine-disrupting compounds, Archives of Toxicology, vol.93, issue.2, pp.401-416, 2019. ,
Adaptive fraud detection, Data Mining Knowledge Discovery, vol.1, issue.3, pp.291-316, 1997. ,
Global or local QSAR: Is there a way out?, QSAR & Combinatorial Science, vol.28, pp.850-855, 2009. ,
The endocrine disruptor screening program developed by the us environmental protection agency, Ecotoxicology, vol.9, issue.1-2, pp.85-91, 2000. ,
Development and evaluation of a genomic signature for the prediction and mechanistic assessment of nongenotoxic hepatocarcinogens in the rat, Toxicological Sciences, 2011. ,
, The ToxCast Pipeline for High-Throughput Screening Data, vol.33, pp.618-320, 2016.
CPDB: Carcinogenic Potency Database, Medical Reference Services Quarterly, vol.27, issue.3, pp.303-311, 2008. ,
, United Nations Economic Commission for European Secretariat. Globally Harmonized System of Classification and Labelling of Chemicals (GHS), 2009.
Trust, but verify: On the importance of chemical structure curation in cheminformatics and QSAR modeling research, vol.50, pp.1189-1204, 2010. ,
A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and kappa, Ecological Modelling, vol.217, issue.1-2, pp.48-58, 2008. ,
Greedy function approximation: a gradient boosting machine, Annals of statistics, pp.1189-1232, 2001. ,
QSAR Modeling of ToxCast Assays Relevant to the Molecular Initiating Events of AOPs Leading to Hepatic Steatosis, Journal of Chemical Information and Modeling, vol.58, issue.8, pp.1501-1517, 2018. ,
URL : https://hal.archives-ouvertes.fr/ineris-02006100
Review of Software Tools for Toxicity Prediction, European Commision JRC, 2010. ,
Stability of the random neural network model, Neural Computation, vol.2, issue.2, pp.239-247, 1990. ,
Learning in the recurrent random neural network, Neural Computation, vol.5, issue.1, pp.154-164, 1993. ,
Random neural networks with synchronized interactions, Neural Computation, vol.20, pp.2308-2324, 2008. ,
Bias vs Variance Decomposition for Regression and Classification, pp.733-746, 2010. ,
Prediction of Hydrophobic (Lipophilic) Properties of Small Organic Molecules Using Fragmental Methods: An Analysis of ALOGP and CLOGP Methods, The Journal of Physical Chemistry A, vol.102, issue.21, pp.3762-3772, 1998. ,
The SEURAT-1 approach towards animal free human safety assessment. Alternatives to animal experimentation, vol.32, pp.9-24, 2015. ,
Feature Selection Methods in QSAR Studies, Journal of AOAC International, vol.95, issue.3, pp.636-651, 2012. ,
Chemical in vitro bioactivity profiles are not informative about the long-term in vivo endocrine mediated toxicity, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02540249
Stacked Generalization with Applicability Domain Outperforms Simple QSAR on in Vitro Toxicological Data, Journal of Chemical Information and Modeling, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02051775
G-Networks to Predict the Outcome of Sensing of Toxicity, Sensors, vol.18, issue.10, p.3483, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02051779
Machine Learning to Predict Toxicity of Compounds, The 27th International Conference on Artificial Neural Networks (ICANN), pp.335-345, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02051852
The Comparative Toxicogenomics Database: update 2019, Nucleic Acids Research, vol.47, issue.D1, pp.948-954, 2018. ,
Combining machine learning models of in vitro and in vivo bioassays improves rat carcinogenicity prediction, Regulatory Toxicology and Pharmacology, vol.94, pp.8-15, 2018. ,
An Introduction to Variable and Feature Selection, Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003. ,
An Empirical Overview of the No Free Lunch Theorem and Its Effect on Real-World Machine Learning Classification, Neural Computation, vol.28, issue.1, pp.216-228, 2016. ,
Stacked ensemble models for improved prediction accuracy, SAS Global Forum Proceedings, 2017. ,
Learning from Imbalanced Data, IEEE Transactions on Knowledge and Data Engineering, vol.21, issue.9, pp.1263-1284, 2009. ,
Electrotopological State Indices for Atom Types: A Novel Combination of Electronic, Topological, and Valence State Information, Journal of Chemical Information and Computer Sciences, vol.35, issue.6, pp.1039-1045, 1995. ,
Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning, Advances in Intelligent Computing, pp.878-887 ,
Developing and validating predictive decision tree models from mining chemical structural fingerprints and high-throughput screening data in PubChem, BMC Bioinformatics, vol.9, issue.1, p.401, 2008. ,
A review of the global pesticide legislation and the scale of challenge in reaching the global harmonization of food safety standards, Integrated Environmental Assessment and Management, vol.11, issue.4, pp.525-536, 2015. ,
Quantitative structure-activity relationships and the unnamed science, Accounts of Chemical Research, vol.26, issue.4, pp.147-153, 1993. ,
Correlation of Biological Activity of Phenoxyacetic Acids with Hammett Substituent Constants and Partition Coefficients, Nature, vol.194, pp.178-180, 1962. ,
, Toxicology Ontology Perspectives. Alternatives to animal experimentation, pp.139-156, 2012.
Open PHACTS: A Semantic Knowledge Infrastructure for Public and Commercial Drug Discovery Research, Knowledge Engineering and Knowledge Management, pp.1-7, 2012. ,
ADASYN: Adaptive synthetic sampling approach for imbalanced learning, IEEE International Joint Conference on Neural Networks, pp.1322-1328, 2008. ,
Evaluation of Quantitative Structure Activity Relationship Modeling Strategies: Local and Global Models, Journal of Chemical Information and Modeling, vol.50, issue.4, pp.677-689, 2010. ,
InChI -the worldwide chemical structure identifier standard, Journal of Cheminformatics, vol.5, issue.1, p.7, 2013. ,
A survey of outlier detection methodologies, Artificial Intelligence Review, vol.22, issue.2, pp.85-126, 2004. ,
A review on evaluation metrics for data classification evaluations, International Journal of Data Mining & Knowledge Management Process, vol.5, issue.2, p.1, 2015. ,
Advances in high-throughput screening technology for toxicology, International Journal of Risk Assessment and Management, vol.20, issue.1/2/3, pp.109-135, 2017. ,
Tox21 challenge to build predictive models of nuclear receptor and stress response pathways as mediated by exposure to environmental toxicants and drugs, Frontiers in Environmental Science, vol.5, p.85, 2017. ,
,
Modelling the Tox21 10K chemical profiles for in vivo toxicity prediction and mechanism characterization, Nature communications, 2016. ,
Advancing Exposure Characterization for Chemical Evaluation and Risk Assessment, Journal of Toxicology and Environmental Health, vol.13, issue.2-4, pp.299-313, 2010. ,
Open TG-GATEs: A large-scale toxicogenomics database, Nucleic acids research, vol.43, pp.921-927, 2014. ,
, Global Assessment of the State-of-Science of Endocrine Disruptors. Geneva: World Health Organization, IPCS, 2002.
IPCS risk assessment terminology. Geneva: World Health Organization, 2004. ,
The Simcyp Population-based ADME Simulator, Expert Opinion on Drug Metabolism & Toxicology, vol.5, issue.2, pp.211-223, 2009. ,
Summary of a Workshop on Regulatory Acceptance of (Q)SARs for Human Health and Environmental Endpoints. Environmental health perspectives, vol.111, pp.1358-1360, 2003. ,
QSAR applicabilty domain estimation by projection of the training set descriptor space: a review. Alternatives to laboratory animals, vol.33, pp.445-459, 2005. ,
A review of feature selection methods with applications, 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp.1200-1205, 2015. ,
, Vitro Screening of Environmental Chemicals for Targeted Testing Prioritization: The ToxCast Project, vol.118, pp.485-492, 2010.
ACToR -Aggregated Computational Toxicology Resource, Toxicology and Applied Pharmacology, vol.233, issue.1, pp.7-13, 2008. ,
,
Perspectives on validation of high-throughput assays supporting 21st century toxicity testing. Alternatives to animal experimentation, vol.30, pp.51-57, 2013. ,
Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-Throughput Screening Assays for the Estrogen Receptor, Toxicological sciences : an official journal of the Society of Toxicology, vol.148, issue.1, pp.137-54, 2015. ,
A review on endocrine disruptors and their possible impacts on human health, Environmental Toxicology and Pharmacology, vol.40, issue.1, pp.241-258, 2015. ,
Reducibility among Combinatorial Problems, pp.85-103, 1972. ,
International Harmonization of Nomenclature and Diagnostic Criteria (IN-HAND) progress to date and future plans, Journal of Toxicologic Pathology, vol.28, issue.1, pp.51-53, 2015. ,
Mechanism Profiling of Hepatotoxicity Caused by Oxidative Stress Using the Antioxidant Response Element Reporter Gene Assay Models and Big Data, Environmental Health Perspectives, vol.124, pp.634-641, 2015. ,
PubChem structureactivity relationship (SAR) clusters, Journal of Cheminformatics, 2015. ,
Identifying Attributes That Influence In Vitro-to-In Vivo Concordance by Comparing In Vitro Tox21 Bioactivity Versus In Vivo DrugMatrix Transcriptomic Responses Across 130 Chemicals, Toxicological Sciences, vol.167, issue.1, pp.157-171, 2018. ,
Development and Validation of a Computational Model for Androgen Receptor Activity, Chemical Research in Toxicology, vol.30, issue.4, pp.946-964, 2017. ,
,
In Vitro Perturbations of Targets in Cancer Hallmark Processes Predict Rodent Chemical Carcinogenesis, Toxicological Sciences, vol.131, issue.1, pp.40-55, 2013. ,
Predictive models for acute oral systemic toxicity: A workshop to bridge the gap from research to regulation, Computational Toxicology, vol.8, pp.21-24, 2018. ,
Bayes' theorem, Bayesian Inference with Geodetic Applications, pp.4-8 ,
, , 1990.
The self-organizing map, Proceedings of the IEEE, vol.78, issue.9, pp.1464-1480, 1990. ,
Supervised Machine Learning: A Review of Classification Techniques, Proceedings of the 2007 Conference on Emerging Artificial Intelligence Applications in Computer Engineering, pp.3-24, 2007. ,
Toxicity Testing in the 21st Century: A Vision and A Strategy, Journal of toxicology and environmental health. Part B, Critical reviews, vol.13, pp.51-138, 2010. ,
Machine learning for the detection of oil spills in satellite radar images. Machine learning, vol.30, pp.195-215, 1998. ,
Applied Concepts in PBPK Modeling: How to Build a PBPK/PD Model, CPT: Pharmacometrics & Systems Pharmacology, vol.5, issue.10, pp.516-531 ,
The caret Package, 2009. ,
QSAR Modelling of Rat Acute Toxicity on the Basis of PASS Prediction, Molecular Informatics, vol.30, issue.2-3, pp.241-250, 2011. ,
,
The Connectivity Map: Using Gene-Expression Signatures to, Genes, and Disease. Science, vol.313, issue.5795, pp.1929-1935, 2006. ,
CEBS: A comprehensive annotated database of toxicological data, Nucleic acids research, vol.45, pp.964-971, 2016. ,
Deep learning, Nature, vol.521, issue.7553, p.436, 2015. ,
Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998. ,
Adverse outcome pathways: opportunities, limitations and open questions, Archives of Toxicology, vol.91, issue.11, pp.3477-3505, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01968849
Multilayer feedforward networks with a nonpolynomial activation function can approximate any function, Neural networks, vol.6, issue.6, pp.861-867, 1993. ,
Using discriminant analysis for multi-class classification: an experimental investigation, Knowledge and Information Systems, vol.10, issue.4, pp.453-472, 2006. ,
In Silico Prediction of Chemical Acute Oral Toxicity Using Multi-Classification Methods, Journal of Chemical Information and Modeling, vol.54, issue.4, pp.1061-1069, 2014. ,
Navigating chemical space for biology and medicine, Nature, vol.432, issue.7019, pp.855-861, 2004. ,
Predicting Hepatotoxicity Using ToxCast in Vitro Bioactivity and Chemical Structure, Chemical Research Toxicology, vol.28, issue.4, pp.738-751, 2015. ,
Predicting Organ Toxicity Using in Vitro Bioactivity Data and Chemical Structure, Chemical Research Toxicology, vol.30, issue.11, pp.2046-2059, 2017. ,
Classification and regression tree methods, Wiley StatsRef: Statistics Reference Online, 2008. ,
Use of Physiologically Based Kinetic Modeling-Based Reverse Dosimetry to Predict in Vivo Toxicity from in Vitro Data, Chemical Research in Toxicology, vol.30, issue.1, pp.114-125, 2017. ,
Integrative Approaches for Predicting In Vivo Effects of Chemicals from their Structural Descriptors and the Results of Short-Term Biological Assays, Current Topics in Medicinal Chemistry, vol.14, issue.11, pp.1356-1364, 2014. ,
Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches, Chemical Research in Toxicology, 2011. ,
Penalized feature selection and classification in bioinformatics, Briefings in bioinformatics, vol.9, pp.392-403, 2008. ,
Data Imbalance and Classifiers: Impact and Solutions from a Big Data Perspective, IJCIR, vol.13, issue.9, pp.2267-2281, 2017. ,
Recherche des sous-matrices premières d'une matrice à coefficients binaires. applications à certains problèmes de graphe, Proceedings of the Deuxième Congrès de l'AFCALTI, pp.231-242, 1962. ,
, Silico Tools for Sharing Data and Knowledge on Toxicity and Metabolism: Derek for Windows, Meteor, and Vitic. Toxicology mechanisms and methods, vol.18, pp.177-87, 2008.
Profile-QSAR: A Novel meta-QSAR Method that Combines Activities across the Kinase Family To Accurately Predict Affinity, Selectivity, and Cellular Activity, Journal of Chemical Information and Modeling, vol.51, issue.8, pp.1942-1956, 2011. ,
Profile-QSAR 2.0: Kinase Virtual Screening Accuracy Comparable to Four-Concentration IC50s for Realistically Novel Compounds, Journal of Chemical Information and Modeling, vol.57, issue.8, pp.2077-2088, 2017. ,
,
Predictive model of rat reproductive toxicity from ToxCast high throughput screening, Biology of reproduction, vol.85, issue.2, pp.327-366, 2011. ,
The Comparative Toxicogenomics Database (CTD), vol.111, pp.793-795, 2003. ,
DeepTox: Toxicity Prediction using Deep Learning, Frontiers in Environmental Science, vol.3, issue.80, 2016. ,
Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance, Neural Networks, vol.21, issue.2-3, pp.427-436, 2008. ,
Pragmatic Challenges for the Vision of Toxicity Testing in the 21st Century in a Regulatory Context: Another Ames Test, Toxicological Sciences, vol.108, issue.1, pp.19-21, 2009. ,
Mode of action human relevance (species concordance) framework: Evolution of the Bradford Hill considerations and comparative analysis of weight of evidence, Journal of applied toxicology, vol.34, pp.595-606, 2014. ,
A comparison of random forest and its gini importance with standard chemometric methods for the feature selection and classification of spectral data, BMC Bioinformatics, vol.10, issue.1, p.213, 2009. ,
Support Vector Machines. The Interface to libsvm in package e1071, 2001. ,
URL : https://hal.archives-ouvertes.fr/hal-00555258
Machine learning methods in chemoinformatics, Wiley Interdisciplinary Reviews: Computational Molecular Science, vol.4, issue.5, pp.468-481, 2014. ,
Current status of methods for defining the applicability domain of (quantitative) structure-activity relationships, ATLA, vol.33, pp.155-173, 2005. ,
Development and Validation of Decision Forest Model for Estrogen Receptor Binding Prediction of Chemicals Using Large Data Sets, Chemical Research in Toxicology, vol.28, issue.12, pp.2343-2351, 2015. ,
What is a Support Vector Machine?, Nature biotechnology, vol.24, pp.1565-1572, 2007. ,
Conformal prediction classification of a large data set of environmental chemicals from toxcast and tox21 estrogen receptor assays, Chemical research in toxicology, vol.29, issue.6, pp.1003-1010, 2016. ,
Towards a Universal SMILES representation -A standard method to generate canonical SMILES based on the InChI, Journal of cheminformatics, vol.4, p.22, 2012. ,
Pybel: a Python wrapper for the Open-Babel cheminformatics toolkit, Chemistry Central Journal, vol.2, issue.1, 2008. ,
, Conceptual Framework for Testing and Assessment of Endocrine Disrupters, Official Journal of the European Union, 2002.
/10/EC of the European Parliament and of the Council of 11 February 2004. The OECD Principles of Good Laboratory Practice (GLP), Official Journal of the European Union, 2004. ,
, OECD. Guidance Document on the Validation of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models, p.154, 2014.
, Guidance Document for the use of Adverse Outcome Pathways in developing Integrated Approaches to Testing and Assessment (IATA), Series on Testing and Assessment, vol.260, 2017.
An overview of clustering methods, vol.11, pp.583-605, 2007. ,
Logistic regression model for prediction of roof fall risks in bord and pillar workings in coal mines: An approach, Safety Science -SAF SCI, vol.47, pp.88-96, 2009. ,
Toxicological screening, Journal of Pharmacology & Pharmacotherapeutics, vol.2, pp.74-79, 2011. ,
An evaluation of the implementation of the Cramer classification scheme in the Toxtree software, SAR and QSAR in environmental research, vol.19, pp.495-524, 2008. ,
, Normalization: A preprocessing stage, 2015.
Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00650905
The maximum edge biclique problem is NP-complete, Discrete Applied Mathematics, vol.131, issue.3, pp.651-654, 2003. ,
How Many Principal Components? Stopping Rules for Determining the Number of Non-Trivial Axes Revisited, Computational Statistics and Data Analysis, vol.49, pp.974-997, 2005. ,
Endocrine Disruptors Leading to Obesity and Related Diseases, International Journal of Environmental Research and Public Health, vol.14, issue.10, p.1282, 2017. ,
Challenges in using the ToxRefDB as a resource for toxicity prediction modeling, Regulatory Toxicology and Pharmacology, vol.72, issue.3, pp.610-614, 2015. ,
An ensemble model of QSAR tools for regulatory risk assessment, Journal of Cheminformatics, vol.8, issue.1, p.48, 2016. ,
R: A Language and Environment for Statistical Computing, 2013. ,
In silico toxicology: computational methods for the prediction of chemical toxicity, Wiley Interdisciplinary Reviews: Computational Molecular Science, vol.6, pp.147-172, 2016. ,
, Silico toxicology -non-testing methods, vol.2, p.33, 2011.
Profiling Chemicals Based on Chronic Toxicity Results from the U.S. EPA ToxRef Database, Environmental Health Perspectives, vol.117, issue.3, pp.392-399, 2008. ,
Predictive Modeling of Estrogen Receptor Binding Agents Using Advanced Cheminformatics Tools and Massive Public Data, Frontiers in Environmental Science, vol.4, p.12, 2016. ,
DSSTox web site launch: Improving public access to databases for building structure-toxicity prediction models, vol.2, pp.103-108, 2004. ,
ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology, Chemical Research in Toxicology, vol.29, issue.8, pp.1225-1251, 2016. ,
Toxicity Data Informatics: Supporting a New Paradigm for Toxicity Prediction, Toxicology Mechanisms and Methods, vol.18, issue.2-3, pp.103-118, 2008. ,
Using Information from Historical High-Throughput Screens to Predict Active Compounds, Journal of Chemical Information and Modeling, vol.54, issue.7, pp.1880-1891, 2014. ,
Extended-Connectivity Fingerprints, Journal of Chemical Information and Modeling, vol.50, issue.5, pp.742-754, 2010. ,
Chemical substructures that enrich for biological activity, Bioinformatics, vol.24, issue.21, pp.2518-2525, 2008. ,
Contribution of New Technologies to Characterisation and Prediction of Adverse Effects, Critical Reviews in Toxicology, vol.45, pp.1-12, 2015. ,
Thyroid tumor formation in the male mouse induced by fluopyram is mediated by activation of hepatic CAR/PXR nuclear receptors, Regulatory Toxicology and Pharmacology, vol.70, issue.3, pp.673-680, 2014. ,
Re-evaluation of animal numbers and costs for in vivo tests to accomplish REACH legislation requirements for chemicals - a report by the transatlantic think tank for toxicology, Alternatives to animal experimentation, vol.26, pp.187-208, 2009. ,
The principles of humane experimental technique, vol.238, 1959. ,
Predictive Modeling of Chemical Hazard by Integrating Numerical Descriptors of Chemical Structures and Short-term Toxicity Assay Data, Toxicological Sciences, vol.127, issue.1, pp.1-9, 2012. ,
A user's guide for accessing and interpreting ToxCast data ,
y-Randomization and Its Variants in QSPR/QSAR, Journal of Chemical Information and Modeling, vol.47, issue.6, pp.2345-2357, 2007. ,
Hazard Evaluation Support System (HESS) for predicting repeated dose toxicity using toxicological categories, SAR and QSAR in Environmental Research, vol.24, issue.5, pp.351-363, 2013. ,
Drug Solubility: Importance and Enhancement Techniques, ISRN Pharmaceutics, vol.2012, pp.1-10, 2012. ,
Learning with kernels: support vector machines, regularization, optimization, and beyond, 2001. ,
Endocrine Disruptors: Past Lessons and Future Directions, vol.30, pp.833-847, 2016. ,
Selection of data sets for QSARs: Analyses of Tetrahymena toxicity from aromatic compounds, SAR and QSAR in Environmental Research, vol.14, issue.1, pp.59-81, 2003. ,
Use of in Vitro HTS-Derived Concentration-Response Data as Biological Descriptors Improves the Accuracy of QSAR Models of in Vivo Toxicity, Environmental Health Perspectives, vol.119, issue.3, pp.364-370, 2011. ,
Using Nuclear Receptor Activity to Stratify Hepatocarcinogens, PLoS ONE, vol.6, issue.2, p.14584, 2011. ,
Normalization as a Preprocessing Engine for Data Mining and the Approach of Preference Matrix, 2006 International Conference on Dependability of Computer Systems, pp.207-214, 2006. ,
EADB: An estrogenic activity database for assessing potential endocrine activity, Toxicological Sciences, pp.277-291, 2013. ,
Global Quantitative Structure-Activity Relationship Models vs Selected Local Models as Predictors of Off-Target Activities for Project Compounds, Journal of Chemical Information and Modeling, vol.54, issue.4, pp.1083-1092, 2014. ,
Similarity to molecules in the training set is a good discriminator for prediction accuracy in QSAR, Journal of Chemical Information and Computer Sciences, vol.44, issue.6, pp.1912-1928, 2004. ,
Density estimation for statistics and data analysis, Routledge, 2018. ,
Profiling 976 ToxCast Chemicals across 331 Enzymatic and Receptor Signaling Assays, Chemical Research in Toxicology, vol.26, issue.6, pp.878-895, 2013. ,
Predictive Models of Prenatal Developmental Toxicity from ToxCast High-Throughput Screening Data, Toxicological Sciences, vol.124, issue.1, pp.109-127, 2011. ,
Application of connectivity mapping in predictive toxicology based on gene-expression similarity, Toxicology, vol.268, pp.143-146, 2009. ,
IPCS Conceptual Framework for Evaluating a Mode of Action for Chemical Carcinogenesis, Regulatory Toxicology and Pharmacology, vol.34, issue.2, pp.146-152, 2001. ,
The eTOX Consortium: To Improve the Safety Assessment of New Drug Candidates, Pharmazeutische Medizin, vol.1, pp.3-13, 2017. ,
In silico prediction of in vivo toxicity - the first steps of the eTox consortium, Toxicology Letters, vol.196, pp.250-251, 2010. ,
Conditional Variable Importance for Random Forests, BMC Bioinformatics, vol.9, issue.1, p.307, 2008. ,
A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles, Cell, vol.171, issue.6, pp.1437-1452, 2017. ,
In Silico Prediction of Endocrine Disrupting Chemicals Using Single-Label and Multilabel Models, vol.59, pp.973-982, 2019. ,
Introduction to reinforcement learning, vol.135, 1998. ,
Modelling compound cytotoxicity using conformal prediction and PubChem HTS data, Toxicology Research, vol.6, issue.1, pp.73-80, 2017. ,
Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling, Journal of Chemical Information and Computer Sciences, vol.43, issue.6, pp.1947-1958, 2003. ,
Human environmental disease network: A computational model to assess toxicology of contaminants, Alternatives to animal experimentation, vol.34, pp.289-300, 2017. ,
Feature selection for classification: A review, Data Classification: Algorithms and Applications, vol.01, pp.37-64, 2014. ,
OpenTox predictive toxicology framework: Toxicological ontology and semantic media wiki-based OpenToxipedia, Journal of Biomedical Semantics, vol.3, issue.1, p.7, 2012. ,
ICH M7, chapter 24, pp.667-699, 2017. ,
Estimation of Aqueous Solubility of Chemical Compounds Using E-State Indices, Journal of Chemical Information and Computer Sciences, vol.41, issue.6, pp.1488-1493, 2001. ,
The US Federal Tox21 Program: A Strategic and Operational Plan for Continued Leadership, Alternatives to animal experimentation, vol.35, pp.163-168, 2018. ,
A Comprehensive Statistical Analysis of Predicting In Vivo Hazard Using High-Throughput In Vitro Screening, Toxicological Sciences, vol.128, issue.2, pp.398-417, 2012. ,
Improving the Human Hazard Characterization of Chemicals: A Tox21 Update, Environmental Health Perspectives, vol.121, issue.7, pp.756-765, 2013. ,
Chance correlations in structure-activity studies using multiple regression analysis, Journal of Medicinal Chemistry, vol.15, issue.10, pp.1066-1068, 1972. ,
Best Practices for QSAR Model Development, Validation, and Exploitation, Molecular Informatics, vol.29, issue.6-7, pp.476-488, 2010. ,
The Japanese toxicogenomics project: Application of toxicogenomics, Molecular Nutrition & Food Research, vol.54, issue.2, pp.218-227 ,
Dimensionality reduction: a comparative review, Journal of Machine Learning Research, vol.10, pp.66-71, 2009. ,
Hormones and Endocrine-Disrupting Chemicals: Low-Dose Effects and Nonmonotonic Dose Responses, Endocrine Reviews, vol.33, issue.3, pp.378-455, 2012. ,
Modern Applied Statistics with S, 2002. ,
Adverse Outcome Pathway (AOP) Development I: Strategies and Principles, Toxicological Sciences, vol.142, issue.2, pp.312-320, 2014. ,
PubChem BioAssay: 2017 update, Nucleic Acids Research, vol.45, issue.D1, pp.955-963, 2017. ,
PubChem: a public information system for analyzing bioactivities of small molecules, Nucleic Acids Research, pp.623-633, 2009. ,
Representation of chemical structures, Wiley Interdisciplinary Reviews: Computational Molecular Science, vol.1, pp.557-579, 2011. ,
CEBS Chemical Effects in Biological Systems: a public data repository integrating study design and toxicity data with microarray and proteomics data, Nucleic Acids Research, vol.36, pp.892-900, 2007. ,
ToxRefDB 2.0: Improvements in Capturing Qualitative and Quantitative Data from in vivo Toxicity Studies, 2017. ,
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules, Journal of Chemical Information and Computer Sciences, vol.28, issue.1, pp.31-36, 1988. ,
Predictive Multitask Deep Neural Network Models for ADME-Tox Properties: Learning from Large Data Sets, Journal of Chemical Information and Modeling, vol.59, issue.3, pp.1253-1268, 2019. ,
Adverse Outcome Pathways Knowledge Base (AOP-KB), Toxicology Letters, vol.238, p.309, 2015. ,
Principal component analysis, Chemometrics and Intelligent Laboratory Systems, vol.2, pp.37-52, 1987. ,
Chemical carcinogenicity revisited 1: A unified theory of carcinogenicity based on contemporary knowledge, Regulatory Toxicology and Pharmacology, vol.103, pp.86-92, 2019. ,
Stacked generalization, Neural Networks, vol.5, issue.2, pp.241-259, 1992. ,
The Lack of A Priori Distinctions Between Learning Algorithms, Neural Computation, vol.8, issue.7, pp.1341-1390, 1996. ,
The Role of QSAR Methodology in the Regulatory Assessment of Chemicals, pp.367-382, 2010. ,
Integrating Drug's Mode of Action into Quantitative Structure-Activity Relationships for Improved Prediction of Drug-Induced Liver Injury, Journal of Chemical Information and Modeling, vol.57, issue.4, pp.1000-1006, 2017. ,
Monte Carlo Cross Validation, Chemometrics and Intelligent Laboratory Systems, vol.56, pp.1-11, 2001. ,
Development of a COSMOS DB to support in silico modelling for cosmetics ingredients and related chemicals, The Toxicologist - A Supplement to Toxicological Sciences, pp.132-185, 2013. ,
A Review of Ensemble Methods in Bioinformatics, Current Bioinformatics, vol.5, issue.4, pp.296-308, 2010. ,
PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints, Journal of Computational Chemistry, vol.32, issue.7, pp.1466-1474, 2011. ,
A Literature Survey on Association Rule Mining Algorithms, Southeast Europe Journal of Soft Computing, vol.5, p.1859, 2016. ,
Cluster-based Under-sampling Approaches for Imbalanced Data Distributions, Expert Systems with Applications, vol.36, pp.5718-5727, 2006. ,
Single-cell based random neural network for deep learning, 2017 International Joint Conference on Neural Networks (IJCNN), pp.86-93, 2017. ,
Advance and prospects of AdaBoost algorithm, Acta Automatica Sinica, vol.39, issue.6, pp.745-758, 2013. ,
Binary Classification of a Large Collection of Environmental Chemicals from Estrogen Receptor Assays by Quantitative Structure-Activity Relationship and Machine Learning Methods, Journal of Chemical Information and Modeling, vol.53, issue.12, pp.3244-3261, 2013. ,
In silico Prediction of Drug Induced Liver Toxicity Using Substructure Pattern Recognition Method, Molecular Informatics, vol.35, issue.3-4, pp.136-144, 2016. ,
Data Preparation for Data Mining, Applied Artificial Intelligence, vol.17, pp.375-381, 2003. ,
On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types, BMC Bioinformatics, vol.15, issue.1, p.110, 2014. ,
Cross-validation based weights and structure determination of Chebyshev-polynomial neural networks for pattern classification, Pattern Recognition, vol.47, issue.10, pp.3414-3428, 2014. ,
A Novel Two-Step Hierarchical Quantitative Structure-Activity Relationship Modeling Work Flow for Predicting Acute Toxicity of Chemicals in Rodents, Environmental Health Perspectives, vol.117, pp.1257-1264, 2009. ,
Big Data in Chemical Toxicity Research: The Use of High-Throughput Screening Assays To Identify Potential Toxicants, Chemical Research in Toxicology, vol.27, issue.10, pp.1643-1651, 2014. ,
Hybrid in silico models for drug-induced liver injury using chemical descriptors and in vitro cell-imaging information, Journal of Applied Toxicology, vol.34, pp.281-288, 2014. ,
KNN approach to unbalanced data distributions: a case study involving information extraction, Proceedings of the International Conference on Machine Learning, vol.126, 2003. ,