3 - Medical test values considered and discretized according to reference ranges. Examples are given in square brackets.

Category: medical test (unit) [examples]
Blood cells: lymphocyte percentage (%)
Hemoglobin: mean corpuscular hemoglobin concentration (%) [...; < 30%; 30-35%; ...]
Coagulation: sedimentation rates (mm)
HDL cholesterol (mmol/l) [...; > 2]
Chemistry: albuminemia (µmol/l) [...; > 80], chloremia (mmol/l)
Biology: serum glutamo-oxaloacetate transferase (IU/l)