Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis - Laboratoire d'informatique de l'X (LIX) Accéder directement au contenu
Mémoire D'étudiant Année : 2021

Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis

Résumé

In this report, we will attempt to improve a failure prediction model that is currently used by the French Ministry of Economy and Finance. First, we studied several models and benchmarked them in order to compare our results with those of the articles studied. As a result, we were able to select four models that stood out from the rest, and that we should improve as much as possible. Secondly, we decided to look at the data itself. We realized that our dataset was static. That is, for each row in our table, we had data only for a time T. We therefore decided to add variables, which we will call temporal features, in order to take temporality into account in the model. This addition was more than conclusive, because it allowed us to obtain excellent results that had not been achieved until then. Afterwards, we will proceed with this new dataset. In order to further improve our results, we have started to search by sector of activity. We separated our dataset into several datasets, the separation being done on the sector of activity of the companies. In doing so, we realized that if we applied different models for each business sector, we would get much better results. Depending on the operational needs, we conclude that this is an area to consider seriously. Finally, we decided to look at the importance of features in our models. To do this, we looked at the importance of the variables in the classifiers, and we realized that only a fraction of the input variables were actually useful, and among those, our temporal features that we added. It would therefore be appropriate to reduce the number of input variables, and to go even further in temporalizing the model on the small remaining dataset.
Fichier principal
Vignette du fichier
Article_SF (2).pdf (970.99 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03523010 , version 1 (12-01-2022)

Licence

Domaine public

Identifiants

  • HAL Id : hal-03523010 , version 1

Citer

Thomas Meunier. Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis. Artificial Intelligence [cs.AI]. 2021. ⟨hal-03523010⟩
71 Consultations
41 Téléchargements

Partager

Gmail Facebook X LinkedIn More