Skip to Main content Skip to Navigation

Etude de la classification dans un trés grand nombre de catégories.

Raphael Puget 1 
1 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : The increase in volume of the data nowadays is at the origin of new problematics for which machine learning does not possess adapted answers. The usual classification task which requires to assign one or more classes to an example is extended to problems with thousands or even millions of different classes. Those problems bring new research fields like the complexity reduction of the classification process. That classification process has a complexity usually linear with the number of classes of the problem, which can be an issue if the number of classes is too large. Various ways to deal with those new problems have emerged like the construction of a hierarchy of classifiers or the adaptation of ECOC ensemble methods. The work presented here describes two new methods to answer this extreme classification task. The first one consists in a new asymmetrical measure to help the partitioning of the classes in order to build a hierarchy of classes. The second one proposes a sequential way to aggregate effectively the most interesting classifiers.
Document type :
Complete list of metadata
Contributor : ABES STAR :  Contact
Submitted on : Thursday, February 16, 2017 - 3:46:06 PM
Last modification on : Sunday, June 26, 2022 - 9:46:03 AM
Long-term archiving on: : Wednesday, May 17, 2017 - 8:53:16 PM


Version validated by the jury (STAR)


  • HAL Id : tel-01469603, version 1


Raphael Puget. Etude de la classification dans un trés grand nombre de catégories.. Base de données [cs.DB]. Université Pierre et Marie Curie - Paris VI, 2016. Français. ⟨NNT : 2016PA066312⟩. ⟨tel-01469603⟩



Record views


Files downloads