Thesis of Aimene Belfodil
Subject:
Defense date: 30/09/2019
Advisor: Céline Robardet
Coadvisor: Mehdi Kaytoue
Summary:
As the title of this dissertation may suggest, the aim of this thesis is to provide an order-theoretic point of view on the task of subgroup discovery. Subgroup discovery is the task of automatically discovering interesting hypotheses in databases. That is, given a database, the hypothesis space the analyst wants to explore, and a formal criterion by which the analyst gauges the quality of hypotheses (e.g. a quality measure), subgroup discovery aims to extract the interesting hypotheses w.r.t. these parameters. To design fast and efficient algorithms for subgroup discovery, one must understand the underlying properties of the hypothesis space on the one hand and the properties of the quality measure on the other. In this thesis, we extend the state of the art by: (i) providing a unified view of the hypothesis spaces behind subgroup discovery using the well-founded mathematical tool of order theory, (ii) proposing the new hypothesis space of conjunctions of linear inequalities in numerical databases, together with algorithms enumerating its elements, and (iii) proposing an anytime algorithm for discriminative subgroup discovery on numerical datasets that provides guarantees upon interruption.
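To make the three ingredients of the task concrete (a database, a hypothesis space, a quality measure), the following is a minimal sketch and not the algorithms developed in the thesis. It assumes a toy one-attribute numerical dataset, takes closed intervals as the hypothesis space (a much simpler space than conjunctions of linear inequalities), and uses weighted relative accuracy (WRAcc) as one common choice of quality measure. The names `wracc` and `best_interval` are hypothetical.

```python
from itertools import combinations_with_replacement


def wracc(covered, labels):
    """Weighted relative accuracy: coverage * (positive rate inside - positive rate overall)."""
    n = len(labels)
    n_cov = sum(covered)
    if n_cov == 0:
        return 0.0
    p_all = sum(labels) / n
    p_cov = sum(l for c, l in zip(covered, labels) if c) / n_cov
    return (n_cov / n) * (p_cov - p_all)


def best_interval(values, labels):
    """Exhaustively enumerate intervals [lo, hi] with endpoints taken from the data."""
    cuts = sorted(set(values))
    best_quality, best_hypothesis = float("-inf"), None
    for lo, hi in combinations_with_replacement(cuts, 2):
        covered = [lo <= v <= hi for v in values]
        q = wracc(covered, labels)
        if q > best_quality:
            best_quality, best_hypothesis = q, (lo, hi)
    return best_quality, best_hypothesis


# Toy database: one numerical attribute and a binary target (assumed data).
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = [0, 0, 1, 1, 1, 0]
quality, interval = best_interval(values, labels)
print(interval, quality)
```

Exhaustive enumeration like this is quadratic in the number of distinct values for a single attribute and grows quickly with more attributes, which is why the structure of the hypothesis space and the properties of the quality measure matter for efficient algorithms.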
Jury:
Mr. Crémilleux Bruno | Professor | Université de Caen | Reviewer |
Mr. Ganter Bernhard | Professor | Technische Universitaet Dresden | Reviewer |
Ms. Robardet Céline | Professor | INSA Lyon | Thesis advisor |
Mr. Kaytoue Mehdi | Associate Professor | Infologic | Thesis co-advisor |
Ms. Cellier Peggy | Associate Professor | INSA Rennes | Examiner |
Mr. Couceiro Miguel | Professor | Université de Lorraine | Jury president |
Mr. Siebes Arno | Professor | Universiteit Utrecht | Examiner |
Mr. Kuznetsov Sergei O. | Professor | Higher School of Economics (Moscow) | Invited member |
Mr. Zarka Julien | Mobile Devices Ingenierie | Invited member |