Minimax Classifier with Box Constraint on the Priors - Université Nice Sophia Antipolis Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Minimax Classifier with Box Constraint on the Priors

Résumé

Learning a classifier in safety-critical applications like medicine raises several issues. Firstly, the class proportions, also called priors, are in general imbalanced or uncertain. Sometimes, experts are able to provide some bounds on the priors and taking into account this knowledge can improve the predictions. Secondly, it is also necessary to consider any arbitrary loss function given by experts to evaluate the classification decision. Finally, the dataset may contain both categorical and numeric features. In this paper, we propose a box-constrained minimax classifier which addresses all the mentioned issues. To deal with both categorical and numeric features, many works have shown that discretizing the numeric attributes can lead to interesting results. Here, we thus consider that numeric features are discretized. In order to address the class proportions issues, we compute the priors which maximize the empirical Bayes risk over a box-constrained probabilistic simplex. This constraint is defined as the intersection between the simplex and a box constraint provided by experts, which aims at bounding independently each class proportions. Our approach allows to find a compromise between the empirical Bayes classifier and the standard minimax classifier, which may appear too pessimistic. The standard minimax classifier, which has not been studied yet when considerring discrete features, is still accessible by our approach. When considering only discrete features, we show that, for any arbitrary loss function, the empirical Bayes risk, considered as a function of the priors, is a concave non-differentiable multivariate piecewise affine function. To compute the box-constrained least favorable priors, we derive a projected subgradient algorithm. The convergence of our algorithm is established. The performance of our algorithm is illustrated with experiments on the Framingham study database to predict the risk of Coronary Heart Disease (CHD).
Fichier principal
Vignette du fichier
NeurIPS19_HAL_version.pdf (2.54 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02296592 , version 1 (25-09-2019)
hal-02296592 , version 2 (01-04-2020)
hal-02296592 , version 3 (02-03-2021)

Identifiants

  • HAL Id : hal-02296592 , version 1

Citer

Cyprien Gilet, Susana Barbosa, Lionel Fillatre. Minimax Classifier with Box Constraint on the Priors. 2019. ⟨hal-02296592v1⟩
540 Consultations
215 Téléchargements

Partager

Gmail Facebook X LinkedIn More