Multiclass classification

Main page

What are your thoughts?

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

Multiclass classification

Community hub0 subscribers

Talks overview Knowledge Base overview

About hubStatsRules

Wikipedia

Grokipedia

In machine learning and statistical classification, multiclass classification or multinomial classification is the problem of classifying instances into one of three or more classes (classifying instances into one of two classes is called binary classification). For example, deciding on whether an image is showing a banana, peach, orange, or an apple is a multiclass classification problem, with four possible classes (banana, peach, orange, apple), while deciding on whether an image contains an apple or not is a binary classification problem (with the two possible classes being: apple, no apple).

While many classification algorithms (notably multinomial logistic regression) naturally permit the use of more than two classes, some are by nature binary algorithms; these can, however, be turned into multinomial classifiers by a variety of strategies.

Multiclass classification should not be confused with multi-label classification, where multiple labels are to be predicted for each instance (e.g., predicting that an image contains both an apple and an orange, in the previous example).

From the confusion matrix of a multiclass model, we can determine whether a model does better than chance. Let $K\geq 3$ be the number of classes, ${\mathcal {O}}$ a set of observations, ${\hat {y}}:{\mathcal {O}}\to \{1,...,K\}$ a model of the target variable $y:{\mathcal {O}}\to \{1,...,K\}$ and $n_{i,j}$ be the number of observations in the set $\{y=i\}\cap \{{\hat {y}}=j\}$ . We note $n_{i.}=\sum _{j}n_{i,j}$ , $n_{.j}=\sum _{i}n_{i,j}$ , $n=\sum _{j}n_{.j}=\sum _{i}n_{i.}$ , $\lambda _{i}={\frac {n_{i.}}{n}}$ and $\mu _{j}={\frac {n_{.j}}{n}}$ . It is assumed that the confusion matrix $(n_{i,j})_{i,j}$ contains at least one non-zero entry in each row, that is $\lambda _{i}>0$ for any $i$ . Finally we call "normalized confusion matrix" the matrix of conditional probabilities $(\mathbb {P} ({\hat {y}}=j\mid y=i))_{i,j}=\left({\frac {n_{i,j}}{n_{i.}}}\right)_{i,j}$ .

The lift is a way of measuring the deviation from independence of two events $A$ and $B$ :

$\mathrm {Lift} (A,B)={\frac {\mathbb {P} (A\cap B)}{\mathbb {P} (A)\mathbb {P} (B)}}={\frac {\mathbb {P} (A\mid B)}{\mathbb {P} (A)}}={\frac {\mathbb {P} (B\mid A)}{\mathbb {P} (B)}}$

We have $\mathrm {Lift} (A,B)>1$ if and only if events $A$ and $B$ occur simultaneously with a greater probability than if they were independent. In other words, if one of the two events occurs, the probability of observing the other event increases.

A first condition to satisfy is to have $\mathrm {Lift} (y=i,{\hat {y}}=i)\geq 1$ for any $i$ . And the quality of a model (better or worse than chance) does not change if we over- or undersample the dataset, that is if we multiply each row $R_{i}$ of the confusion matrix by a constant $c_{i}$ . Thus the second condition is that the necessary and sufficient conditions for doing better than chance need only depend on the normalized confusion matrix.

See all

Hub AI

Multiclass classification AI simulator

(@Multiclass classification_simulator)

Wikipedia

Grokipedia

Hub AI

Multiclass classification

The lift is a way of measuring the deviation from independence of two events $A$ and $B$ :

$\mathrm {Lift} (A,B)={\frac {\mathbb {P} (A\cap B)}{\mathbb {P} (A)\mathbb {P} (B)}}={\frac {\mathbb {P} (A\mid B)}{\mathbb {P} (A)}}={\frac {\mathbb {P} (B\mid A)}{\mathbb {P} (B)}}$

See all

Talk Channels

Knowledge Base

Special Pages

Talk Channels

Knowledge Base

Special Pages

Multiclass classification

Multiclass classification

Recent from talks

Recent from talks

Knowledge base stats:

Talk channels stats:

Members stats:

Multiclass classification

Hub AI

Multiclass classification

Contribute something to knowledge base

History

History

Multiclass classification

Multiclass classification

Recent from talks

Recent from talks

Knowledge base stats:

Talk channels stats:

Members stats:

Multiclass classification

Hub AI

Multiclass classification

Contribute something to knowledge base