PERFORMANCE OF A CLASSIFIER
DEFINITIONS
in a classification problem where the class is a binary attribute, the following schema can be produced in order to study the results
| | POS-PRED | NEG-PRED |
|---|---|---|
| POS-TRUE | $TP$ | $FN$ |
| NEG-TRUE | $FP$ | $TN$ |
where:
- $TP$: true positives, positives correctly classified as positive
- $TN$: true negatives, negatives correctly classified as negative
- $FP$: false positives, negatives wrongly classified as positive
- $FN$: false negatives, positives wrongly classified as negative
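as a minimal sketch, assuming the true and predicted labels are numpy arrays encoded as 0/1, the four counts can be computed like this (function and variable names are illustrative):

```python
import numpy as np

def binary_counts(y_true, y_pred):
    # count the four cells of the binary confusion matrix
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))  # positives classified as positive
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))  # negatives classified as negative
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))  # negatives classified as positive
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))  # positives classified as negative
    return tp, tn, fp, fn

# toy example
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 0, 1, 0, 1, 1, 0])
print(binary_counts(y_true, y_pred))  # (3, 3, 1, 1)
```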
these measurements can be used to calculate some interesting performance metrics, such as:
- SUCCESS RATE (ACCURACY): rate of correct classifications over the whole test set
$$
\frac{TP + TN}{N_{test}}
$$
- ERROR RATE: rate of wrong classifications, the complement of accuracy
$$
\frac{FP + FN}{N_{test}}
$$
- PRECISION: rate of true positives among positive classifications
$$
\frac{TP}{TP + FP}
$$
- RECALL: rate of positives that the classifier can catch (sensitivity)
$$
\frac{TP}{TP + FN}
$$
- SPECIFICITY: rate of negatives that the classifier can catch
$$
\frac{TN}{TN + FP}
$$
- F1 SCORE: harmonic mean of precision and recall
$$
2 \cdot \frac{precision \cdot recall}{precision + recall}
$$
accuracy gives an initial idea of the performance but can be misleading when classes are unbalanced
the f1 score is interesting because it is higher when precision and recall are balanced
if the costs of positive and negative errors are different, then precision and recall should be considered separately
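a sketch of the metrics above, computed in plain python from the four counts (the helper name is illustrative):

```python
def binary_metrics(tp, tn, fp, fn):
    # all the metrics above, from the four confusion-matrix counts
    # (no guard against division by zero, e.g. when nothing is predicted positive)
    n_test = tp + tn + fp + fn
    accuracy = (tp + tn) / n_test
    error_rate = (fp + fn) / n_test                      # = 1 - accuracy
    precision = tp / (tp + fp)                           # TP among positive predictions
    recall = tp / (tp + fn)                              # sensitivity
    specificity = tn / (tn + fp)                         # negatives caught
    f1 = 2 * precision * recall / (precision + recall)   # harmonic mean
    return {"accuracy": accuracy, "error_rate": error_rate,
            "precision": precision, "recall": recall,
            "specificity": specificity, "f1": f1}

print(binary_metrics(tp=3, tn=3, fp=1, fn=1))
# every metric is 0.75 here except error_rate (0.25)
```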
MULTI-CLASS CASE
in a problem with a non-binary class attribute the previous table can be extended; the result is called a confusion matrix (rows are the true classes, columns are the predicted classes)
| | a | b | c | Total |
|---|---|---|---|---|
| a | $n_{aa}$ | $n_{ab}$ | $n_{ac}$ | $N_a$ |
| b | $n_{ba}$ | $n_{bb}$ | $n_{bc}$ | $N_b$ |
| c | $n_{ca}$ | $n_{cb}$ | $n_{cc}$ | $N_c$ |
| Total | $\hat{N}_a$ | $\hat{N}_b$ | $\hat{N}_c$ | $N$ |
- $N_i$: true number of labels of class $i$ in the dataset (row total)
- $\hat{N}_j$: total number of predictions of class $j$ (column total)
- $n_{ii}$: true positives for class $i$
- $n_{ij}$ with $i \neq j$: false positives for class $j$, i.e. instances of class $i$ predicted as $j$
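a minimal sketch of building the confusion matrix and its totals with numpy, assuming classes are encoded as integers 0..k-1 (names are illustrative):

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes):
    # rows = true class, columns = predicted class
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        cm[t, p] += 1
    return cm

# toy example with classes a=0, b=1, c=2
y_true = [0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0, 2]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
print(cm)
print(cm.sum(axis=1))  # N_i: true labels per class -> [2 2 3]
print(cm.sum(axis=0))  # N_hat_j: predictions per class -> [2 3 2]
```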
- ACCURACY: rate of correct classifications, the sum of the diagonal over the total
$$
\frac{\sum_{i} n_{ii}}{N}
$$
- PRECISION: rate of true positives among the predictions of class $i$
$$
precision_i = \frac{n_{ii}}{\hat{N}_i}
$$
- RECALL: rate of instances of class $i$ that the classifier can catch
$$
recall_i = \frac{n_{ii}}{N_i}
$$
these measures can be global (macro average), taking the plain mean over the set of classes $C$, e.g. for precision:
$$
\frac{1}{|C|} \sum_{i \in C} precision_i
$$
these measures can be weighted by the true frequency of each class:
$$
\sum_{i \in C} \frac{N_i}{N} \cdot precision_i
$$
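a sketch putting the multi-class measures together, starting from the confusion matrix of the previous sketch (illustrative code):

```python
import numpy as np

# confusion matrix from the previous sketch (rows = true, columns = predicted)
cm = np.array([[1, 1, 0],
               [0, 2, 0],
               [1, 0, 2]])
N = cm.sum()
diag = np.diag(cm)            # n_ii: true positives per class
N_i = cm.sum(axis=1)          # true labels per class
N_hat = cm.sum(axis=0)        # predictions per class

accuracy = diag.sum() / N     # sum of the diagonal over the total
precision_i = diag / N_hat    # per-class precision
recall_i = diag / N_i         # per-class recall

macro_precision = precision_i.mean()            # global (macro) average
weighted_precision = (N_i / N) @ precision_i    # weighted by class frequency

print(accuracy)                                 # ~0.714
print(precision_i, recall_i)                    # [0.5 0.667 1.0], [0.5 1.0 0.667]
print(macro_precision, weighted_precision)      # ~0.722, ~0.762
```

for reference, scikit-learn gives the same averages with `precision_score(y_true, y_pred, average='macro')` and `average='weighted'` (the weighted variant weights each class by its number of true instances)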