False Positive Rate


The False Positive Rate (FPR) is a performance index that states how many negative cases are wrongly classified as positive. Formally:

fFPR({yi,y^i}i)=i=1NI(yi=neg or yiy^i)i=1NI(yi=neg)f_\text{FPR}(\{ y^{i},\hat{y}^{i} \}_{i})=\frac{\sum_{i=1}^{N}\mathrm{I}(y^{i}=\text{neg or }y^{i}\neq \hat{y}^{i})}{\sum_{i=1}^{N} \mathrm{I}(y^{i}=\text{neg})}

FPR is defined between 0 and 1 (a percentage). If there are no positives, it is undefined (divides by zero). FPR is best when its lowest. It's commonly provided alongside the False Negative Rate.

A more convenient notation is

FPR=FPN\text{FPR}=\frac{\text{FP}}{\text{N}}

where FP\text{FP} is the number of false positives and N\text{N} is the number of negatives. This assumes the presence of a real and predicted set {yi,y^i}\{ y^{i},\hat{y}^{i} \} even if not written. Counting false positives needs both yiy^{i} and y^i\hat{y}^{i}, whereas counting negatives only need yiy^{i}.

A sibling quantity is the True Positive Rate, which is the count of correctly classified positives. It is the complement of FPR:

TPR=1FPR\text{TPR}=1-\text{FPR}

Like FPR, TPR is defined on [0,1][0,1]. You want TPR to be as close to 1 as possible. TPR is also called sensitivity, because it's measures how much a classifier is sensitive to errors and capable of discerning between a true positive and false positive. This term is more common in diagnostic tests.