It's my first time using scikit-learn to work with metrics, and I want to plot a ROC curve with this library.
The curve it produces says AUC = 1.00, which, as far as I know, is incorrect. Here is the code:
from sklearn.metrics import roc_curve, auc
import pylab as pl

def show_roc(test_target, predicted_probs):
    # compute the ROC points and the area under the curve
    fpr, tpr, thresholds = roc_curve(test_target, predicted_probs)
    roc_auc = auc(fpr, tpr)

    pl.clf()
    pl.plot(fpr, tpr, label='ROC curve (area = %0.2f)' % roc_auc)
    pl.plot([0, 1], [0, 1], 'k--')
    pl.xlim([-0.1, 1.2])
    pl.ylim([-0.1, 1.2])
    pl.xlabel('False Positive Rate')
    pl.ylabel('True Positive Rate')
    pl.title('Receiver operating characteristic example')
    pl.legend(loc="lower right")
    pl.show()

actual = [1, -1, -1, -1, -1, 1, -1, -1, 1, -1, -1, -1, -1, -1, -1, -1, 1, -1, -1, -1]
prediction_probas = [0.374, 0.145, 0.263, 0.129, 0.215, 0.538, 0.24, 0.183, 0.402, 0.2, 0.281,
                     0.277, 0.222, 0.204, 0.193, 0.171, 0.401, 0.204, 0.213, 0.182]

show_roc(actual, prediction_probas)
For this first set, here is the graph:
http://i.stack.imgur.com/pa93c.png
The probabilities are quite low, especially for the positive examples, so I don't understand why it draws the ideal ROC curve for these inputs.
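In case it helps to see the raw numbers, here is a small diagnostic snippet (a minimal sketch that just reuses the imports and the actual / prediction_probas lists from the code above) printing what roc_curve returns, plus the scores grouped by true label:

# inspect the thresholds roc_curve chose and the resulting ROC points
fpr, tpr, thresholds = roc_curve(actual, prediction_probas)
print("thresholds:", thresholds)
print("fpr:", fpr)
print("tpr:", tpr)
# the same scores grouped by their true label, to see how they are ranked
print("positive scores:", sorted(p for a, p in zip(actual, prediction_probas) if a == 1))
print("negative scores:", sorted(p for a, p in zip(actual, prediction_probas) if a == -1))

For comparison, here is a second, simpler set: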
actual = [1, 1, 1, 0, 0, 0]
prediction_probas = [0.9, 0.9, 0.1, 0.1, 0.1, 0.1]

show_roc(actual, prediction_probas)
For the second set, here is the graph output:

This seems more reasonable, and I included it for comparison.
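As a cross-check on this second set, I also compared the trapezoidal auc(fpr, tpr) against roc_auc_score (a minimal sketch; I'm assuming roc_auc_score, scikit-learn's direct AUC helper, is available in the installed version, and it reuses the second set's actual / prediction_probas lists):

from sklearn.metrics import roc_auc_score

# both should report the same area for the same labels and scores
fpr, tpr, thresholds = roc_curve(actual, prediction_probas)
print("auc(fpr, tpr):", auc(fpr, tpr))
print("roc_auc_score:", roc_auc_score(actual, prediction_probas))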
I have been reading the scikit-learn documentation for most of the day, and I'm at a dead end.