Leaderboard
Evaluation Metrics
- AUPR : pixel-wise Area Under Precision Recall curve
- FPR95 : pixel-wise False Positive Rate at a true positive rate of 95%
- sIoU gt : adjusted Intersection over Union averaged over all ground truth segmentation components
- PPV : predictive positive value (or precision) averaged over all predicted segmentation components
- mean F1 : component-wise F1-score averaged over different detection thresholds
We consider the Cityscapes dataset as the only in-distribution training data.
Any additional datasets are categorized as OoD data.
We encourage readers to also check the original papers of the methods to double check the use of OoD data.
For a more detailed explanation of the metrics and OoD data, we refer to our paper.