Calibrated Surrogate Losses for Classification with Label-Dependent Costs

We present surrogate regret bounds for arbitrary surrogate losses in the context of binary classification with label-dependent costs. Such bounds relate a classifier's risk, assessed with respect to a

Calibrated Surrogate Losses for Classification with Label-Dependent   Costs

We present surrogate regret bounds for arbitrary surrogate losses in the context of binary classification with label-dependent costs. Such bounds relate a classifier’s risk, assessed with respect to a surrogate loss, to its cost-sensitive classification risk. Two approaches to surrogate regret bounds are developed. The first is a direct generalization of Bartlett et al. [2006], who focus on margin-based losses and cost-insensitive classification, while the second adopts the framework of Steinwart [2007] based on calibration functions. Nontrivial surrogate regret bounds are shown to exist precisely when the surrogate loss satisfies a “calibration” condition that is easily verified for many common losses. We apply this theory to the class of uneven margin losses, and characterize when these losses are properly calibrated. The uneven hinge, squared error, exponential, and sigmoid losses are then treated in detail.


💡 Research Summary

The paper addresses the theoretical foundations of cost‑sensitive binary classification, where the mis‑classification costs differ between the positive and negative classes. While surrogate loss functions are widely used to replace the non‑convex 0‑1 loss in standard (cost‑insensitive) learning, it has been unclear how to relate the surrogate risk to the true cost‑sensitive risk. The authors develop two complementary frameworks that yield explicit regret bounds linking a classifier’s performance under an arbitrary surrogate loss φ to its performance under the true cost‑sensitive loss.

The first framework generalizes the margin‑based analysis of Bartlett et al. (2006). By introducing the cost‑weighted threshold
\


📜 Original Paper Content

🚀 Synchronizing high-quality layout from 1TB storage...