A new strategy for finite-sample valid prediction of future insurance claims in the regression setting
The extant insurance literature demonstrates a paucity of finite-sample valid prediction intervals of future insurance claims in the regression setting. To address this challenge, this article proposes a new strategy that converts a predictive method in the unsupervised iid (independent identically distributed) setting to a predictive method in the regression setting. In particular, it enables an actuary to obtain infinitely many finite-sample valid prediction intervals in the regression setting.
š” Research Summary
The paper addresses a longāstanding gap in actuarial science: the lack of finiteāsample valid prediction intervals for future insurance claims when explanatory variables are available (the regression setting). Existing approaches fall into three categories. Parametric models rely on strong distributional assumptions and are vulnerable to model misspecification and the selection effect. Nonāparametric methods avoid misspecification but typically require tuning parameters, which reāintroduce a selection effect, and they only guarantee asymptotic validity. Modelāfree methods such as decision trees or random forests sidestep both issues but still do not provide finiteāsample guarantees. Conformal prediction is the only known modelāfree technique that can deliver finiteāsample validity, yet prior conformal methods for regression either require approximations that break the guarantee or are computationally infeasible.
The authors propose a novel ātransformationābasedā strategy that bridges the gap between the unsupervised iid setting (where many finiteāsample valid conformal intervals exist) and the supervised regression setting. The key idea is to introduce an arbitrary, userāchosen transformation function (h:\mathbb{R}^p\to\mathbb{R}) (subject to (h(x)\ge 0) for insurance claims) and rewrite the dataāgenerating equation \
Comments & Academic Discussion
Loading comments...
Leave a Comment