arXiv (CS.LG)
2026-06-11 12:00
DOI:
arXiv:2602.02229
Prediction-Powered Risk Monitoring of Deployed Models for Detecting Harmful Distribution Shifts
作者:
摘要 / Abstract
arXiv:2602.02229v2 Announce Type: replace
Abstract: We study the problem of monitoring model performance in dynamic environments where labeled data are limited. To this end, we propose prediction-powered risk monitoring (PPRM), a semi-supervised risk-monitoring approach based on prediction-powered inference (PPI). PPRM constructs anytime-valid lower bounds on the running risk by combining synthetic labels with a small set of true labels. Harmful shifts are detected via a threshold-based comparison with an upper bound on the nominal risk, satisfying assumption-free finite-sample guarantees on the type-I error. We demonstrate the effectiveness of PPRM through extensive experiments on image classification, large language model (LLM), and telecommunications monitoring tasks.