arXiv (CS.CL) 2026-06-18 12:00 DOI: arXiv:2606.18856

Approximate Structured Diffusion for Sequence Labelling

Abstract

Sequence labelling, a core task in Natural Language Processing (NLP), consists of assigning a label to each token of an input sentence. From a machine learning perspective, it is often cast as a linear-chain Conditional Random Field (CRF) parametrised by a neural network. While this approach gives good empirical results, CRFs assume a finite decision span (e.g., label bigrams), which can limit their expressivity and hurt performance when long-range dependencies are required. We show that diffusion can be leveraged to train a CRF conditioned on an entire label sequence, with the caveat that the conditioning is on a noisy version of the labels. We show experimentally that this method, in conjunction with approximate CRF inference, improves label accuracy, yielding a 16.5% error reduction for POS tagging.
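
To make the "finite decision span" concrete, the following is a minimal sketch of exact Viterbi decoding for a standard linear-chain CRF whose only label-label interaction is a bigram transition score. It is not the diffusion-based method described in the abstract; the function name `viterbi`, the array names, and the toy scores are illustrative assumptions.

```python
import numpy as np

def viterbi(emissions, transitions):
    """Return the highest-scoring label path and its score.

    emissions:   (T, K) array, score of label k at token t (assumed input)
    transitions: (K, K) array, score of label i followed by label j
    """
    T, K = emissions.shape
    score = emissions[0].copy()            # best score of paths ending in each label
    backptr = np.zeros((T, K), dtype=int)  # best previous label for each (t, label)
    for t in range(1, T):
        # cand[i, j]: best path ending in i at t-1, extended with label j at t
        cand = score[:, None] + transitions + emissions[t][None, :]
        backptr[t] = cand.argmax(axis=0)
        score = cand.max(axis=0)
    # Follow back-pointers from the best final label
    path = [int(score.argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(backptr[t, path[-1]]))
    return path[::-1], float(score.max())

# Toy example: 3 tokens, 2 labels, transitions penalise label changes
emissions = np.array([[1.0, 0.0], [0.0, 0.5], [1.0, 0.0]])
transitions = np.array([[0.0, -2.0], [-2.0, 0.0]])
path, best_score = viterbi(emissions, transitions)
```

In this toy case, choosing each label independently would pick label 1 for the middle token, while the bigram transition scores make the decoder keep label 0 throughout. Each decision still depends only on adjacent labels, which is the limitation the abstract refers to.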
