AI errors pose a significant challenge and hinder real-world applications. This work introduces a novel approach to coping with AI errors using weakly supervised error correctors that guarantee a specified level of error reduction. Our correctors have low computational cost and can be used to decide whether to abstain from making an unsafe classification. We provide new upper and lower bounds on the probability of error in the corrected system. In contrast to existing work, these bounds are distribution-agnostic, non-asymptotic, and can be computed efficiently using only the corrector's training data. They can also be used in settings with concept drift, where the observed frequencies of individual classes vary. The correctors can easily be updated, removed, or replaced in response to changes in the distributions within each class, without retraining the underlying classifier. The application of the approach is illustrated on two relevant and challenging tasks: (i) an image classification problem with scarce training data, and (ii) moderating the responses of large language models without retraining or otherwise fine-tuning them.
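To make the general idea concrete, the sketch below illustrates one plausible instantiation under our own assumptions; it is not the paper's implementation. A lightweight linear discriminant (here, a Fisher-style rule) is trained on a small, weakly labelled set of the base classifier's latent features marked "error" or "correct", and at inference time it flags predictions as unsafe so the system can abstain. All names (`LinearCorrector`, `predict_or_abstain`, `latent`, the regularization constant, and the threshold choice) are illustrative assumptions.

```python
import numpy as np

class LinearCorrector:
    """Illustrative Fisher-style linear discriminant separating erroneous
    from correct classifier outputs in latent space; cheap to train,
    update, remove, or replace without touching the base classifier."""

    def fit(self, feats_correct, feats_error):
        # Direction separating the class means, whitened by the pooled covariance.
        mu_c, mu_e = feats_correct.mean(0), feats_error.mean(0)
        pooled = np.cov(np.vstack([feats_correct, feats_error]).T)
        pooled += 1e-6 * np.eye(pooled.shape[0])   # small ridge for stability (assumed)
        self.w = np.linalg.solve(pooled, mu_e - mu_c)
        # Threshold halfway between the projected means; in practice it would be
        # tuned for the desired error-reduction / abstention trade-off.
        self.b = -0.5 * (self.w @ mu_c + self.w @ mu_e)
        return self

    def flags_error(self, feats):
        # True where the corrector predicts the base classifier is wrong.
        return feats @ self.w + self.b > 0.0


def predict_or_abstain(base_predict, latent, corrector, x):
    """Return the base prediction, or None (abstain) if flagged unsafe."""
    z = latent(x)                                   # 1-D latent feature vector (assumed)
    if corrector.flags_error(z[None, :])[0]:
        return None                                 # abstain: defer to a fallback policy
    return base_predict(x)
```

Because the corrector is a simple function of a fixed latent representation, swapping it out when the within-class distributions shift amounts to refitting `LinearCorrector` on a fresh small sample, in the spirit of the update/replace property claimed above.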
Funding
UKRI Turing AI Acceleration Fellowship (EP/V025295/2)
Arch-I-Scan: Automated recording and machine learning for collating Roman ceramic tablewares and investigating eating and drinking practices