<div class="csl-bib-body">
<div class="csl-entry">Jang, M., & Thomas Lukasiewicz. (2022). NoiER: An Approach for Training more Reliable Fine-Tuned Downstream Task Models. <i>IEEE/ACM Transactions on Audio, Speech and Language Processing</i>, <i>30</i>, 2514–2525. https://doi.org/10.1109/TASLP.2022.3193292</div>
</div>
-
dc.identifier.issn
2329-9290
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/146156
-
dc.description.abstract
The recent development in pretrained language models that are trained in a self-supervised fashion, such as BERT, is driving rapid progress in natural language processing. However, their brilliant performance is based on leveraging syntactic artefacts of the training data rather than fully understanding the intrinsic meaning of language. The excessive exploitation of spurious artefacts is a problematic issue: the distribution collapse problem, which is the phenomenon that the model fine-tuned on downstream tasks is unable to distinguish out-of-distribution sentences while producing a high-confidence score. In this paper, we argue that the distribution collapse is a prevalent issue in pretrained language models and propose noise entropy regularisation (NoiER) as an efficient learning paradigm that solves the problem without auxiliary models and additional data. The proposed approach improved traditional out-of-distribution detection evaluation metrics by 55% on average compared to the original fine-tuned models.
en
dc.language.iso
en
-
dc.publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
-
dc.relation.ispartof
IEEE/ACM Transactions on Audio, Speech and Language Processing
-
dc.subject
natural language processing
en
dc.subject
out-of-distribution detection
en
dc.subject
reliability of language model
en
dc.subject
text classification
en
dc.title
NoiER: An Approach for Training more Reliable Fine-Tuned Downstream Task Models
en
dc.type
Article
en
dc.type
Artikel
de
dc.identifier.url
http://dx.doi.org/10.1109/TASLP.2022.3193292
-
dc.contributor.affiliation
University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.description.startpage
2514
-
dc.description.endpage
2525
-
dc.type.category
Original Research Article
-
tuw.container.volume
30
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.researchTopic.id
I2
-
tuw.researchTopic.name
Computer Engineering and Software-Intensive Systems
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
IEEE/ACM Transactions on Audio, Speech and Language Processing