<div class="csl-bib-body">
<div class="csl-entry">Recski, G., & Kádár, F. (2023). Language complexity in human and machine translation: a preliminary study. In C. Orasan, R. Mitkov, G. Corpas Pastor, & J. Monti (Eds.), <i>International Conference on Human-Informed Translation and Interpreting Technology (HiT-IT 2023). Proceedings</i> (pp. 268–281). Incoma Ltd. http://hdl.handle.net/20.500.12708/187885</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/187885
-
dc.description.abstract
Systematic comparison between machine translation (MT) and human translation (HT) is mostly limited to the evaluation of MT output with HT as reference, as opposed to a more general study of the properties of MT and HT output texts. We present preliminary experiments investigating di erences between MT and HT in terms of readability and language complexity. We perform both quantitative and qualitative comparison of the outputs of machine and human translation, using samples of English text across multiple domains and genres and their Hungarian translations created by humans and by the state-of-the-art machine translation system deepl. Our results show that machine translation produces somewhat simpler text than human translation on 3 out of 4 samples, and on 2 samples this effect is caused primarily by human translators using a higher number of complex words. We release all software used in our experiments to facilitate further studies on larger samples, additional languages and domains, and using alternative MT systems.
en
dc.language.iso
en
-
dc.relation.ispartofseries
Workshop on Human-Informed Translation and Interpreting Technology
-
dc.subject
Machine Translation
en
dc.subject
Natural Language Processing
en
dc.subject
Language complexity
en
dc.title
Language complexity in human and machine translation: a preliminary study
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
Deloitte Hungary, Hungary
-
dc.contributor.editoraffiliation
University of Surrey, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.contributor.editoraffiliation
Lancaster University, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.contributor.editoraffiliation
Universidad de Málaga, Spain
-
dc.contributor.editoraffiliation
University of Naples - L'Orientale, Italy
-
dc.description.startpage
268
-
dc.description.endpage
281
-
dc.type.category
Full-Paper Contribution
-
dc.relation.eissn
2683-0078
-
tuw.booktitle
International Conference on Human-Informed Translation and Interpreting Technology (HiT-IT 2023). Proceedings
-
tuw.peerreviewed
true
-
tuw.relation.publisher
Incoma Ltd.
-
tuw.relation.publisherplace
Shoumen
-
tuw.researchTopic.id
X1
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Beyond TUW-research foci
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
50
-
tuw.researchTopic.value
50
-
tuw.linking
https://hit-it-conference.org/proceedings/
-
tuw.publication.orgunit
E194-04 - Forschungsbereich Data Science
-
tuw.publication.orgunit
E194 - Institut für Information Systems Engineering
-
dc.description.numberOfPages
14
-
tuw.author.orcid
0000-0001-5551-3100
-
tuw.editor.orcid
0000-0003-2067-8890
-
tuw.editor.orcid
0000-0001-6688-1531
-
tuw.event.name
International Conference on Human-Informed Translation and Interpreting Technology (HiT-IT 2023)
en
tuw.event.startdate
07-07-2023
-
tuw.event.enddate
09-07-2023
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Naples
-
tuw.event.country
IT
-
tuw.event.presenter
Recski, Gábor
-
wb.sciencebranch
Sprach- und Literaturwissenschaften
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
6020
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
50
-
wb.sciencebranch.value
50
-
item.openairetype
conference paper
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.languageiso639-1
en
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.grantfulltext
restricted
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
Deloitte Hungary, Hungary
-
crisitem.author.orcid
0000-0001-5551-3100
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering