<div class="csl-bib-body">
<div class="csl-entry">Redshaw, J., Ting, D. S. J., Brown, A., Hirst, J. D., & Gärtner, T. (2023). Krein support vector machine classification of antimicrobial peptides. <i>Digital Discovery</i>. https://doi.org/10.1039/D3DD00004D</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/175617
-
dc.description.abstract
Antimicrobial peptides (AMPs) represent a potential solution to the growing problem of antimicrobial resistance, yet their identification through wet-lab experiments is a costly and time-consuming process. Accurate computational predictions would allow rapid in silico screening of candidate AMPs, thereby accelerating the discovery process. Kernel methods are a class of machine learning algorithms that utilise a kernel function to transform input data into a new representation. When appropriately normalised, the kernel function can be regarded as a notion of similarity between instances. However, many expressive notions of similarity are not valid kernel functions, meaning they cannot be used with standard kernel methods such as the support-vector machine (SVM). The Kreĭn-SVM represents generalisation of the standard SVM that admits a much larger class of similarity functions. In this study, we propose and develop Kreĭn-SVM models for AMP classification and prediction by employing the Levenshtein distance and local alignment score as sequence similarity functions. Utilising two datasets from the literature, each containing more than 3000 peptides, we train models to predict general antimicrobial activity. Our best models achieve an AUC of 0.967 and 0.863 on the test sets of each respective dataset, outperforming the in-house and literature baselines in both cases. We also curate a dataset of experimentally validated peptides, measured against Staphylococcus aureus and Pseudomonas aeruginosa, in order to evaluate the applicability of our methodology in predicting microbe-specific activity. In this case, our best models achieve an AUC of 0.982 and 0.891, respectively. Models to predict both general and microbe-specific activities are made available as web applications.
en
dc.language.iso
en
-
dc.publisher
Royal Society of Chemistry (RSC)
-
dc.relation.ispartof
Digital Discovery
-
dc.subject
Machine Learning
en
dc.subject
Kernel Methods
en
dc.subject
Sequence Alignment
en
dc.subject
Antimicrobial Resistance
en
dc.subject
Indefinite Kernels
en
dc.title
Krein support vector machine classification of antimicrobial peptides
en
dc.type
Article
en
dc.type
Artikel
de
dc.contributor.affiliation
University of Nottingham, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.contributor.affiliation
University of Nottingham, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.contributor.affiliation
GSK Medicines Research Centre, Stevenage, UK
-
dc.contributor.affiliation
University of Nottingham, United Kingdom of Great Britain and Northern Ireland (the)
-
dcterms.dateSubmitted
2023-01-19
-
dc.type.category
Original Research Article
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.researchTopic.id
I4a
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
Digital Discovery
-
tuw.publication.orgunit
E194-06 - Forschungsbereich Machine Learning
-
tuw.publisher.doi
10.1039/D3DD00004D
-
dc.date.onlinefirst
2023-02-27
-
dc.identifier.eissn
2635-098X
-
dc.description.numberOfPages
10
-
tuw.author.orcid
0000-0003-1081-1141
-
tuw.author.orcid
0000-0002-2726-0983
-
tuw.author.orcid
0000-0001-5985-9213
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
100
-
item.languageiso639-1
en
-
item.openairetype
research article
-
item.grantfulltext
none
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
crisitem.author.dept
University of Nottingham
-
crisitem.author.dept
University of Nottingham
-
crisitem.author.dept
GSK Medicines Research Centre, Stevenage, UK
-
crisitem.author.dept
University of Nottingham
-
crisitem.author.dept
E194-06 - Forschungsbereich Machine Learning
-
crisitem.author.orcid
0000-0003-1081-1141
-
crisitem.author.orcid
0000-0002-2726-0983
-
crisitem.author.orcid
0000-0001-5985-9213
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering