<div class="csl-bib-body">
<div class="csl-entry">Wang, Y., Saleh, A., Donta, P. K., Hossein Motlagh, N., Lovén, L., Tarkoma, S., & Dustdar, S. (2025). iKafka: Intelligent Storage Management for Adaptive Event Streaming in Kafka. In <i>2025 IEEE International Conference on Edge Computing and Communications (EDGE)</i> (pp. 34–43). IEEE. https://doi.org/10.1109/EDGE67623.2025.00013</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/225655
-
dc.description.abstract
Continuous evaluation of Distributed Computing Continuum Systems heavily relies on efficient communication models, with event streaming Publish-Subscribe (Pub-Sub) systems playing a key role in ensuring fault tolerance, scalability, and real-time analytics across heterogeneous tiers. On event streaming platforms like Apache Kafka, consumer applications often exhibit periodic or event-driven patterns when revisiting historical events, which requires storing these events over time. However, accumulating all events increases storage demands. To address this challenge, Kafka's default log retention policies, governed by static time or size thresholds, may prematurely delete data that will be revisited in future cycles. Unfortunately, static time- or size-based retention policies are insufficient, as they fail to maintain equilibrium between resource utilization, cost, and quality of service (QoS). As a result, intelligent and adaptive storage management strategies are required to minimize storage requirements while maintaining enhanced QoS. In this context, we propose a Light Gradient Boosting Machine (LightGBM)-based adaptive storage optimization in event streaming Kafka broker (namely, iKafka) to identify periodic consumer patterns and determine near-optimal retention times for events. iKafka also considers adversarial attacks by monitoring prediction accuracy to determine whether to use the predicted retention times or revert to default retention times. We evaluate the proposed iKafka system with an air quality use case, and our results demonstrate approximately 5.5× less memory resources over traditional Kafka under ideal conditions.
en
dc.language.iso
en
-
dc.relation.ispartofseries
IEEE International Conference on Edge Computing (EDGE)
-
dc.subject
quality of service
en
dc.subject
event-driven architecture
en
dc.subject
Kafka broker
en
dc.subject
light gradient boosting machine
en
dc.subject
Publish/subscribe systems
en
dc.subject
storage management
en
dc.title
iKafka: Intelligent Storage Management for Adaptive Event Streaming in Kafka
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
University of Helsinki, Finland
-
dc.contributor.affiliation
University of Oulu, Finland
-
dc.contributor.affiliation
Stockholm University, Sweden
-
dc.contributor.affiliation
University of Helsinki, Finland
-
dc.contributor.affiliation
University of Oulu, Finland
-
dc.contributor.affiliation
University of Helsinki, Finland
-
dc.relation.isbn
979-8-3315-5559-7
-
dc.relation.doi
10.1109/EDGE67623.2025
-
dc.relation.issn
2767-990X
-
dc.description.startpage
34
-
dc.description.endpage
43
-
dc.type.category
Full-Paper Contribution
-
dc.relation.eissn
2767-9918
-
tuw.booktitle
2025 IEEE International Conference on Edge Computing and Communications (EDGE)
-
tuw.peerreviewed
true
-
tuw.relation.publisher
IEEE
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-02 - Forschungsbereich Distributed Systems
-
tuw.publisher.doi
10.1109/EDGE67623.2025.00013
-
dc.description.numberOfPages
10
-
tuw.author.orcid
0000-0002-8233-6071
-
tuw.author.orcid
0000-0001-9923-9879
-
tuw.author.orcid
0000-0001-9475-4839
-
tuw.author.orcid
0000-0003-4220-3650
-
tuw.author.orcid
0000-0001-6872-8821
-
tuw.event.name
IEEE International Conference on Edge Computing and Communications (IEEE EDGE 2025)
en
tuw.event.startdate
07-07-2025
-
tuw.event.enddate
12-07-2025
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Helsinki
-
tuw.event.country
FI
-
tuw.event.presenter
Wang, Yangyang
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
100
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.fulltext
no Fulltext
-
item.languageiso639-1
en
-
item.grantfulltext
none
-
item.openairetype
conference paper
-
item.cerifentitytype
Publications
-
crisitem.author.dept
University of Helsinki, Finland
-
crisitem.author.dept
University of Oulu, Finland
-
crisitem.author.dept
E194-02 - Forschungsbereich Distributed Systems
-
crisitem.author.dept
University of Helsinki, Finland
-
crisitem.author.dept
University of Oulu, Finland
-
crisitem.author.dept
University of Helsinki, Finland
-
crisitem.author.dept
E194-02 - Forschungsbereich Distributed Systems
-
crisitem.author.orcid
0000-0002-8233-6071
-
crisitem.author.orcid
0000-0001-9923-9879
-
crisitem.author.orcid
0000-0003-4220-3650
-
crisitem.author.orcid
0000-0001-6872-8821
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering