<div class="csl-bib-body">
<div class="csl-entry">Reisecker, M., Marcelino, C., Pusztai, T., & Nastic, S. (2025). Gaia: Hybrid Hardware Acceleration for Serverless AI in the 3D Compute Continuum. In <i>BDCAT ’25 : Proceedings of the IEEE/ACM 12th International Conference on Big Data Computing, Applications and Technologies</i>. IEEE/ACM 12th International Conference on Big Data Computing, Applications and Technologies (BDCAT 2025), Nantes, France. ACM. https://doi.org/10.1145/3773276.3774299</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/227838
-
dc.description.abstract
Serverless computing offers elastic scaling and pay-per-use execution, making it well-suited for AI workloads. As these workloads run in heterogeneous environments such as the Edge-Cloud-Space 3D Continuum, they often require intensive parallel computation, which GPUs can perform far more efficiently than CPUs. However, current platforms struggle to manage hardware acceleration effectively, as static user-device assignments fail to ensure SLO compliance under varying loads or placements, and one-time dynamic selections often lead to suboptimal or cost-inefficient configurations.To address these issues, we present Gaia, a GPU-as-a-service model and architecture that makes hardware acceleration a platform concern. Gaia combines (i) a lightweight Execution Mode Identifier that inspects function code at deploy time to emit one of four execution modes, and a Dynamic Function Runtime that continuously re-evaluates user-defined SLOs to promote or demote between CPU- and GPU backends. Our evaluation shows that it seamlessly selects the best hardware acceleration for the workload, reducing end-to-end latency by up to 95%. These results indicate that Gaia enables SLO-aware, cost-efficient acceleration for serverless AI across heterogeneous environments.
en
dc.description.sponsorship
Internet Privatstiftung Austria
-
dc.description.sponsorship
European Commission
-
dc.description.sponsorship
European Commission
-
dc.language.iso
en
-
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
-
dc.subject
3D Continuum
en
dc.subject
AI
en
dc.subject
Cloud
en
dc.subject
Edge
en
dc.subject
FaaS
en
dc.subject
GPU
en
dc.subject
LEO
en
dc.subject
Serverless
en
dc.title
Gaia: Hybrid Hardware Acceleration for Serverless AI in the 3D Compute Continuum