<div class="csl-bib-body">
<div class="csl-entry">Luo, L., Huang, Y., Chen, X., Zhao, Y., Yu, H., & Dustdar, S. (2026). Location Matters: LLM-Guided Joint Optimization of In-Network Aggregation Placement and Routing for DML Workloads. <i>IEEE Transactions on Network Science and Engineering</i>, <i>13</i>, 5978–5991. https://doi.org/10.1109/TNSE.2026.3654163</div>
</div>
-
dc.identifier.issn
2327-4697
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/227939
-
dc.description.abstract
In-network aggregation (INA) accelerates gradient aggregation in distributed machine learning (DML) by alleviating communication bottlenecks, but its effectiveness crucially depends on two location decisions: where to deploy INA functions and where to aggregate gradient flows. Most existing methods optimize INA placement and gradient flow routing independently, missing the advantages of joint optimization. This paper presents LLMINA, which leverages Large Language Models (LLMs) to automate the heuristic design for joint INA placement and gradient aggregation, aiming to minimize makespan (i.e., the total time required for all DML jobs to complete gradient aggregation). Directly using LLMs to generate end-to-end solutions is infeasible due to problem complexity and LLM limitations. Instead, LLMINA uses LLMs to generate heuristics for INA placement through an evolutionary process, and then applies an optimization-based heuristic for gradient routing that takes into account DML workload characteristics. Experiments across diverse network topologies and workloads show that LLMINA can significantly reduce makespan compared to state-of-the-art baselines. These results underscore that location matters for both INA deployment and aggregation, and highlight the potential of LLM-guided heuristic design for complex network resource optimization.
en
dc.language.iso
en
-
dc.publisher
IEEE COMPUTER SOC
-
dc.relation.ispartof
IEEE Transactions on Network Science and Engineering
-
dc.subject
distributed machine learning
en
dc.subject
heuristic
en
dc.subject
In-network aggregation
en
dc.subject
joint optimization
en
dc.subject
large language model (LLM)
en
dc.title
Location Matters: LLM-Guided Joint Optimization of In-Network Aggregation Placement and Routing for DML Workloads