Tian, Z., Lindner, P., Nissl, M., Koch, C., & Tannen, V. (2023). Generalizing Bulk-Synchronous Parallel Processing for Data Science: From Data to Threads and Agent-Based Simulations. Proceedings of the ACM on Management of Data (PACMMOD), 1(2), 1–28. https://doi.org/10.34726/5391
We generalize the bulk-synchronous parallel (BSP) processing model to make it better support agent-based simulations. Such simulations frequently exhibit hierarchical structure in their communication patterns which can be exploited to improve performance. We allow for the creation of temporary artificial network partitions during which agents synchronize only locally within their group in a way that does not compromise the correctness of a simulation. We have built a distributed engine, CloudCity, which uses this idea to improve the locality of computation, communication, and synchronization in such simulations. We experimentally evaluate the performance of our system on a benchmark of simulation workloads and compare it against other popular BSP-like systems, obtaining insights into the impact of various system design choices and optimization on simulation engine performance.
en
Project title:
Scalable Reasoning in Knowledge Graphs: VRG18-013 (WWTF Wiener Wissenschafts-, Forschu und Technologiefonds)