IPDPS 2025 square logo
The Fifteenth International Workshop on
Accelerators and Hybrid Emerging Systems
(AsHES)

To be held in conjunction with
39th IEEE International Parallel and Distributed Processing Symposium
Milan, Italy
June 3rd, 2025

Welcome & Opening remarks

8:30 am - 8:40 am

Keynote: Escaping the Performance Portability Trap: A case of re-arranging priorities

8:40 am - 9:40 am

Escaping the Performance Portability Trap: A Case of re-arranging priorities

Konstantinos Parasyris, LLNL

Abstract: Konstantinos Parasyris "In this keynote, I argue that our pursuit of Performance, Portability, and Productivity has led us into a trap. We often treat these goals as equally important and equally attainable at all times — but in practice, this assumption rarely holds. Drawing from recent work in compilers and system software — including kernel-level replay, JIT compilation, and ML-guided runtime systems — I make the case for a re-evaluation of priorities."

Bio: Konstantinos Parasyris is a Computer Scientist at the Center for Applied Scientific Computing (CASC) at Lawrence Livermore National Laboratory (LLNL). His research focuses on programming languages, compilers, and performance engineering for heterogeneous high-performance computing (HPC) systems. He is the involved in several research projects such as Mneme, a record-replay framework for GPU kernel autotuning, Proteus, a just-in-time optimization system for GPU performance and the AMS (Autonomous Multi-Scale Simulation) surrogate model framework. His work has been published at top venues including SC, PACT, CGO, PPoPP, TACO and TPDS. Konstantinos has served on the program committees of SC, IPDPS, EuroMPI, and was a member of the review board for IEEE Transactions on Parallel and Distributed Systems (TPDS). He holds a Ph.D. in Computer Science from the University of Thessaly, Greece, and actively organizes workshops such as LLPP, ESwML, and AI4Dev.

Session: Applications

9:40 am - 10:30 am

  • 9:40 am - 10:05 am (Regular)
    Performance Characterization of Parallel Combination Generators on CPU and GPU Systems
    Brian Donnelly and Michael Gowanlock
  • 10:05 am - 10:30 am (Regular)
    Slaying a Life: Optimizing GPU-accelerated Game of Life Stencil
    Matyáš Brabec, Jiří Klepl and Martin Kruliš

Break

10:30 am - 11:00 am

Session: Performance Modeling

11:00 am - 11:50 pm

  • 11:00 am - 11:25 am (Regular)
    Science per Dollar: Modeling Emerging Node Architectures for Accelerator-centric Computing
    Jordan Abt, Ali Farazdaghi, Elizabeth Reid, Curtis Shorts, Tooraj Taraz, Zachary Silva, Ethan Shama, Scott Levy, Whit Schonbein, Matthew Dosanjh, Amirreza Barati Sedeh and Ryan Grant
  • 11:25 am - 11:50 am (Regular)
    Heterogeneity-Aware Software Performance Characterization via Graph Machine Learning
    Ronaldo Armando Canizales Turcios and Jedidiah McClurg

Session: Performance Characterization

11:50 am - 12:15 pm

  • 11:50 am - 12:15 pm (Regular)
    Apple vs Oranges: Evaluating the Apple Silicon M-Series SoCs for HPC Performance and Efficiency
    Paul Hübner, Andong Hu, Ivy Peng and Stefano Markidis

Session: Runtimes

12:15 pm - 12:30 pm

  • 12:15 pm - 12:30 pm (Short)
    ARTEMIS: Adaptive Real-Time Task Execution & Management in Heterogeneous Systems
    Tom Springer and Peiyi Zhao

Lunch Break

12:30 pm - 2:00 pm

Session: GPU Communication

2:00 pm - 2:55 pm

  • 2:00 pm - 2:15 pm (Short)
    Exploring NCCL Tuning Strategies for Distributed Deep Learning
    Majid Salimi Beni, Ruben Laso, Biagio Cosenza, Siegfried Benkner and Sascha Hunold
  • 2:15 pm - 2:30 pm (Short)
    Collaborative Bandwidth-Efficient Intra-Node Allreduce
    Amirhossein Sojoodi, Ali Farazdaghi, Hamed Sharifian, Ryan E Grant and Ahmad Afsahi
  • 2:30 pm - 2:55 pm (Regular)
    nbshmem: Enabling GPU-Initiated Multi-GPU Communication in Python
    Calvin Bombis and Lena Oden

Session: Virtualization & GPU Sharing

2:55 pm - 3:20 pm

  • 2:55 pm - 3:20 pm (Regular)
    Towards an efficient containerized cloud gaming platform
    Adrien Gégout, Djob Mvondo, Davide Frey and Pascal Manchon

Best-paper award & Closing Remarks

3:20 pm - 3:30 pm

Presentation Details

Presenters are expected to target 25 minutes (Regular papers) or 15 minutes (Short papers) for the talks with 5 minutes for questions.