Accepted papers

Research Track

  • Introducing GPU persistent graphs for time-sensitive workflows - - Cyril Cetre, Florian Ferreira, Raomi Barrere, Damien Gratadour

  • Non-linear Programming for the Network Calculus Analysis of FIFO Feedforward Networks - - Lukas Herll, Steffen Bondorf

  • Generating Executable Microservice Applications for Performance Benchmarking - - Yannik Lubas, Martin Straesser, Andrao Bauer, Samuel Kounev

  • Utilizing Graph Neural Networks for Effective Link Prediction in Microservice Architectures - - Ghazal Khodabandeh, Alireza Ezaz, Majid Babaei, Naser Ezzati-Jivan

  • Proportional Fairness and Isolation for Serverless Applications over FaaS Platforms - - George Kelantonakis, Fallia Kourou, Kostas Magoutis

  • PARAGRAPH: Phase-Aware Resource Demand Profiling for HPDA/HPC Jobs - (Short Paper) - Ivo Rohwer, Nikolas Herbst, Maximilian Schwinger, Peter Friedl, Michael Stephan, Samuel Kounev

  • Parallel GPU-Enabled Algorithms for SpGEMM on Arbitrary Semirings with Hybrid Communication - (Short Paper) - Thomas McFarland, Julian Bellavita, Giulia Guidi

  • HeteroBench: Multi-kernel Benchmarks for Heterogeneous Systems - - Hongzheng Tian, Alok Mishra, Zhiheng Chen, Rolando Pablo Hong Enriquez, Dejan Milojicic, Eitan Frachtenberg, Sitao Huang

  • Component-Based Analytical Modeling of GPU Runtime Performance: a Case-Study in Scientific Computing - - Jolly Chen, Ana Lucia Varbanescu, Monica Dessole

  • Quantifying Data Leakage in Failure Prediction Tasks - - Daniel Grillmeyer, Marius Hadry, Veronika Lesch, Vanessa Borst, Robert Leppich, Andrao Bauer, Samuel Kounev

  • On-demand Memory Compression of Stream Aggregates through Reinforcement Learning - - Jingyu Liu, Vincenzo Gulisano

  • An Analysis of User-space Idle State Instructions on x86 Processors - (Short Paper) - Malte-Christian Kuns, Robert Schoene, Hannes Trajpgen, Wolfgang E. Nagel

  • Understanding the Energy Consumption of Cloud-native Software - - Lars Andringa, Brian Setz, Vasilios Andrikopoulos

  • An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models - - Xiaoyu Ch, Sacheendra Talluri, Qingxian Lu, Alexandru Iosup

  • PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time - - Alireza Pourali, Arian Boukani, Hamzeh Khazaei

  • Columbo: A Reasoning Framework for Kubernetes Configuration Space - - Matthijs Jansen, Sacheendra Talluri, Krijn Doekemeijer, Nick Tehran, Alexandru Iosup, Animesh Trivedi

  • Multi-Strided Access Patterns to Boost Hardware Prefetching - - Miguel Blom, Kristian Rietveld, Rob van Nieuwpoort

  • A Novel Approach for Detecting Noisy Neighbors in CPU-Isolated Cgroups Environments - (Short Paper) - Simon Volpert, Sascha Winkelhofer, Jajrg Domaschka, Stefan Wesner

  • Large Language Model Fine-tuning with Low-Rank Adaptation: A Performance Exploration - - Bagus Hanindhito, Bhavesh Patel, Lizy K. John

  • Better memory tiering, right from the first placement - - Joao Barreto, Bartosz Chominski, Andrao Gonasalves, Fedar Karabeinikau, Maciej Maciejewski, Joao Pavoas, Jakub Schmiegel, Kostiantyn Storozhuk

  • Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models - (Short Paper) - Tom Wallace, Beatrice Ombuki-Berman, Naser Ezzati-Jivan

  • Energy Metrics for Edge Microservice Request Placement Strategies - (Short Paper) - Klervie Toczao, Simin Nadjm-Tehrani

  • Uplink End-to-End Latency Characterization of a 5G NSA Access Network - - Orangel Azuaje Contreras, Ana Aguiar, Peter Steenkiste

  • BottleMod: Modeling Data Flows and Tasks for Fast Bottleneck Analysis - (Short Paper) - Ansgar Lajayer, Joel Witzke, Florian Schintke, Bjajrn Scheuermann

Industry Track

  • Bridging Clusters: A Comparative Look at Multicluster Networking Performance in Kubernetes - - Sai Sindhur Malleni, Raúl Sevilla, José Castillo Lema, André Bauer

  • Shaved Ice: Optimal Compute Resource Commitments for Dynamic Multi-Cloud Workloads - - Murray Stokely, Orestis Kostakis, Neel Nadgir, Jack Peele

  • Accelerating Model Optimization on the Edge Through Automated Performance Benchmarking and End-to-End Profiling - - Nayara Aguiar, Helen Chigirinskaya, Jie Chen, Anoush Najarian

  • Beyond Maximum Throughput: Explore Full Operational Envelope for Capacity Planning - - Xiaosong Lou

  • Cost optimization and performance control in the hybrid multi-cloud environment - - Boris Zibitsker, Alex Lupersolsky

  • Towards Workload-aware Cloud Efficiency: A Large-scale Empirical Study of Cloud Workload Characteristics - - Anjaly Parayil, Jue Zhang, Xiaoting Qin, Inigo Goiri, Lexiang Huang, Timothy Zhu, Chetan Bansal, Lexiang Huang

  • CADAEC: Content-Aware Deployment of AI Workloads in Edge-Cloud Ecosystem - - Ratul Kishore Saha, Sparsh Mittal, Rekha Singhal, Manoj Nambiar

  • Platform Performance Suite (PPS): A Framework for Performance Analysis & Diagnosis of Complex Cyber-Physical Systems - - Konstantinos Triantafyllidis, Jos Hegge, Yuri Blankenstein, Sobhan Niknam

Journal First

  • RPerf: Mining User Reviews Using Topic Modeling to Assist Performance Testing: An Industrial Experience Report - - Zehao Wang, Wei Liu, Jinfu Chen, Tse-Hsun (Peter) Chen

  • Performance Modeling of Distributed Data Processing in Microservice Applications - - Yicheng Gao, Giuliano Casale, Rekha Singhal

  • IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency - - Saeid Ghafouri, Kamran Razavi, Mehran Salmani, Alireza Sanaee, Tania Lorido Botran, Lin Wang, Joseph Doyle, Pooyan Jamshidi

  • Multi-Criteria Optimization of Real-Time DAGs on Heterogeneous Platforms under P-EDF - - Tommaso Cucinotta, Alexandre Amory, Gabriele Ara, Francesco Paladino, Marco Di Natale

Artifact Evaluation Track

  • SplitTracr: A Flexible Performance Evaluation Tool for Cooperative Inference and Split Computing - - Nicholas Bovee, Izhar Ali, Gopi Patapanchala, Suraj Bitla, Shen Shyang Ho

  • The Kieker Observability Framework Version 2 - - Shinhyung Yang, David Georg Reichelt, Reiner Jung, Marcel Hansson, Wilhelm Hasselbring

  • A Dataset of Performance Measurements and Alerts from Mozilla - - Mohamed Bilel Besbes, Diego Elias Costa, Suhaib Mujahid, Gregory Mierzwinski, Marco Castelluccio

Data Challenge

  • TraceLens: Early Detection of Software Anomalies Using Critical Path Analysis - - M. Nourollahi, A. Haghshena, M. Dagenais

  • Kernel-Level Event-Based Performance Anomaly Detection in Software Systems under Varying Load Conditions - - A. Njoku, H. Li

Emerging Research Track

  • Improving Runtime Performance in Java: A Systematic Detection and Refactoring Approach for Lock Contention Code Smells - (Work In progress paper) - Ankita Mukherjee, Ashadullah Shawon, Ramiro Liscano, Akramul Azim, Md Asif Khan, Joseph Robertson, Vijay Sundaresan, Yee-Kang Chang

  • Optimizing Memory Access Patterns through Automatic Data Layout Transformation - (Work in Progress Paper) - Jolly Chen, Ana Lucia Varbanescu, Axel Naumann

  • Modeling and Optimizing Runtime Adaptation Strategies at Design-Time using Evolutionary Algorithms - (Vision Paper) - Martina Rapp, Max Scheerer, Ralf Sieger, Ralf Reussner, Raffaela Mirandola

Industry Presentation Track

  • Capacity Planning and Performance Modelling for Cloud Services: A Case Study on Messaging Service Workloads - - Rupinder Virk, Syed Uzzaman, Subramanian Rangaswamy, Pradeep Sonawane

  • Optimizing Hunter to Compute Change Points in Constant Time and What that Means for UX - - Henrik Ingo

  • Developing Software with Performance in Mind - - Josef Mayrhofer

  • Performance Optimizations for Scaling LLM based Log Analytics Tool - - Harshit Kumar, Pranjal Gupta, Karan Bukar, Seema Nagar, Prateeti Mohapatra, Debanjana Kar

  • Will Performance Engineers Be the Cloud Cost Saving Rock Stars? - - Andrew Lee

Posters and Demonstrations Track

  • LogAn: An LLM-Based Log Analytics Tool with Causal Inferencing - - P. Gupta, K. Bhukar, H. Kumar, S. Nagar, P. Mohapatra, D. Kar

  • COQO: Cost-Optimal Query Orchestration Tool - - K. Singh, R. Singh, S. Kunde, M. Mishra, R. Singhal, M. Nambiar"