Banner Banner

Memphis paper receives 2026 ACM SIGMOD Research Highlight Award

A new framework for eliminating redundant computation across all hardware environments

BIFOLD researchers Dr. Arnab Phani and Prof. Dr. Matthias Böhm have received the 2026* ACM SIGMOD Research Highlight Award for their paper "MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems." The award will be formally presented at SIGMOD 2026.

Modern machine learning (ML) systems use different types of computing resources, such as CPUs, GPUs, and distributed platforms like Apache Spark or Ray. These systems often split ML tasks into parts that run across these resources. However, during data analysis, the same computations are often repeated, which wastes time and resources. While earlier solutions can reuse results in specific systems, it is still difficult to do this efficiently across multiple types of computing environments, especially because of challenges like limited memory, data transfer costs, and coordination between tasks.

In this paper, the researchers introduce MEMPHIS, a new framework that helps ML systems reuse previously computed results more effectively across different computing resources. At the heart of MEMPHIS is an efficient caching system that keeps track of past computations and decides when to reuse, move, or discard them, while also managing memory efficiently. To address differences between computing environments, such as delayed or parallel execution and varying memory and communication speeds, they design adaptive strategies for managing this cache. They also enhance an ML compiler so it can better schedule tasks and exchange data efficiently.

Their experiments on a wide range of ML tasks show that MEMPHIS can make systems run up to 9.6 times faster than current approaches.

MEMPHIS is fully integrated into Apache SystemDS, an open-source machine learning system for the end-to-end data science lifecycle, from data preparation and feature transformations of input data such as text, images, and tabular data, to model training. It extends LIMA, an earlier framework from the same authors for reusing intermediate results within a single in-memory environment, to three backends: standard in-memory processing, distributed computing via Apache Spark, and GPU acceleration. The paper was also honored with the Best Research Paper Award at the EDBT Conference 2025.

 

About the ACM SIGMOD Research Highlight Award

The ACM SIGMOD Research Highlight Award is one of the most prestigious distinctions in database research. Presented annually since 2016 by the ACM Special Interest Group on Management of Data (SIGMOD), it recognizes research that addresses an important problem, represents a definitive milestone in solving it, and demonstrates significant impact potential within and beyond the research community. 

In his technical review of the Memphis paper, Arun Kumar (UC San Diego; here in the role of the SIGMOD Records editor) explains that the paper demonstrates how classical database principles, such as query optimization, caching, and memory management, are becoming essential for operating AI infrastructure efficiently and sustainably. This trend highlights the key role the database community plays in powering the ongoing AI boom.

For BIFOLD and its predecessor organization, this is the fourth time they have received the SIGMOD Research Highlight Award. Two earlier awards were associated with Prof. Dr. Volker Markl, today Co-Director of BIFOLD, and one was associated with Prof. Dr. Matthias Böhm, chair of BIFOLD’s DAMS research group.

 

About BIFOLD

The Berlin Institute for the Foundations of Learning and Data (BIFOLD) is one of Germany's six national AI research centers, founded in 2019 through the merger of the Berlin Big Data Center and the Berlin Center for Machine Learning. Based at TU Berlin and in partnership with Charité - Universitätsmedizin Berlin, BIFOLD conducts foundational research at the intersection of machine learning and large-scale data management. The institute receives permanent funding from the State of Berlin and the Federal Ministry of Research, Technology, and Space, reflecting its role as a cornerstone of Germany's long-term AI research strategy.

Dr. Arnab Phani is a postdoctoral researcher in the DEEM Lab, a research group headed by Sebastian Schelter at BIFOLD and Technische Universität Berlin. He received his PhD from TU Berlin, where he was a research associate in the DAMS Lab group headed by Matthias Böhm. Prior to his PhD, he was a Senior Software Engineer at Teradata Labs, Hyderabad, India.

Please read the BIFOLD researcher Spotlight if you like to learn more about Arnab and his research.

Prof. Dr. Matthias Böhm is a full professor of large-scale data engineering at BIFOLD and Technische Universität Berlin. His research group, the Big Data Engineering Lab, focuses on high-level, data science-centric abstractions, as well as on systems and tools to execute these tasks efficiently and scalably.

Links

Memphis Paper / Code / SIGMOD Record, Research Highlight

* ”2026 ACM SIGMOD Research Highlight Award” in accordance with the official ACM SIGMOD award listing. Some sources may use 2025 instead, reflecting the year of the selection process.