SIGMOD/PODS 2025
Berlin Welcomes the World of Data Management
Between June 22nd and June 27th, Berlin became a global epicenter of data management research. The ACM SIGMOD/PODS International Conference on Management of Data, one of the premier events in the world of database research, took place at the Intercontinental Berlin, attracting nearly 1200 participants. This year we received a record number of submissions (1008 research papers submitted) and we accepted a record number of papers (250 in all), one of the largest in SIGMOD history.
BIFOLD was proud to serve as one of the co-organizers of this landmark event. Hosting SIGMOD was a significant milestone; it brought many of the leading database researchers from around the globe here to Berlin. We want to extend our heartfelt thanks to the dedicated organizing team and the many volunteers who contributed to making this conference memorable. We are already looking forward to next year's SIGMOD, where — rumor has it — Joe Hellerstein will kick off the poster session with his trumpet!
The three outstanding keynote speakers — Christos H. Papadimitriou (Columbia University), Margo Seltzer (University of British Columbia), and Phil Bernstein (Microsoft) — were a highlight of the conference, delivering thought-provoking and inspiring talks.
Furthermore, we would like to congratulate the BIFOLD NebulaStream team for their successes. With great fervor, they worked arduously to ensure the open-source release of NebulaStream, the week of the SIGMOD conference. NebulaStream is a novel, data stream processing system for massively distributed, heterogeneous data streams in the cloud-edge continuum. In addition, NEEDMI Project team members prepared an impressive demo that won the SIGMOD 2025 Best Demo Honorable Mention. Kudos to them for all of their hard-work!
BIFOLD’s Contributions to SIGMOD/PODS 2025
BIFOLD / TU Berlin researchers and their collaborators contributed across multiple tracks and formats:
- “Fast and Scalable Data Transfer across Data Systems,” Haralampos Gavriilidis, Kaustubh Beedkar, Matthias Boehm, and Volker Markl [ https://dl.acm.org/doi/10.1145/3725294 ]
- “Disclosure-compliant Query Answering,” Rudi Poepsel-Lemaitre, Kaustubh Beedkar, and Volker Markl [ https://dl.acm.org/doi/10.1145/3698808 ]
- “CatDB: LLM-based Generation of Data-centric ML Pipelines,” Saeed Fathollahzadeh, Essam Mansour, and Matthias Boehm [Short Paper and Video: https://dl.acm.org/doi/10.1145/3722212.3725097 ]
- “NebulaStream: A High-performance Streaming Engine for Multi-modal Edge Applications,” Steffen Zeuch, Adrian Michalke, Aljoscha Lepping, Volker Markl, Ricardo Martinez, Nils Schubert, Lukas Schwerdtfeger, Taha Tekdogan, Ariane Ziehn, Christoph Falkensteiner, Kyle Krüger, Tobias Röschl, Alexander Meyer, and Svea Wilkending [Short Paper: https://dl.acm.org/doi/10.1145/3722212.3725118 ]
- “Finding What You’re Looking For: A Distribution-Aware Dataset Search Engine,” Lennart Behme, Leonard Geißler, Pratham Agrawal, Emil Badura, Benjamin Ueber, Kaustubh Beedkar, and Volker Markl [Short Paper and Video: https://dl.acm.org/doi/10.1145/3722212.3725104 ]
- “Mobility Stream Processing on NebulaStream and MEOS,” Mariana Duarte, Dwi Nugroho, Georges Tod, Evert Bevernage, Pieter Moelans, Emine Tas, Esteban Zimányi, Mahmoud Sakr, Steffen Zeuch, and Volker Markl [Short Paper: https://dl.acm.org/doi/10.1145/3722212.3725116 ]
- “Apache Wayang in Action: A Unified Data Analytics Framework,” Kaustubh Beedkar, Aurelien Bertrand, Haralampos Gavriilidis, Augusto Fonseca, Zoi Kaoudi, Mingxi Liu, Volker Markl, Juri Petersen, Fabio Porto, Victor Ribeiro, Mads Sejer Pedersen, Lucas Tavares, Michalis Vargiamis, and Chen Xu [Short Paper: https://dl.acm.org/doi/10.1145/3722212.3725081 ]
- “Navigating Data Errors in Machine Learning Pipelines,” Sebastian Schelter, Bojan Karlaš, and Babak Salimi
- 4th International Workshop on Data Systems Education (DataEd)
- “Teaching Large-Scale Data Management to Large-Scale Undergraduate Students,” Gereon Dusella, Lennart Behme, Rudi Poepsel-Lemaitre, Alexander Borusan, and Volker Markl
- 9th Workshop on Data Management for End-to-End Machine Learning (DEEM)
- “Towards Automated Task-Aware Data Validation,” Hao Chen and Sebastian Schelter
- SIGMOD Best Demo Honorable Mention: “NebulaStream: An extensible, high-performance streaming engine for multi-modal edge applications”
- Distinguished Reviewer: Ziawasch Abedjan
- General Chair: Volker Markl
- Finance Chair: Matthias Boehm
- Publicity/Social Media Chair: Ziawasch Abedjan
- Web Chairs: Sebastian Schelter and Luca Zecchini
- Proceedings Chair: Eleni Tzirita Zacharatou
- Remote Participation Chair: Tilmann Rabl
- Program Committee: Stefan Halfpap, Steffen Zeuch, Varun Pandey, Ziawasch Abedjan, and Matthias Boehm
- Workshop Co-organizers: BIFOLD DEEM Lab
We are proud to have helped organize SIGMOD/PODS this year and sincerely thank everyone who was involved in the preparation and execution of (and participation in) this conference — from the three keynote speakers, to the many researchers to volunteers and participants. We are already looking forward to the 2026 SIGMOD/PODS in Bengaluru (Bangalore), India!