Babelfish: Efficient Execution of Polyglot Queries

Philipp Marian Grulich
Steffen Zeuch
Volker Markl

October 01, 2021

Today's users of data processing systems come from different domains, have different levels of expertise, and prefer different programming languages. As a result, analytical workload requirements have shifted from relational to polyglot queries involving user-defined functions (UDFs). Although some data processing systems support polyglot queries, they often do so by embedding third-party language runtimes. This embedding induces a substantial performance overhead, as it causes additional data materialization between execution engines.