Brussels / 1 & 2 February 2025

Data Analytics



Read the Call for Papers at https://lists.fosdem.org/pipermail/fosdem/2024q4/003587.html.

This DevRoom is a celebration of Open Source Data Analytics projects of all types and sizes. We invite project contributors to present their features, architecture, design, real-world use cases, and integrations. We also encourage end users to share how open-source data analytics projects are helping them solve their everyday challenges.

See https://javier.github.io/data_analytics_devroom_cfp_fosdem_2025/ for more information.

Saturday

Event | Speakers | Start | End
What the Spec?!: New Features in Apache Iceberg™ Table Format V3 | Danica Fine, Russell Spitzer | 10:30 | 11:00
Graph Databases after 15 Years – Where Are They Headed? | Gábor Szárnyas | 11:10 | 11:40
Apache XTable - Interoperability Across Apache Hudi, Apache Iceberg, and Delta Lake | Dipankar Mazumdar | 11:50 | 12:20
Exactly-Once Event Processing E2E: Bridging Apache Flink and Kafka for Reliable Data Streams | Adi Polak | 12:30 | 13:00
Accelerating QuestDB: Lessons from a 6x Query Performance Boost | javier ramirez, Jaromir Hamala | 13:10 | 13:40
ODBC Takes an Arrow to the Knee | Matthew Topol | 13:50 | 14:20
Apache Arrow tensor arrays: an approach for storing tensor data | Rok Mihevc, Alenka | 14:30 | 14:35
How we built a new powerful JSON data type for ClickHouse | Pavel Kruglov | 14:45 | 15:15
volesti: sampling efficiently from high dimensional distributions | Vissarion Fisikopoulos | 15:25 | 15:55
dbt-score: a linter for your dbt model metadata | Jochem van Dooren | 16:05 | 16:35
Open Source Business Intelligence - Introduction to Apache Superset | Evan Rusackas, Maxime Beauchemin | 16:45 | 17:15
Enhancing Airflow for Analytics, Data Engineering, and ML at Wikimedia | Ben Tullis, Balthazar Rouberol | 17:25 | 17:55
Developing Custom UIs to Explore Graph Databases Using Sigma.js | Alexis Jacomy | 18:05 | 18:35
A Business Intelligence architecture for Social and Solidarity Economy. | Jordi Isidro Llobet | 18:45 | 18:55