Brussels / 1 & 2 February 2025

schedule

UB5.132


Day Start End Track(s)
Saturday 10:30 18:55 Data Analytics
Sunday 09:00 16:55 HPC, Big Data & Data Science
09 10 11 12 13 14 15 16 17 18
Saturday What the Spec?!: New Features in Apache Iceberg™ Table Format V3
Graph Databases after 15 Years – Where Are They Headed?
Apache XTable - Interoperability Across Apache Hudi, Apache Iceberg, and Delta Lake
Exactly-Once Event Processing E2E: Bridging Apache Flink and Kafka for Reliable Data Streams
Accelerating QuestDB: Lessons from a 6x Query Performance Boost
ODBC Takes an Arrow to the Knee
Apache Arrow tensor arrays: an approach for storing tensor data
How we built a new powerful JSON data type for ClickHouse
volesti: sampling efficiently from high dimensional distributions
dbt-score: a linter for your dbt model metadata
Open Source Business Intelligence - Introduction to Apache Superset
Enhancing Airflow for Analytics, Data Engineering, and ML at Wikimedia
Developing Custom UIs to Explore Graph Databases Using Sigma.js
A Business Intelligence architecture for Social and Solidarity Economy.
Sunday Optimizing Resource Utilization for Interactive GPU Workloads with Transparent Container Checkpointing
Efficient Histogramming for High-Performance Computing in C++ with YODA
Explainable forecasting from big weather data: rapid and sustainable solutions
The High Performance Software Foundation (HPSF)
Environment Modules: why this old idea is still useful today and what's next
Programming models with the ROCm™ compiler
Adding built-in support for basic performance test analytics to ReFrame
Making Data Fun Again: Extending EESSI to improve Research Data Management
Running Kubernetes Workloads on HPC with HPK
OpenCL, CUDA, and HIP as compilation targets for functional array programs
Harnessing Reduced Precision for Accurate and Efficient Scientific Computing in HPC
Easier API Interoperability: writing a bindings Generator to C/C++ with Coccinelle
A Pantheon of The Gods: Open Source Multiphysics Software for Analysis of Fusion Power Plant Systems
Effect of kernel optimizations on HPC workloads performance
Multithreading in Python using OpenMP?
Mapping Applications to the Hardware Portably and Transparently
Job-specific performance monitoring on HPC clusters: Challenges and Solutions

Events

Title Speakers Track Start End

Saturday

  What the Spec?!: New Features in Apache Iceberg™ Table Format V3
Danica Fine, Russell Spitzer Data Analytics 10:30 11:00
  Graph Databases after 15 Years – Where Are They Headed?
Gábor Szárnyas Data Analytics 11:10 11:40
  Apache XTable - Interoperability Across Apache Hudi, Apache Iceberg, and Delta Lake
Dipankar Mazumdar Data Analytics 11:50 12:20
  Exactly-Once Event Processing E2E: Bridging Apache Flink and Kafka for Reliable Data Streams
Adi Polak Data Analytics 12:30 13:00
  Accelerating QuestDB: Lessons from a 6x Query Performance Boost
javier ramirez, Jaromir Hamala Data Analytics 13:10 13:40
  ODBC Takes an Arrow to the Knee
Matthew Topol Data Analytics 13:50 14:20
  Apache Arrow tensor arrays: an approach for storing tensor data
Rok Mihevc, Alenka Data Analytics 14:30 14:35
  How we built a new powerful JSON data type for ClickHouse
Pavel Kruglov Data Analytics 14:45 15:15
  volesti: sampling efficiently from high dimensional distributions
Vissarion Fisikopoulos Data Analytics 15:25 15:55
  dbt-score: a linter for your dbt model metadata
Jochem van Dooren Data Analytics 16:05 16:35
  Open Source Business Intelligence - Introduction to Apache Superset
Evan Rusackas, Maxime Beauchemin Data Analytics 16:45 17:15
  Enhancing Airflow for Analytics, Data Engineering, and ML at Wikimedia
Ben Tullis, Balthazar Rouberol Data Analytics 17:25 17:55
  Developing Custom UIs to Explore Graph Databases Using Sigma.js
Alexis Jacomy Data Analytics 18:05 18:35
  A Business Intelligence architecture for Social and Solidarity Economy.
Jordi Isidro Llobet Data Analytics 18:45 18:55

Sunday

  Optimizing Resource Utilization for Interactive GPU Workloads with Transparent Container Checkpointing
Adrian Reber, Radostin Stoyanov, Viktória Spišaková HPC, Big Data & Data Science 09:00 09:25
  Efficient Histogramming for High-Performance Computing in C++ with YODA
Christian Gutschow HPC, Big Data & Data Science 09:30 09:55
  Explainable forecasting from big weather data: rapid and sustainable solutions
David Salvador-Jasin HPC, Big Data & Data Science 10:00 10:25
  The High Performance Software Foundation (HPSF)
Todd Gamblin HPC, Big Data & Data Science 10:55 11:05
  Environment Modules: why this old idea is still useful today and what's next
Xavier Delaruelle HPC, Big Data & Data Science 11:05 11:30
  Programming models with the ROCm™ compiler
Jan-Patrick Lehr HPC, Big Data & Data Science 11:35 12:00
  Adding built-in support for basic performance test analytics to ReFrame
Felix Abecassis, Vasileios Karakasis HPC, Big Data & Data Science 12:00 12:25
  Making Data Fun Again: Extending EESSI to improve Research Data Management
Thomas Röblitz HPC, Big Data & Data Science 12:30 12:55
  Running Kubernetes Workloads on HPC with HPK
Antony Chazapis HPC, Big Data & Data Science 13:30 13:55
  OpenCL, CUDA, and HIP as compilation targets for functional array programs
Troels Henriksen HPC, Big Data & Data Science 14:00 14:10
  Harnessing Reduced Precision for Accurate and Efficient Scientific Computing in HPC
Nima Sahraneshinsamani HPC, Big Data & Data Science 14:10 14:20
  Easier API Interoperability: writing a bindings Generator to C/C++ with Coccinelle
Michele Martone, Ivan Pribec HPC, Big Data & Data Science 14:20 14:30
  A Pantheon of The Gods: Open Source Multiphysics Software for Analysis of Fusion Power Plant Systems
Aleksander Dubas HPC, Big Data & Data Science 14:35 14:45
  Effect of kernel optimizations on HPC workloads performance
Alex Domingo HPC, Big Data & Data Science 14:45 14:55
  Multithreading in Python using OpenMP?
Dorian Ouakli HPC, Big Data & Data Science 15:00 15:25
  Mapping Applications to the Hardware Portably and Transparently
Edgar Leon HPC, Big Data & Data Science 16:00 16:25
  Job-specific performance monitoring on HPC clusters: Challenges and Solutions
Christian Iwainsky HPC, Big Data & Data Science 16:30 16:55