FOSDEM 2025
/
Schedule
/
Events
/
Developer rooms
/
Low-level AI Engineering and Hacking
/
Expanding GGML Hardware Support using the Vulkan API

Expanding GGML Hardware Support using the Vulkan API

Track: Low-level AI Engineering and Hacking
Room: UB2.252A (Lameere)
Day: Sunday
Start: 15:40
End: 16:00
Video only: ub2252a
Chat: Join the conversation!

Most machine learning applications are accelerated using vendor-specific APIs like CUDA and ROCm. While alternatives like OpenCL and SYCL exist, they are not as well-supported. What if we could harness the broad driver support that is being put into gaming and use Vulkan compute shaders instead? In this talk, I will present advantages and disadvantages of this approach and the difficulties I had to overcome to create a Vulkan API backend for llama.cpp.

Speakers

Ruben Ortlam

Attachments

Slides

fosdem-2025

Brussels / 1 & 2 February 2025

Expanding GGML Hardware Support using the Vulkan API

Speakers

Attachments

Links