Eldar Kurtić
Eldar Kurtić is a Principal Research Scientist at Red Hat and the Institute of Science and Technology Austria (ISTA), specializing in efficient inference techniques for large language models (LLMs), with a particular focus on pruning, quantization, and speculative decoding. His work centers on developing methods to accelerate inference within the vLLM engine, bridging cutting-edge research with practical deployment solutions. Outside of his primary research, Eldar enjoys finding and fixing bugs in large-scale machine learning projects, contributing to the robustness of open-source AI ecosystems.
Events
| Title | Day | Room | Track | Start | End |
|---|---|---|---|---|---|
| Accelerating vLLM Inference with Quantization and Speculative Decoding | Sunday | AW1.120 | Open Research | 11:00 | 11:30 |