What can PyArrow do for you - Array interchange, storage, compute and transport
- Track: Python
- Room: UD2.218A
- Day: Sunday
- Start: 11:00
- End: 11:30
- Video only: ud2218a
- Chat: Join the conversation!
PyArrow is a powerful tool for Python developers seeking high-performance data processing and interchange. This talk will provide a pragmatic overview of some of PyArrow's capabilities, demonstrating data interchange, storage, manipulation and transport using a single Python library.
We'll explore four key capabilities:
Array Interchange: Seamless data exchange between NumPy, pandas, and other libraries using zero-copy Storage: Efficient serialization and file format support (Parquet, ORC, Feather) with advanced compression Compute: High-performance in-memory computation and data transformation capabilities Transport: Leveraging Arrow Flight RPC for distributed data movement and processing
Speakers
Rok Mihevc | |
Alenka |