Главная
Study mode:
on
1
- Introductions
2
- How I Met Wes McKinney
3
- Timeline of Open Source Data Science at TS
4
- Featurization Challenges
5
- About Wes McKinney
6
- Apache Arrow
7
- Ibis
8
- Substrait
9
- One Data Science Interface; Many Data Engines
10
- Look Ahead
Description:
Explore a collaborative effort between Two Sigma and Voltron Data to enhance featurization workflow performance using Ibis, Substrait, and Apache Arrow in this 31-minute conference talk. Learn about the evolution of open-source data science at Two Sigma, featurization challenges, and the key components of this powerful software stack. Dive into Apache Arrow's high-performance data representation, Ibis' high-level APIs for data processing and analysis, and Substrait's machine learning framework. Discover how this integrated solution enables real-time streaming data processing, providing fast and accurate insights for decision-making. Gain valuable knowledge about the future of data science interfaces and their potential to work with multiple data engines.

Streaming Featurization with Ibis, Substrait and Apache Arrow

Open Data Science
Add to list
0:00 / 0:00