Speaker: Henry Ehrenberg, Co-Founder at Snorkel AI
Snorkel AI recently adopted Arrow to help power Snorkel Flow, their data-centric development platform which helps enterprise data science teams build high quality training datasets and ML models quickly. In this talk, Henry will cover how the Snorkel AI team evolved their data and compute architecture to leverage Arrow. With Arrow under the hood, Snorkel Flow users saw 5x speedups on key data labeling operations, significantly improved UX working with complex data types like PDF documents, and better resource utilization through optimized horizontal scaling.