Beyond Deployment: Exploring Machine Learning Inference Architectures and Patterns
🔊 Recorded at PyCon DE & PyData Berlin 2024, 24.04.2024
https://2024.pycon.de/program/ZLDMGM/

🎓 Watch as Tim Elfrink explores machine learning inference architectures and patterns: how to set up robust systems for high-throughput predictions, practical strategies for MLOps on cloud platforms, and real-life examples from StepStone's production systems.

Speakers: Tim Elfrink

Description: Tim Elfrink, a Staff Machine Learning Engineer at StepStone, delivered a talk on machine learning inference architectures and patterns. The session focused on setting up robust, scalable ML systems that serve high-throughput real-time predictions to large user bases.

Tim discussed several prediction methods, including real-time, asynchronous, and batch processing, along with their pros and cons and the importance of selecting the right method for each use case. Using examples from StepStone's production systems, he illustrated how to build systems that handle thousands of simultaneous requests with low-latency, reliable predictions.

The talk emphasized the technical side of running these systems efficiently, using real-life examples in an easily understandable format. Participants learned about different ML setups that improve inference speed, cost-efficiency, and reliability. The presentation also addressed the key challenges of ML deployment and management, and presented StepStone's infrastructure as a model for managing large workloads and complex models, such as large language models, while delivering fast, cost-effective, and dependable results.

Overall, the talk covered ML inference patterns and MLOps strategies, with practical implementation examples from StepStone's real-world applications.
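For readers who want a concrete picture of the three prediction methods named in the description, here is a minimal Python sketch. It is an illustration under stated assumptions, not code from the talk or from StepStone's systems: the `predict` function is a hypothetical stand-in for a trained model, and `AsyncPredictor` uses an in-process queue where a production setup would typically use a message broker and a result store.

```python
import queue
import threading
from typing import Dict, List


def predict(x: float) -> float:
    """Hypothetical stand-in for a trained model: doubles its input."""
    return 2.0 * x


def realtime_predict(x: float) -> float:
    """Real-time: the caller blocks until the prediction comes back."""
    return predict(x)


class AsyncPredictor:
    """Asynchronous: the caller submits a request, gets a job id at once,
    and a background worker stores the result for later lookup."""

    def __init__(self) -> None:
        self._jobs = queue.Queue()
        self.results: Dict[int, float] = {}
        self._next_id = 0
        threading.Thread(target=self._worker, daemon=True).start()

    def submit(self, x: float) -> int:
        job_id = self._next_id
        self._next_id += 1
        self._jobs.put((job_id, x))
        return job_id

    def _worker(self) -> None:
        while True:
            job_id, x = self._jobs.get()
            self.results[job_id] = predict(x)
            self._jobs.task_done()

    def wait_all(self) -> None:
        """Block until every submitted job has been processed."""
        self._jobs.join()


def batch_predict(xs: List[float]) -> List[float]:
    """Batch: score a whole dataset offline in a single pass."""
    return [predict(x) for x in xs]


if __name__ == "__main__":
    print(realtime_predict(3.0))

    predictor = AsyncPredictor()
    job = predictor.submit(4.0)
    predictor.wait_all()
    print(predictor.results[job])

    print(batch_predict([1.0, 2.0, 3.0]))
```

The trade-off the sketch hints at: the real-time path has the lowest latency per request but ties up the caller, the asynchronous path decouples callers from model speed at the cost of a second lookup, and the batch path maximizes throughput when results are not needed immediately.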
⭐️ About PyCon DE & PyData Berlin: The PyCon DE & PyData conference unites the Python, AI, and data science communities, offering a platform for collaboration and innovation. The PyCon DE & PyData Berlin 2024 conference, hosted in partnership with the local Berlin PyData chapter, fostered deeper connections within the Python community while showcasing advancements in AI and data science. Attendees enjoyed a diverse and engaging program, solidifying the event as a highlight for Python and AI enthusiasts nationwide.

Follow us:
• LinkedIn: https://www.linkedin.com/company/28908640/
• X: https://www.x.com/pyconde
• X: https://www.x.com/pydataberlin

Links:
• Conference website: http://pycon.de
• Related sessions: http://2024.pycon.de/program/categories/pycon-mlops-devops

The conference is organized by
• Python Softwareverband e.V.: http://pysv.org
• NumFOCUS Inc.: http://numfocus.org
• Pioneers Hub gemeinnützige GmbH: http://pioneershub.org

If you enjoyed this session, please like, comment, and subscribe to our channel for more talks and discussions. Share this video with your network to spread the knowledge!

Hashtags: #Python #PyConDE #PyData #OpenSource #AI #DataScience #MachineLearning #SoftwareDevelopment #LLMs #Community

Acknowledgements: Special thanks to all the volunteers and sponsors who made this event possible.

About:
• Python Softwareverband e.V. (PySV) is a non-profit that promotes the use and development of Python in Germany through events, education, and advocacy, fostering an open Python community.
• NumFOCUS Inc. supports open-source scientific computing by providing financial and logistical support to key projects like NumPy and Jupyter, promoting sustainable development and collaboration.
• Pioneers Hub gemeinnützige GmbH is a non-profit fostering innovation in AI and tech by connecting experts and promoting knowledge exchange through events and collaborative initiatives.