We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Vino Duraisamy - From raw data to interactive data app in an hour | PyData Global 2023
Learn how to build end-to-end ML workflows in Snowflake using Snowpark's Python capabilities, from data processing to model deployment and interactive visualization.
- 
    
Snowpark enables end-to-end ML workflows within Snowflake by allowing Python, Java and Scala code execution alongside SQL
 - 
    
Key components include DataFrame API for data engineering and ML Modeling API for machine learning tasks, removing need for external compute environments
 - 
    
Model Registry provides versioning and deployment capabilities for ML models, with metadata tracking and easy deployment options
 - 
    
Zero-copy cloning allows multiple teams to work on the same data without creating redundant copies, maintaining data governance
 - 
    
Pre-processing and feature engineering can be done using familiar APIs similar to scikit-learn (ordinal encoding, one-hot encoding, scaling etc.)
 - 
    
Supports popular ML libraries like scikit-learn, XGBoost while optimizing execution through Snowflake’s compute engine
 - 
    
Eliminates data silos by keeping entire workflow within Snowflake’s security boundary instead of moving data between environments
 - 
    
Model deployment is simplified to a two-step process: logging the model and deploying it for inference
 - 
    
Provides standardized environments for both development and production through container services and GPU-enabled compute options
 - 
    
Interactive visualization apps can be built using Streamlit integration for model monitoring and results display