We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Juan Luis Cano Rodríguez - Who needs ChatGPT? Rock solid AI pipelines with Hugging Face and Kedro
Learn how to build production-ready ML pipelines by combining Kedro's software engineering best practices with Hugging Face's state-of-the-art AI models and tools.
-
Kedro is an open source framework that applies software engineering best practices to data science and ML pipelines, helping transition from experiments to production
-
The framework decouples I/O operations from computation by separating datasets (inputs/outputs) from nodes (computation steps), making pipelines more maintainable
-
Kedro projects follow a standardized template structure, with clean separation between configuration, data, notebooks and source code
-
The data catalog provides a declarative way to define datasets and their locations (local, S3, databases etc.), abstracting away data access details
-
Pipelines are defined as directed acyclic graphs (DAGs) of nodes, with clear dependencies between computation steps
-
Kedro integrates with major orchestration platforms like Airflow, Argo, Kubeflow while remaining orchestrator-agnostic
-
The framework supports experiment tracking and can connect with MLflow through plugins
-
Kedro is extensible through hooks and plugins, following similar patterns to tools like PyTest
-
While not a full MLOps solution, Kedro focuses on providing solid foundations for building maintainable ML pipelines
-
The project is now part of Linux Foundation AI & Data with multiple stakeholders including McKinsey, Societe General and others