Metaflow
Open-source ML platform from Netflix for building and managing real-life data science projects.
Open-source ML platform from Netflix for building and managing real-life data science projects.

Workflow orchestration
Define multi-step ML pipelines using Python, with automatic dependency management and parallel execution
Data versioning
Built-in tracking of datasets and artefacts throughout your workflow to ensure reproducibility
Local-to-cloud scaling
Develop workflows locally and run them on cloud infrastructure (AWS, Google Cloud, Azure) without code changes
Integration with existing tools
Works alongside popular ML frameworks and data tools rather than replacing them
Automatic job scheduling
Deploy workflows to run on schedules or trigger them from external systems
Experiment tracking
Record parameters, metrics, and outputs from each workflow run for comparison and analysis
Building production ML pipelines that run daily or weekly on fresh data
Managing complex workflows involving data preprocessing, model training, validation, and deployment steps
Running A/B tests and experiments with different model configurations whilst tracking all parameters and results
Coordinating multi-team ML projects where reproducibility and audit trails matter
Migrating existing ad-hoc ML scripts into reliable, scheduled workflows