4.9 • 848 Ratings
🗓️ 29 January 2022
⏱️ 69 minutes
🧾️ Download transcript
Machine learning pipeline orchestration tools, such as SageMaker and Kubeflow, streamline the end-to-end process of data ingestion, model training, deployment, and monitoring, with Kubeflow providing an open-source, cross-cloud platform built atop Kubernetes. Organizations typically choose between cloud-native managed services and open-source solutions based on required flexibility, scalability, integration with existing cloud environments, and vendor lock-in considerations.
Dirk-Jan Verdoorn - Data Scientist at Dept Agency
Click on a timestamp to play from that location
0:00.0 | Welcome back to machine learning applied. In this episode, I'm talking to Dirk from Dept Agency |
0:10.5 | about Cube Flow, K-U-B-E-F-L-O-W. It is an extension on Kubernetes for machine learning pipeline |
0:20.8 | orchestration. |
0:21.6 | In the last few episodes, we talked about SageMaker for accomplishing the same thing. |
0:26.6 | And then in the DevOps episode, we talked about just general DevOps deployment of your web stack to the cloud. |
0:33.6 | And if you'll recall, there are solutions for hosting web stacks in the cloud like |
0:38.3 | AWS, ECS, and then there's the open source equivalent to that called Kubernetes. |
0:44.6 | And an AWS service called EKS, or Elastic Kubernetes service, allows you to use Kubernetes |
0:50.4 | on AWS. |
0:51.2 | So you can either use ECS or ECS. |
0:53.7 | ECS is going to be using AWS as a managed |
0:56.1 | service for hosting your Docker containers. And so it's going to be easier to use. An EKS, or |
1:01.8 | Elastic Kubernetes service, is going to allow you to host your Kubernetes cluster on AWS. The benefit |
1:08.0 | of Kubernetes is that it is open source and cross platform. You can host your Kubernetes cluster on AWS, GCP, or Azure. Setting it up in those various providers is going to be a bit different. So it's not just a turnkey, one size fits all, but it's going to be more cross cloud provider compatible than if you had set up your cluster on ECS. So AWS ECS, their managed container |
1:31.9 | hosting solution is easier, but there's vendor lock-in. And then E-KS, their managed Kubernetes |
1:39.2 | solution is harder. It's more complex for certain, as you recall, the discussion with Giroat, |
1:45.7 | but it is open source and it is general case, cross-cloud compatible, and you can use it on |
1:51.0 | local host or on-prem. And so the equivalent in the machine learning space is Amazon SageMaker |
1:57.6 | for their hosted machine learning offerings versus Cubeflow and extension on top of |
2:03.4 | Kubernetes for the open source cross cloud compatible version of the same. And before we get into |
2:09.9 | this interview, I just want to talk about what these things are, these pipeline orchestration |
2:15.0 | tools, because SageMaker is not just about hosting machine learning model. |
... |
Please login to see the full transcript.
Disclaimer: The podcast and artwork embedded on this page are from OCDevel, and are the property of its owner and not affiliated with or endorsed by Tapesearch.
Generated transcripts are the property of OCDevel and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.
Copyright © Tapesearch 2025.