In this simple tutorial we will run one of the public examples of Kubeflow (the flip coin pipeline) using K3ai. We assume the following:
- The machine to be used is running Linux
- The machine does not have anything installed if not just the basic Operating System
K3ai uphas been executed. If not please take a look at the quick-start documentation page.
Step1 - Deploy the infrastructure
First we have to deploy a new cluster. For this tutorial we will use Rancher K3S. We selected K3s due its light footprint so it adapt better to, almost, any environment.
We will type:
k3ai cluster deploy -t k3s -n myk3scluster
Currently (v1.0.1) we do not support Rancher K3s deployment on WSL2 but you may use K3d instead.
Once the cluster will be deployed we may proceed to the step 2.
Step 2 - Deploy Kubeflow Pipelines (Argo) on the cluster
K3ai allow the user to deploy any plugin on any deployed target cluster in one single command.
k3ai plugin deploy -n kf-pa -t myk3scluster
The deployment will publish the URL where to connect to the UI of KFP.
Please not that not all clusters supported by K3ai are currently able to expose the URL correctly. If the URL is not reachable please consult the platform documentation to learn how to expose it (typically through a port-forward). In v1.0.1 we will make this automatic
Now let's move to the last step.
Step 3 - Run the training and track the result
We will run a copy of the flip coing pipeline example from here
k3ai run -s https://github.com/k3ai/quickstart/kfp -b kfp -e condition.py -t mycluster
wait for the deployment to complete and check the results on the Kubeflow UI.
Done, you completed the Kubeflow tutorial on how to run a simple training with K3ai and MLFLow. Now take your examples and have fun!