

Deploying Models as RESTful APIs using Kubeflow Pipelines and KFServing: A Step-by-Step Tutorial

Deploying machine learning models as RESTful APIs allows for easy integration with other applications and services. Kubeflow Pipelines provides a platform for building and deploying machine learning pipelines, while KFServing is an open-source project that simplifies the deployment of machine learning models as serverless inference services on Kubernetes. In this tutorial, we will explore how to deploy models as RESTful APIs using Kubeflow Pipelines and KFServing.


Before we begin, make sure you have the following installed and set up:

  • Kubeflow Pipelines
  • KFServing
  • Kubernetes cluster
  • Python 3.x
  • Docker

Building the Model and Pipeline

First, we need to build the machine learning model and create a pipeline to train and deploy it. For this tutorial, we will use a simple example of training and deploying a sentiment analysis model using the IMDb movie reviews dataset. We will use TensorFlow and Keras for model training.

# Import libraries
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

# Load the IMDb movie reviews dataset
imdb = keras.datasets.imdb
(train_data, train_labels), (test_data, test_labels) = imdb.load_data(num_words=10000)
# Preprocess the data: pad every review to 250 token ids
train_data = keras.preprocessing.sequence.pad_sequences(train_data, value=0, padding='post', maxlen=250)
test_data = keras.preprocessing.sequence.pad_sequences(test_data, value=0, padding='post', maxlen=250)
# Build the model
model = keras.Sequential([
    layers.Embedding(10000, 16),
    # Added so the model emits one prediction per review rather than one per token
    layers.GlobalAveragePooling1D(),
    layers.Dense(16, activation='relu'),
    layers.Dense(1, activation='sigmoid')
])
# Compile the model
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
# Train the model
model.fit(train_data, train_labels, epochs=10, batch_size=32, validation_data=(test_data, test_labels))
# Save the model
model.save('model.h5')
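
To make the preprocessing step concrete, the sketch below mimics in plain Python what pad_sequences does with the arguments used above: padding='post', value=0, and the library default of truncating long sequences from the front. The helper name pad_post is hypothetical and exists only for illustration; it is not part of Keras.

```python
def pad_post(sequence, maxlen, value=0):
    """Illustrative stand-in for pad_sequences(..., padding='post')."""
    # Keras truncates from the front by default (truncating='pre'),
    # so only the last `maxlen` tokens of a long review are kept.
    trimmed = list(sequence)[-maxlen:]
    # Short reviews are filled with `value` at the end.
    return trimmed + [value] * (maxlen - len(trimmed))


if __name__ == "__main__":
    print(pad_post([5, 8, 2], maxlen=5))           # short review: padded at the end
    print(pad_post([1, 2, 3, 4, 5, 6], maxlen=4))  # long review: front is dropped
```

Every review therefore reaches the embedding layer with the same length, which is what lets the model train on fixed-size batches.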

Defining the Deployment Pipeline

Next, we need to define the deployment pipeline using Kubeflow Pipelines. This pipeline will use KFServing to deploy the trained model as a RESTful API.

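import kfp
from kfp import dsl
from kubernetes.client import V1EnvVar

@dsl.pipeline(name='Sentiment Analysis Deployment', description='Deploy the sentiment analysis model as a RESTful API')
def sentiment_analysis_pipeline(model_dir: str, api_name: str, namespace: str):
    kfserving_op = kfp.components.load_component_from_file('kfserving_component.yaml')
    # Define the deployment task. The keyword names below are assumptions and
    # must match the inputs declared in kfserving_component.yaml.
    deployment_task = kfserving_op(
        action='apply',
        model_name=api_name,
        model_uri=model_dir,
        namespace=namespace,
    )
    # Pass the API name and namespace to the container as environment variables
    for env_var in (
            V1EnvVar(name='MODEL_NAME', value=api_name),
            V1EnvVar(name='NAMESPACE', value=namespace)
    ):
        deployment_task.container.add_env_variable(env_var)

if __name__ == '__main__':
    kfp.compiler.Compiler().compile(sentiment_analysis_pipeline, 'sentiment_analysis_pipeline.tar.gz')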

The pipeline definition includes a deployment task that uses the KFServing component to apply the model deployment. It specifies the model directory, API name, and Kubernetes namespace for the deployment.

Deploying the Model as a RESTful API

To deploy the model as a RESTful API, follow these steps:

Build a Docker image for the model:

docker build -t <registry>/<namespace>/sentiment-analysis-model:latest .

Push the Docker image to a container registry:

docker push <registry>/<namespace>/sentiment-analysis-model:latest

Create a YAML file for the KFServing configuration, e.g., kfserving.yaml:

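apiVersion: serving.kubeflow.org/v1alpha2  # assumed; match your KFServing release
kind: InferenceService
metadata:
  name: sentiment-analysis
spec:
  default:
    predictor:
      tensorflow:  # assumed predictor type for the Keras model above
        storageUri: <registry>/<namespace>/sentiment-analysis-model:latest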

Deploy the model as a RESTful API using KFServing:

kubectl apply -f kfserving.yaml

Access the RESTful API:

kubectl get inferenceservice sentiment-analysis

# Get the service URL
kubectl get inferenceservice sentiment-analysis -o jsonpath='{.status.url}'

With the model deployed as a RESTful API, you can now make predictions by sending HTTP requests to the service URL.
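
As a rough sketch of such a request, the snippet below builds a POST for the KFServing V1 prediction protocol, which expects a JSON body with an "instances" list at /v1/models/<name>:predict. The helper name build_predict_request and the example host are hypothetical, and depending on how your cluster's ingress is configured you may also need to set a Host header or authentication.

```python
import json
import urllib.request


def build_predict_request(service_url, model_name, instances):
    """Build (but do not send) a V1-protocol prediction request."""
    endpoint = f"{service_url.rstrip('/')}/v1/models/{model_name}:predict"
    body = json.dumps({"instances": instances}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical URL; use the value printed by the kubectl command above.
    request = build_predict_request(
        "http://sentiment-analysis.default.example.com",
        "sentiment-analysis",
        [[1, 14, 22, 16] + [0] * 246],  # one review, padded to 250 token ids
    )
    print(request.full_url)
    # To actually call the service:
    # with urllib.request.urlopen(request) as response:
    #     print(json.loads(response.read()))
```

The input must be preprocessed the same way as the training data, so each instance here is a list of 250 token ids.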

In this tutorial, we have explored how to deploy machine learning models as RESTful APIs using Kubeflow Pipelines and KFServing. We built a sentiment analysis model, defined a deployment pipeline using Kubeflow Pipelines, and used KFServing to deploy the model as a RESTful API on a Kubernetes cluster. This approach allows for easy integration of machine learning models into applications and services, enabling real-time predictions and inference.

By combining Kubeflow Pipelines and KFServing, you can streamline the process of training and deploying machine learning models as scalable and reliable RESTful APIs on Kubernetes. This enables efficient model management, deployment, and serving in production environments.

Achieving Scalability with Distributed Training in Kubeflow Pipelines

Distributed training is a technique for parallelizing machine learning tasks across multiple compute nodes or GPUs, enabling you to train models faster and handle larger datasets. Kubeflow Pipelines provides a robust platform for managing machine learning workflows, including distributed training. In this tutorial, we will guide you through implementing distributed training with TensorFlow and PyTorch in Kubeflow Pipelines using Python.


Step 1: Prepare Your Training Code

Before implementing distributed training in Kubeflow Pipelines, you need to prepare your TensorFlow or PyTorch training code for distributed execution. The official TensorFlow and PyTorch guides on distributed training describe how to do this.

Make sure your training code handles the distributed training aspects those guides cover before you move on.
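
One aspect distributed code typically has to handle is working out which process it is and how many peers it has. The sketch below reads the two conventions most commonly used for this: TensorFlow's TF_CONFIG environment variable (a JSON document with "cluster" and "task" entries) and the RANK and WORLD_SIZE variables used by PyTorch's environment-based initialization. The function name describe_task is hypothetical, and whether these variables are set depends on how your job is launched.

```python
import json
import os


def describe_task(environ=os.environ):
    """Return (role, index, worker_count) for the current process."""
    tf_config = environ.get("TF_CONFIG")
    if tf_config:
        # TensorFlow convention: JSON with "cluster" and "task" entries.
        config = json.loads(tf_config)
        workers = config.get("cluster", {}).get("worker", [])
        task = config.get("task", {})
        return task.get("type", "worker"), int(task.get("index", 0)), len(workers)
    # PyTorch convention: RANK and WORLD_SIZE; defaults describe a single process.
    return "worker", int(environ.get("RANK", 0)), int(environ.get("WORLD_SIZE", 1))


if __name__ == "__main__":
    print(describe_task())
```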

Step 2: Containerize Your Training Code

Once your training code is ready for distributed training, you need to containerize it using Docker. Create a Dockerfile that includes all the necessary dependencies and your training code. For example, if you are using TensorFlow, your Dockerfile may look like this:

FROM tensorflow/tensorflow:latest-gpu

COPY ./ /app/
ENTRYPOINT ["python", ""]

Build and push the Docker image to a container registry, such as Docker Hub or Google Container Registry:

docker build -t your_registry/your_image_name:latest .
docker push your_registry/your_image_name:latest

Step 3: Define a Component for Distributed Training

In your Python script, import the necessary libraries and define a component that uses your training container image:

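import kfp
from kfp import dsl

def distributed_training_op(num_workers: int):
    return dsl.ContainerOp(
        name="Distributed Training",
        # The image built and pushed in Step 2
        image="your_registry/your_image_name:latest",
        arguments=[
            "--num_workers", num_workers,
        ],
    )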

Step 4: Implement a Pipeline for Distributed Training

Now, create a pipeline that uses the distributed_training_op component:

@dsl.pipeline(
    name="Distributed Training Pipeline",
    description="A pipeline that demonstrates distributed training with TensorFlow and PyTorch."
)
def distributed_training_pipeline(num_workers: int = 4):
    distributed_training = distributed_training_op(num_workers)

if __name__ == "__main__":
    kfp.compiler.Compiler().compile(distributed_training_pipeline, "distributed_training_pipeline.yaml")

This pipeline takes the number of workers as a parameter and calls the distributed_training_op component with the specified number of workers.
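
On the container side, the training script has to read that value from its command line. A minimal sketch of such an entry point is shown here; only the --num_workers flag comes from the component definition, while the default of 1 and the function name parse_args are assumptions made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the flags passed by the pipeline component."""
    parser = argparse.ArgumentParser(description="Distributed training entry point")
    parser.add_argument("--num_workers", type=int, default=1,
                        help="number of workers taking part in training")
    # parse_known_args ignores any extra flags the launcher may append.
    args, _unknown = parser.parse_known_args(argv)
    return args


if __name__ == "__main__":
    args = parse_args()
    print(f"Training with {args.num_workers} worker(s)")
```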

Step 5: Upload and Run the Pipeline

In this tutorial, we covered how to implement distributed training with TensorFlow and PyTorch in Kubeflow Pipelines using Python. With distributed training, you can scale up your machine learning workflows and train models faster, handle larger datasets, and improve the overall efficiency of your ML experiments. As you continue to work with Kubeflow Pipelines, you can explore other advanced features to further enhance your machine learning workflows.

Building Your First Kubeflow Pipeline: A Simple Example

Kubeflow Pipelines is a powerful platform for building, deploying, and managing end-to-end machine learning workflows. It simplifies the process of creating and executing ML pipelines, making it easier for data scientists and engineers to collaborate on model development and deployment. In this tutorial, we will guide you through building and running a simple Kubeflow Pipeline using Python.


  1. Familiarity with Python programming

Step 1: Install Kubeflow Pipelines SDK

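First, you need to install the Kubeflow Pipelines SDK on your local machine. The code in this tutorial uses dsl.ContainerOp, which belongs to the 1.x SDK and was removed in 2.0, so pin the version. Run the following command in your terminal or command prompt:

pip install "kfp<2"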

Step 2: Create a Simple Pipeline in Python

Create a new Python script and add the following code:

import kfp
from kfp import dsl

def load_data_op():
    return dsl.ContainerOp(
        name="Load Data",
        image="alpine",  # assumed image; any small image with a shell works
        command=["sh", "-c"],
        arguments=["echo 'Loading data' && sleep 5"],
    )

def preprocess_data_op():
    return dsl.ContainerOp(
        name="Preprocess Data",
        image="alpine",  # assumed image
        command=["sh", "-c"],
        arguments=["echo 'Preprocessing data' && sleep 5"],
    )

def train_model_op():
    return dsl.ContainerOp(
        name="Train Model",
        image="alpine",  # assumed image
        command=["sh", "-c"],
        arguments=["echo 'Training model' && sleep 5"],
    )

@dsl.pipeline(
    name="My First Pipeline",
    description="A simple pipeline that demonstrates loading, preprocessing, and training steps."
)
def my_first_pipeline():
    # .after() makes each step wait for the previous one to finish
    load_data = load_data_op()
    preprocess_data = preprocess_data_op().after(load_data)
    train_model = train_model_op().after(preprocess_data)

if __name__ == "__main__":
    kfp.compiler.Compiler().compile(my_first_pipeline, "my_first_pipeline.yaml")

This Python script defines a simple pipeline with three steps: loading data, preprocessing data, and training a model. Each step is defined as a function that returns a ContainerOp object, which represents a containerized operation in the pipeline. The @dsl.pipeline decorator is used to define the pipeline, and the kfp.compiler.Compiler().compile() function is used to compile the pipeline into a YAML file.
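
The .after() calls in the pipeline function only declare ordering; the backend decides when each container actually starts. As a toy model of that idea (this is not how Kubeflow schedules steps internally, and the step names are illustrative), the dependencies can be written as a graph and sorted with the standard library:

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Each key lists the steps it must run after, mirroring the .after() calls.
dependencies = {
    "load-data": set(),
    "preprocess-data": {"load-data"},
    "train-model": {"preprocess-data"},
}


def execution_order(deps):
    """Return one valid run order for the given dependency graph."""
    return list(TopologicalSorter(deps).static_order())


if __name__ == "__main__":
    print(execution_order(dependencies))
```

Steps with no dependency between them could run in parallel, which is why a pipeline is described as a graph and not as a simple list of commands.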

Step 3: Upload and Run the Pipeline

  1. Click on the “Pipelines” tab in the left-hand sidebar.
  2. Click the “Upload pipeline” button in the upper right corner.
  3. In the “Upload pipeline” dialog, click “Browse” and select the my_first_pipeline.yaml file generated in the previous step.
  4. Click “Upload” to upload the pipeline to the Kubeflow platform.
  5. Once the pipeline is uploaded, click on its name to open the pipeline details page.
  6. Click the “Create run” button to start a new run of the pipeline.
  7. On the “Create run” page, you can give your run a name and choose a pipeline version. Click “Start” to begin the pipeline run.
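
The same upload-and-run steps can also be scripted with the SDK client instead of the UI. The sketch below assumes the Pipelines API is reachable at the host you pass in (for example through a port-forward) and that no extra authentication is required; the function name upload_and_run and the run name are hypothetical.

```python
def upload_and_run(host, package_path="my_first_pipeline.yaml"):
    """Submit a compiled pipeline package and return the created run."""
    import kfp  # imported lazily so this module loads without the SDK installed

    client = kfp.Client(host=host)
    return client.create_run_from_pipeline_package(
        package_path,
        arguments={},  # this pipeline takes no parameters
        run_name="my-first-run",
    )


# Example call (hypothetical address; replace it with your own endpoint):
# upload_and_run("http://localhost:8080")
```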

Step 4: Monitor the Pipeline Run

After starting the pipeline run, you will be redirected to the “Run details” page. Here, you can monitor the progress of your pipeline, view the logs for each step, and inspect the output artifacts.

  1. To view the logs for a specific step, click on the step in the pipeline graph and then click the “Logs” tab in the right-hand pane.
  2. To view the output artifacts, click on the step in the pipeline graph and then click the “Artifacts” tab in the right-hand pane.

Congratulations! You have successfully built and executed your first Kubeflow Pipeline using Python. You can now experiment with more complex pipelines, integrate different components, and optimize your machine learning workflows.

With Kubeflow Pipelines, you can automate your machine learning workflows, making it easier to build, deploy, and manage complex ML models. Now that you have a basic understanding of how to create and run pipelines in Kubeflow, you can explore more advanced features and build more sophisticated pipelines for your own projects.

Kubeflow Pipelines: A Step-by-Step Guide

Kubeflow Pipelines is a platform for building, deploying, and managing end-to-end machine learning workflows. It streamlines the process of creating and executing ML pipelines, making it easier for data scientists and engineers to collaborate on model development and deployment. In this tutorial, we will guide you through the process of setting up Kubeflow Pipelines on your local machine using MiniKF and running a simple pipeline in Python.


Step 1: Install Vagrant

First, you need to install Vagrant on your machine. Follow the installation instructions for your operating system in the official Vagrant documentation.

Step 2: Set up MiniKF

Now, let’s set up MiniKF (Mini Kubeflow) on your local machine. MiniKF is a lightweight version of Kubeflow that runs on top of VirtualBox using Vagrant. It is perfect for testing and development purposes.

Create a new directory for your MiniKF setup and navigate to it in your terminal:

mkdir minikf
cd minikf

Initialize the MiniKF Vagrant box by running:

vagrant init arrikto/minikf

Start the MiniKF virtual machine:

vagrant up

This process will take some time, as Vagrant downloads the MiniKF box and sets up the virtual machine.

Step 3: Access the Kubeflow Dashboard

After the virtual machine is up and running, you can access the Kubeflow dashboard in your browser. Open the following URL: You will be prompted to log in with a username and password. Use admin as both the username and password.

Step 4: Create a Simple Pipeline in Python

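Now, let's create a simple pipeline in Python that reads some data, processes it, and outputs the result. First, install the Kubeflow Pipelines SDK. The code below uses dsl.ContainerOp, which belongs to the 1.x SDK and was removed in 2.0, so pin the version:

pip install "kfp<2"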

Create a new Python script and add the following code:

import kfp
from kfp import dsl

def read_data_op():
    return dsl.ContainerOp(
        name="Read Data",
        image="alpine",  # assumed image; any small image with a shell works
        command=["sh", "-c"],
        arguments=["echo 'Reading data' && sleep 5"],
    )

def process_data_op():
    return dsl.ContainerOp(
        name="Process Data",
        image="alpine",  # assumed image
        command=["sh", "-c"],
        arguments=["echo 'Processing data' && sleep 5"],
    )

def output_data_op():
    return dsl.ContainerOp(
        name="Output Data",
        image="alpine",  # assumed image
        command=["sh", "-c"],
        arguments=["echo 'Outputting data' && sleep 5"],
    )

@dsl.pipeline(
    name="Simple Pipeline",
    description="A simple pipeline that reads, processes, and outputs data."
)
def simple_pipeline():
    # .after() makes each step wait for the previous one to finish
    read_data = read_data_op()
    process_data = process_data_op().after(read_data)
    output_data = output_data_op().after(process_data)

if __name__ == "__main__":
    kfp.compiler.Compiler().compile(simple_pipeline, "simple_pipeline.yaml")

This Python script defines a simple pipeline with three steps: reading data, processing data, and outputting data. Each step is defined as a function that returns a ContainerOp object, which represents a containerized operation in the pipeline. The @dsl.pipeline decorator is used to define the pipeline, and the kfp.compiler.Compiler().compile() function is used to compile the pipeline into a YAML file.

Step 5: Upload and Run the Pipeline

Now that you have created a simple pipeline in Python, let’s upload and run it on the Kubeflow Pipelines platform.

Step 6: Monitor the Pipeline Run

After starting the pipeline run, you will be redirected to the “Run details” page. Here, you can monitor the progress of your pipeline, view the logs for each step, and inspect the output artifacts.

Congratulations! You have successfully set up Kubeflow Pipelines on your local machine, created a simple pipeline in Python, and executed it using the Kubeflow platform. You can now experiment with more complex pipelines, integrate different components, and optimize your machine learning workflows.

With Kubeflow Pipelines, you can automate your machine learning workflows, making it easier to build, deploy, and manage complex ML models. Now that you have a basic understanding of how to create and run pipelines in Kubeflow, you can explore more advanced features and build more sophisticated pipelines for your own projects.