Google Cloud Platform (GCP) Cloud Run Function Model Inference

🚧 Bump version 🚀 Push Docker image to GCP Artifact Registry 🛸 GCP Cloud Run Deploy 🛰️ GCP Cloud Run Delete

A cloud function to invoke a prediction against a machine learning model that has been trained outside a cloud provider, using tools like MLflow. This repository does not contain the model artifact itself, only the code for the cloud function.

FastAPI is used for the cloud function as it offers many features, e.g. authentication and body validation, and is overall easy to use and maintain. Note that not all GCP resources are created within this repository; some resources are created via Terraform in the repository terraform-gcp-model-serving.
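
As a hedged illustration (not the repository's actual code), the body-validation pattern looks roughly like this; the WineFeatures model and field aliases are assumptions based on the example payload shown later in this README:

    # Minimal sketch of the FastAPI pattern, assuming pydantic aliases map
    # payload keys containing spaces onto Python identifiers. The route is
    # shown as /predict/ purely for illustration.
    from fastapi import FastAPI
    from pydantic import BaseModel, Field

    app = FastAPI()

    class WineFeatures(BaseModel):
        alcohol: float
        fixed_acidity: float = Field(alias="fixed acidity")
        # ...remaining wine features follow the same alias pattern

    @app.post("/predict/")
    def predict(features: list[WineFeatures]) -> list[float]:
        # Placeholder score; the real handler feeds the rows to the model.
        return [0.0 for _ in features]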

Disclaimer

There are many options for serving a model; this project aims to demonstrate one. Each machine learning (ML) project is different, and other approaches that could be considered for this project include the following:

  1. Include the model artifact in the docker image, instead of downloading it from a Google Cloud Platform (GCP) bucket,
  2. Utilize GCP Cloud Run volumes, storing the model artifact within a volume and loading the model into memory,
  3. Instead of having to upload the model artifact each time, set the default artifact root to the GCP bucket.¹

Architecture

(Diagram: proposed model inference architecture)

  1. The machine learning model is trained outside GCP and the model artifact output is created,
  2. A user makes a request to an HTTP endpoint for a prediction,
  3. The model artifact is stored within a bucket; when the function is invoked, the model is downloaded,
  4. The prediction is returned via an HTTP response.
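
For step 3, a minimal sketch of the download-on-invocation idea using the google-cloud-storage client; the blob name model.pkl and the /tmp destination are assumptions, not the repository's actual code:

    # Sketch: fetch the model artifact from the GCS bucket at startup.
    import os
    import pickle

    from google.cloud import storage  # pip install google-cloud-storage

    def download_model(bucket_name: str) -> object:
        client = storage.Client()
        destination = "/tmp/model.pkl"  # assumed local path
        client.bucket(bucket_name).blob("model.pkl").download_to_filename(destination)
        with open(destination, "rb") as fh:
            # Unpickling the mlflow-example model requires scikit-learn installed.
            return pickle.load(fh)

    model = download_model(os.environ["GCP_MLFLOW_MODEL_ARTIFACT_BUCKET_NAME"])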

Training using MLflow

As with all machine learning projects, your mileage may vary (YMMV). This project re-uses the existing wine quality data set from mlflow-example for demonstration. The provided docker-compose.yml file will create the necessary resources needed to train locally. The following steps will start the services:

  1. Start the mlflow server, postgres and minio:
docker compose up -d --build
  2. Access the MLflow UI at http://localhost:5001
  3. Access the MinIO UI at http://localhost:9000*
  4. Next, start a training job within the mlflow_server container using the CLI:
docker exec mlflow_server mlflow run https://github.com/mlflow/mlflow-example.git -P alpha=0.42
  5. Once training has started, you should be able to view the run under the 'Experiments' tab in the MLflow UI
  6. You can download model.pkl via the UI under the 'Artifacts' tab, but it should also be available locally under /mlartifacts/
  7. Copy the model artifact model.pkl to the root directory of the project, ready to be used locally in FastAPI

* Log in with the credentials used in docker-compose.yml.
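
Before wiring the copied artifact into FastAPI, a quick sanity check (illustrative, not part of the repository) confirms the pickle loads; scikit-learn must be installed to unpickle the mlflow-example model:

    # Load the copied artifact and inspect it; feature_names_in_ is recorded
    # by scikit-learn when the model was fitted on a pandas DataFrame.
    import pickle

    with open("model.pkl", "rb") as fh:
        model = pickle.load(fh)

    print(type(model))  # e.g. sklearn.linear_model.ElasticNet
    print(getattr(model, "feature_names_in_", "no feature names recorded"))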

Note

Because MLflow runs from a custom docker image, passing the --build flag will cause the docker image to be re-built each time, which is helpful when amending .mlflow/requirements.txt. A re-build is not needed every time; if no changes are being made to the file, --build can be omitted from the command.

Running FastAPI

The following environment variables need to be set before attempting to run the application:

| Environment variable name | Description | Default | Required |
| ------------------------- | ----------- | ------- | -------- |
| GCP_MLFLOW_MODEL_ARTIFACT_BUCKET_NAME | The GCP bucket name where the model artifact has been uploaded | N/A | Yes |
| USE_LOCAL_FILE_PATH_MODEL | Use the model artifact found locally, rather than fetching from the GCP bucket | False | No |
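
A minimal sketch of how these could be read at startup (the actual application may differ):

    # Read the two settings from the environment; the boolean parsing shown
    # here is an assumption about how USE_LOCAL_FILE_PATH_MODEL is interpreted.
    import os

    BUCKET_NAME = os.environ["GCP_MLFLOW_MODEL_ARTIFACT_BUCKET_NAME"]  # required
    USE_LOCAL_MODEL = os.environ.get("USE_LOCAL_FILE_PATH_MODEL", "False").lower() == "true"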

The following steps will start the FastAPI service locally:

  1. Install python packages used for the service:

    pip install -r requirements.txt
  2. Run the FastAPI server, which will start on port 8000:

    python main.py

Endpoint documentation is available at: http://127.0.0.1:8000/docs

Prediction with FastAPI

The application exposes a single /predict/* endpoint, which allows the user to send a list of the quantitative features needed to predict wine quality. An example payload for predicting the quality of one wine can be found below:

[
  {
    "alcohol": 12.8,
    "chlorides": 0.029,
    "citric acid": 0.48,
    "density": 0.98,
    "fixed acidity": 6.2,
    "free sulfur dioxide": 29,
    "pH": 3.33,
    "residual sugar": 1.2,
    "sulphates": 0.39,
    "total sulfur dioxide": 75,
    "volatile acidity": 0.66
  }
]

This will return an HTTP 200 Successful response and a score, e.g. [3.6182495833379846].
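
For example, a client call from Python (assuming the server from the previous section is running locally on port 8000; the exact route may differ):

    # Post the example payload to the local FastAPI server and print the score.
    import requests

    payload = [{
        "alcohol": 12.8, "chlorides": 0.029, "citric acid": 0.48,
        "density": 0.98, "fixed acidity": 6.2, "free sulfur dioxide": 29,
        "pH": 3.33, "residual sugar": 1.2, "sulphates": 0.39,
        "total sulfur dioxide": 75, "volatile acidity": 0.66,
    }]
    response = requests.post("http://127.0.0.1:8000/predict/", json=payload)
    print(response.status_code, response.json())  # e.g. 200 [3.6182495833379846]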

GitHub Action (CI/CD)

The GitHub Action "🚀 Push Docker image to GCP Artifact Registry" will check out the repository and push a docker image to the chosen GCP Artifact Registry using the setup-gcloud action. The following repository secrets need to be set:

| Secret | Description |
| ------ | ----------- |
| GCP_GITHUB_SERVICE_ACCOUNT_KEY | The JSON private key for the GitHub service account |
| GCP_PREDICTION_SERVICE_ACCOUNT_KEY_BASE64 | The JSON private key, base64 encoded, for the prediction service account |

Important

The GCP_PREDICTION_SERVICE_ACCOUNT_KEY_BASE64 must be base64 encoded; this can be done with the following command: base64 -i <service_account>.json -o prediction_service_account.base64. During the workflow run it will be decoded and used as part of the docker image build.

Additionally, the following variables need to be set:

| Variable | Description |
| -------- | ----------- |
| GCP_PROJECT_ID | The GCP Project ID |
| GCP_REGION | The region that the project is in |
| GCP_REGISTRY_REPOSITORY_NAME | The Artifact Registry repository name |

Secondly, the GitHub Action "🛸 GCP Cloud Run Deploy" will check out the repository and deploy the cloud run function, utilizing the same GitHub action mentioned above. The following repository variable needs to be set:

| Variable | Description |
| -------- | ----------- |
| GCP_MLFLOW_MODEL_ARTIFACT_BUCKET_NAME | The GCP bucket name where the model artifact has been uploaded |

Lastly, the GitHub Action "🛰️ GCP Cloud Run Delete" will check out the repository and delete the cloud run function, utilizing the same GitHub action and repository variables mentioned above.

References

  1. How to serve deep learning models using TensorFlow 2.0 with Cloud Functions by Rustem Feyzkhanov

Footnotes

  1. Within the MinIO container, you can run the following command: mc alias set gcs https://storage.googleapis.com <YOUR-ACCESS-KEY> <YOUR-SECRET-KEY> and check that you can list the contents of the bucket, e.g. mc ls gcs/<YOUR-BUCKET-NAME>
