Lead Image © jame505th, 123RF.com
Implementation of an MLOps pipeline from ETL to deployment
In the Pipe
We deliver a production-grade machine learning operations (MLOps) [1] pipeline that accommodates the entire machine learning (ML) application lifecycle from ingestion of data to deployment in near-real time in a Kubernetes context. The pipeline has three primary phases of operation: (1) A mature extract, transform, and load (ETL) process reads in semi-structured and structured data, cleanses and transforms it, and saves the output in a scalable MongoDB instance; (2) dynamic model training reads in fresh data from MongoDB and employs reproducible, modulated scripts to achieve on-demand, decoupled computation; and (3) all parts are dockerized and shipped to Minikube hosting in a local cluster of Kubernetes.
Deployment and service configurations reveal near-real time inference by NodePort services. The system follows current DevOps techniques of modulated deployment, compositionality, and scaling and forms the basis of realizing in the near future additional extensions (e.g., continuous integration and continuous delivery (CI/CD) and observability) and deployment to the cloud, resulting in a working, cloud-agnostic method of deployment and scaling of ML systems.
ML Deployment Difficulties
Machine learning deployment in production is usually hindered by disjointed workflows, manual intervention, and a pipeline that is not automated end-to-end. Conventional ML pipelines – covering data ingestion through deployment – are normally disjointed, script-based, and not very integrated with contemporary DevOps methods. This disjointedness creates major hurdles in scaling, reproducibility, and maintainability and inhibits the operationalization of ML models in applications.
A number of systemic problems lie behind this inefficiency. Pipelines are often constructed as discrete pieces that interact with each
...Buy this article as PDF
(incl. VAT)
Buy ADMIN Magazine
Subscribe to our ADMIN Newsletters
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Most Popular
Support Our Work
ADMIN content is made possible with support from readers like you. Please consider contributing when you've found an article to be beneficial.

