Mlops Engineer

Senior MLOps Engineer

Fully Remote (Spain or Portugal)

Start Date ASAP

6-12 month contract

GCP Focus

Strong focus on Personalisation/Recommendation

Job Description:

You will take existing pipelines and evolve them to be
-
- class, responsible for operationalising new models (like NBA, ranking, and LLM-based solutions) with agility and efficiency. Your primary goal is to create a seamless, reliable, and highly observable environment on GCP that empowers our Data Scientists and ML Engineers to iterate and deploy models faster. You will be expected to have created or significantly evolved MLOps frameworks in the past and be able to quantify the improvements you deliver (e. G. , in deployment frequency, model performance monitoring, or system reliability).

What You'll Do:

Take ownership of and evolve our
-
- end ML lifecycle, from data ingestion and feature engineering pipelines to model training, deployment, and
- time serving.
Design, build, and manage robust, automated CI/CD/CT (Continuous Integration / Continuous Delivery / Continuous Training) pipelines specifically for ML models, integrating with existing CI/CD patterns.
Leverage the GCP ecosystem, especially Vertex AI Pipelines, Vertex AI Endpoints, and Vertex AI Model Registry, to create a standardised and efficient path to production.
Design and own a
-
- class observability framework for ML models in production. This includes implementing granular monitoring for model performance (accuracy, bias), data and concept drift, and operational health (latency, throughput, error rates).
Collaborate closely with Data Scientists and ML Engineers to understand their needs, building the tools and abstractions that create a seamless environment and accelerate their workflow.
Optimise ML serving infrastructure for
- latency,
- time personalisation requirements.
Partner with data engineering to ensure robust integration with feature stores and data sources (like Big
Query and Oracle).
Define and track key MLOps metrics to quantify and communicate improvements in system performance, model quality, and team velocity.

What We're Looking For

7+ years of deep,
- on experience in a dedicated MLOps or Dev
Ops role with a strong focus on machine learning systems.
Proven experience building or evolving MLOps frameworks from the ground up, with clear examples of the improvements you delivered.
Expert-level knowledge of the GCP cloud stack, particularly Vertex AI (Pipelines, Endpoints, Training), Big
Query, Pub/Sub, and GKE.
Deep expertise in building and managing observability stacks for
- time ML systems (e. G. , using tools like Prometheus, Grafana, ELK stack, or specialised platforms).
Proven experience operationalising LLM-based systems, including managing embedding generation pipelines, vector databases, and
- tuning/deployment workflows.
Strong practical experience with Infrastructure as Code (Ia
C) tools (e. G. , Terraform, Ansible).
Demonstrable expertise in building and managing complex CI/CD pipelines.
Proficiency in Python and experience with scripting for automation, infrastructure management, and building tooling for ML teams.

Strong understanding of containerisation (Docker, Kubernetes) and microservices architecture as it applies to ML model serving

Empresa:	Oakwell Hampton Group
Localização:	Viseu Viseu, Viseu District, Portugal
Publicado:	15. 11. 2025 Vaga de emprego atual

Responder ao anúncio
Seja o primeiro a candidar-se à vaga de emprego oferecida!

Ofertas de emprego interessantes nas proximidades: