Senior Site Reliability Engineer (Observability)

Senior Site Reliability Engineer (Observability)
Lisboa
Lisboa, Lisboa, Portugal

Iterable is the leading AI-powered customer engagement platform that helps leading brands like Redfin, Seat
Geek, Priceline, Calm, and Box create dynamic, individualized experiences at scale. Our platform empowers organizations to activate customer data, design seamless
- channel interactions, and optimize engagement—all with
- grade security and compliance. Today, nearly 1, 200 brands across 50+ countries rely on Iterable to drive growth, deepen customer relationships, and deliver joyful customer experiences.

With a global presence—including offices in San Francisco, New York, Denver, London, and Lisbon, plus remote employees worldwide—we are committed to building a diverse and inclusive workplace. We welcome candidates from all backgrounds and encourage you to apply. Learn more about our story and mission on our Culture and About Us pages. Let’s shape the future of customer engagement together!

How you will make an impact:

As a Senior Engineer on the Observability Team, your impact is measured by the clarity and reliability with which our engineers can see into their systems. You don't just provide a suite of tools;

you serve as a strategic observability partner for the entire engineering organization.

Strategic Observability Partnership: You will collaborate deeply with product teams to ensure the frameworks we provide actually solve their problems. Your success is measured by how well teams can diagnose their own services, not just by the uptime of our clusters. You will act as a consultant to help teams define meaningful Observability that reflect the true customer experience.
Set the observability vision – Own the
- term roadmap for Datadog, Grafana, Prometheus, Elasticsearch, Quickwit, and emerging Open
Telemetry tooling. Define SLIs/SLOs that align platform health with customer experience.
Lead
- scale implementations - Design and automate scalable pipelines (metrics, traces, logs, events) so every engineer has
- second, queryable visibility into production.
Harden our platform - Drive upgrades, capacity modeling, and policy enforcement for our dedicated
- focused clusters;

introduce
-
- classpatterns for
- tenant isolation and cost optimization.
Ship platform enhancements – Contribute
- quality Go or Python services, operators, and Terraform modules that elevate reliability, performance, and developer velocity.
Partner with service owners to embed observability into their SDLC, guide best practices, perform instrumentation reviews, and elevate
- call readiness across the org.
Reduce MTTR, noise, and waste by designing
- efficient telemetry architectures,
- signal alerting, and automated recovery patterns.
Lead and model operational excellence through
- call participation,
- incident reviews, and continuous improvement initiatives.

What we’re looking for

We prioritize demonstrated proficiency and the ability to solve complex problems over years of experience.

Kubernetes & Cloud Mastery

Cluster Operations: Proven ability to architect and manage
- grade Kubernetes (EKS) clusters, specifically for stateful workloads.
Infrastructure as Code: Proficiency of Infrastructure-as-code (Ia
C), including Terraform.

Observability & Engineering Depth

Telemetry Expertise: Deep production experience with Elasticsearch, Prometheus, or Open
Telemetry. You know how to tune these systems for
- terabyte daily workloads.
Software Engineering: Proficiency in Go or Python to build custom operators, internal tools, and automation.
Data Pipeline Design: Ability to optimize ingestion and storage for logs, metrics, and traces while balancing query performance with
- efficiency.

Leadership & Collaboration

Consultative Approach: Ability to influence engineering culture by mentoring peers and partnering with service owners to improve their observability posture.
Growth Mindset: A humble, collaborative approach to
- solving and a bias toward systemic, automated solutions.

Bonus points

Hands-on success migrating to Open
Telemetry or similar
- neutral standards.
Experience tuning Datadog APM/Logs, Grafana/ Thanos/Mimir, or Click
House-based log stores for
- TB/day workloads.

Perks & Benefits:

Competitive salaries & meaningful equity
Private Medical Insurance
Life/Risk Assurance
Meal Allowance: 8. 55€ per day
Community Days (additional paid holidays)
Paid Annual Leave (22 days)
Paid Sabbatical (after 4 years tenure)
Initial laptop workstation setup
Teleworking Allowance

Iterable is an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Iterable does not make hiring or employment decisions on the basis of race, color, religion or religious belief, ethnic or national origin, nationality, sex, gender,
- identity, sexual orientation, disability, age, military or veteran status, or any other basis protected by applicable local, state, or federal laws or prohibited by Company policy. Iterable also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. Pursuant to the San Francisco Fair Chance Ordinance and other similar state laws and local ordinances, and its internal policy, Iterable will also consider for employment qualified applicants with arrest and conviction records.

Informações detalhadas sobre a oferta de emprego

Empresa:	Iterable
Localização:	Lisboa Lisboa, Lisboa, Portugal
Publicado:	15. 1. 2026 Vaga de emprego atual

Responder ao anúncio
Seja o primeiro a candidar-se à vaga de emprego oferecida!

Senior Site Reliability Engineer (Observability)
Lisboa
Lisboa, Lisboa, Portugal

Teradata Data Engineer

Senior Data Engineer

Azure Data Engineer

Senior DevOps Engineer

Site Reliability Engineer (SRE) @Lisboa

DevOps Engineer

C++ Software Engineer

Lisboa
Ofertas de emprego em localidades próximas.

Ofertas de emprego

Senior Site Reliability Engineer (Observability)LisboaLisboa, Lisboa, Portugal

LisboaOfertas de emprego em localidades próximas.

Senior Site Reliability Engineer (Observability)
Lisboa
Lisboa, Lisboa, Portugal

Lisboa
Ofertas de emprego em localidades próximas.