Observability Engineer
Decskill, founded in 2014 as an IT Consulting Company, places paramount importance on its greatest asset: its people. Our main mission is to deliver value through knowledge and talent, and we achieve this by fostering a culture of excellence and investing in the development and
- being of our people. With over 600 dedicated professionals and offices in Lisbon, Porto, Madrid, and Luxembourg, Decskill operates across three core areas:
Decskill Talent: We believe that our people are key to our success. Through Decskill Talent, we empower our team to embrace the digital transformation challenges of our clients. We collaborate with clients to drive innovation, ensuring project success and business growth.
Decskill Boost: Equipping our team with the latest tools and methodologies, we optimize Time-to-Market and deliver innovative solutions exceeding client expectations.
Decskill Connect: Our team collaborates closely with clients to implement and manage IT infrastructures that generate
- term value.
At Decskill, we believe that by nurturing and empowering our people to confront the challenges of digital transformation, we create value not only for our clients but also for our entire ecosystem, fostering a digital community dedicated to growth and progress.
We are looking for an SRE / Onserbability Engineer for a remote position.
Responsibilities:
• Design, implement, and maintain observability solutions covering metrics, logs, traces, and RUM.
• Work with tools such as Grafana Cloud, Tempo, Loki, Mimir, Alloy, and Open
Telemetry.
• Build reliable alerting and monitoring pipelines based on SLOs/SLAs, focusing on
- maintenance automation.
• Ensure the health and integrity of observability data flows from instrumentation to dashboards.
• Collaborate with development and operations teams to embed observability by design into the software lifecycle.
• Define and promote best practices and standards for observability across the organization.
• Support the modernization of observability by replacing and evolving legacy monitoring and alerting solutions.
• Monitor
- related costs and contribute to Fin
Ops efforts by identifying optimization opportunities.
Must-have:
• 3+ years of experience as an SRE, Observability Engineer, or equivalent role.
• Practical experience with Open
Telemetry, or similar instrumentation tools.
• Knowledge in Kubernetes, Helm, Terraform, and Argo
CD.
• Experience designing and managing telemetry pipelines (metrics/logs/traces), exporters, and sidecars.
• Expertise in performance monitoring, alerting, dashboarding, and root cause analysis.
• Knowledge in Java development and applications instrumentation
• Product-oriented mindset with a bias for automation and a “you build it, you run it” culture
• Fluency in English.
Nice-to-have:
• Knowledge of APM and distributed tracing solutions.
• Experience with Fin
Ops practices applied to observability.
• Hands-on involvement in replacing legacy monitoring stacks.
• Experience with Cloud environments (Azure preferred)
• Contributions to
- source observability tools.
If you’re interested in this job please send your CV in English to
Decskill is committed to equality and
- discrimination with all our talents. We recruit and promote talent, based on diversity and inclusion, regardless of age, gender, ethnicity, race, nationality or any other form of discrimination incompatible with the dignity of the human being. The ideal candidate will be responsible for creating, installing and managing our databases. You will ensure optimal database performance by analyzing database issues and monitoring database performance.
- Informações detalhadas sobre a oferta de emprego
Empresa: Decskill Localização: Leiria
Leiria, Leiria District, PortugalPublicado: 14. 9. 2025
Vaga de emprego atual
Seja o primeiro a candidar-se à vaga de emprego oferecida!