Platform Monitoring Specialist
At Bloq.it, we’ve created the world’s leading smart locker solution, solving online deliveries by enabling everyone to participate easily, reducing delivery costs and making them more sustainable.
We’re quickly expanding, and after growing at 1000% for three years in a row, we’re now the fastest-growing Smart Locker company in the world and one of the fastest-growing scale-ups in Europe.
We are in search of a Platform Monitoring Specialist to join our innovative team as our new #bloqstar. In this role, you’ll be key to designing, scaling, and maintaining our monitoring stack, ensuring deep visibility into our hybrid cloud/edge systems and helping teams anticipate and resolve issues before they impact our customers.
What you’ll be doing:
- Own and evolve our monitoring, alerting, and observability infrastructure, ensuring coverage across all environments (Cloud, Lockers, CI/CD pipelines);
- Collaborate with engineering teams to define metrics, logs, and tracing strategies that reflect critical SLIs/SLOs;
- Build and maintain dashboards and alerts using Datadog, driving insights for engineering, QA, and operations teams;
- Act as first responder and escalation point during platform incidents, coordinating diagnostics and driving post-mortems;
- Develop automated health checks, alert tuning processes, and data integrity checks for critical services;
- Support continuous improvement of monitoring playbooks, runbooks, and documentation.
What you’ll bring to the table:
- Proven experience as a Platform, DevOps, or Site Reliability Engineer with a specialized focus on observability or monitoring;
- Solid expertise with Datadog (or equivalent platforms like Prometheus, Grafana, New Relic);
- Strong experience with AWS services, Linux system administration, and Infrastructure-as-Code (e.g., Terraform, CDK);
- Proficiency with CI/CD pipelines and automation (GitHub Actions preferred);
- Experience working with logging, tracing, and metric systems, and designing high-signal alerting rules;
- Strong troubleshooting and problem-solving skills in production environments;
- Fluent in English, both written and spoken.
It would be great if you would also have:
- 4+ years in a platform, SRE, or observability role in a production-grade environment;
- Familiarity with Atlas MongoDB, MQTT brokers, and distributed edge devices;
- Experience defining SLIs/SLOs/SLAs and implementing reliability guardrails;
- Knowledge of incident response frameworks and root cause analysis methodologies.
Why join us?
- The opportunity to join our Software team and play a pivotal role in building and improving our infrastructure, while contributing to innovative solutions that redefine Bloq.it's revolution in the smart locker industry;
- A dynamic and fast-paced work environment with a culture of innovation, collaboration, and continuous learning;
- Competitive salary and flexible benefits package, tailored to your experience and skills;
- Eligibility for a performance-based bonus, tied to your results and designed to reward your impact;
- Work how you work best: we offer a remote-friendly policy and flexible hours so you can stay productive and keep life balanced;
- Portuguese Health Insurance;
- Unlimited days off (subject to manager approval).
Ready to join the revolution?
Company: Phiture
Location: Lisbon, Portugal
Published: 22.8.2025