Site Reliability Engineer
Join us as a Site Reliability Engineer at Lynxmind
Ensure the availability, latency, and reliability of critical systems by applying SRE practices such as SLOs, error budgets, and incident analysis. Automate the resolution of repetitive issues to reduce MTTR, participate in
- call rotations, and lead
- incident reviews.
Responsibilities
- Ensure availability, latency, and reliability of critical systems
- Apply SRE practices such as SLOs, error budgets, and incident analysis
- Automate resolution of repetitive issues and reduce MTTR
- Participate in
- call rotations and lead
- incident reviews
Must-have qualifications
- Proven experience in SRE, Dev
Ops, or Systems Engineering roles - Strong grasp of monitoring, alerting, and incident management
- Scripting ability in Python, Bash, or Go
- Deep understanding of Linux system internals
Nice-to-have skills
- Exposure to chaos engineering or failure injection
- Familiarity with incident response tools (Pager
Duty, Opsgenie) - SRE/Dev
Ops certifications (Google SRE, AZ-400, etc. )
For more details, contact us at recruitment@lynxmind.com.
#J-18808-Ljbffr- Informações detalhadas sobre a oferta de emprego
Empresa: Lynxmind Localização: Lisboa
Lisboa, Lisboa, PortugalPublicado: 30. 5. 2025
Vaga de emprego atual
Seja o primeiro a candidar-se à vaga de emprego oferecida!