Lead Site Reliability Engineer
Viseu
Viseu, Viseu District, Portugal

3 days ago Be among the first 25 applicants
We are looking for a Lead Site Reliability Engineer to enhance a global execution platform, delivering robust solutions to trading desks and clients. You will collaborate with expert teams, advancing your expertise in system administration, monitoring, and low‑latency technologies. Join us to contribute to cutting‑edge financial technology innovations. Note that working on‑site at the client's Lisbon office for 2-3 days per week is required. Responsibilities- Design and enforce monitoring, alerting, and incident management
- Automate repetitive tasks and workflows to increase operational
- Work alongside software engineering teams to build and launch scalable, dependable
- Execute production deployments carefully to preserve platform
- Handle incident management with thorough analysis and reporting to maintain service
- Engage in on‑call duties to support essential systems and
- Communicate clearly with colleagues to swiftly resolve technical
- Maintain up‑to‑date documentation for operational workflows and system
- Drive continuous improvements in system reliability and efficiency through proactive initiatives
Requirements- Deep understanding of Unix/Linux operating systems and networking with over 5 years
- Proficiency in Unix/Linux shell scripting and programming languages including Python, Perl, C, C++, or Java- Experience with monitoring and observability solutions such as ITRS Geneos, Dynatrace, Prometheus, and Grafana- Strong troubleshooting skills for complex system
- Experience in environments with high availability and heavy
- Bachelor’s or Master’s degree in IT engineering or a related
- Ability to collaborate effectively within a team and adapt to evolving
- Self‑driven with excellent problem‑solving capabilities and thorough issue
- Excellent written and verbal communication abilities with English proficiency at B2+ level
Nice to
- Familiarity with log analysis tools like Splunk, ELK, Graylog, or Loki- Knowledge of network monitoring solutions such as Corvil- Experience with relational databases including Oracle, Postgre
SQL, My
SQL/Maria
DB, or KDB/q- Understanding of messaging platforms like IBM MQ, Tibco, Solace, LBM, or Kafka- Experience with Infrastructure as Code tools such as Ansible or Terraform
We
- International projects with top
- Work with global teams of highly skilled, diverse
- Employee financial
- Paid time off and sick
- Upskilling, reskilling and certification
- Unlimited access to the Linked
In Learning library and 22, 000+
- Global career
- Volunteer and community involvement
- EPAM Employee Groups- Award‑winning culture recognized by Glassdoor, Newsweek and Linked
In
Seniority
- Mid‑Senior level
Employment
- Full‑time
Job
- Engineering, Information Technology, and Business Development
Industries- Software Development, IT Services and IT Consulting, and Banking
Referrals increase your chances of interviewing at EPAM Systems by 2x
Get notified about new Site Reliability Engineer jobs in Lisbon, Lisbon, Portugal. #J-18808-Ljbffr

Responder ao anúncio
Seja o primeiro a candidar-se à vaga de emprego oferecida!
Continuar
para a próxima oferta
Tech Lead
📍Viseu, Viseu District
Integration Engineer
📍 Viseu, Viseu District 🏢 ALTEN Portugal
Electrical Engineer
📍 Viseu, Viseu District 🏢 TSG Portugal
Data Engineer
📍 Viseu, Viseu District 🏢 KCS iT
Cloud Engineer
📍 Viseu, Viseu District 🏢 MobiLab Solutions
Aws Data Engineer
📍 Viseu, Viseu District 🏢 Fujitsu
Power Platform Engineer
📍 Viseu, Viseu District 🏢 Decskill
Mais ofertas de emprego →
Veja todas as 7 285 ofertas de emprego em Viseu e arredores.

Viseu
Ofertas de emprego em localidades próximas.

0.1368