ELK Performance Analyst
Lisboa, Lisboa, Portugal

Main tasks:

  • Operate, monitor, and support Cloudera/Hadoop-based infrastructure in production environments.
  • Provide L3 incident response, deep troubleshooting, and performance tuning for big data components.
  • Automate operational tasks using Python and Ansible (monitoring, alerting, deployment, configuration management).
  • Ensure data integrity, replication, and capacity management within HDFS clusters.
  • Develop, maintain, and automate monitoring and alerting for key services and node health.
  • Work closely with platform and data teams to onboard new workloads, ensuring proper resource allocation and performance.
  • Apply patches, upgrades, and configuration changes to maintain security and stability of the big data environment.
  • Manage Kerberos-based authentication, Ranger/Sentry policies, and TLS encryption.
  • Collaborate with storage, network, and security teams to optimize throughput, access controls, and backup strategies.
  • Maintain cluster documentation and share knowledge with the team.

Technical Skills:

- Linux System Engineering & Operations

- Automation (Python, Ansible)

- JVM & Middleware Troubleshooting

- Monitoring and diagnostics of Linux servers (CPU, RAM, disks)

- Storage management for distributed systems (SAN/NAS, RAID, HDFS capacity)

- Cloudera/Hadoop Ecosystem (can be acquired via training)

Language Skills:

- Fluent in English
