Observability Engineer

Job Details

DGU Tech Services is looking to recruit an Observability Engineer.

EDP is a global energy group present in around 30 markets with a particular emphasis on renewable energies. With more than 45 years of experience, we have been consolidating a relevant presence on the world energy scene based on the commitment to be all-green by 2030, leading the energy transition. With more than 13,000 employees around the world, we are committed to using our energy and heart to drive a better tomorrow.

What you will do:

Collaborating closely with teams managing monitoring platforms, this role drives the development of innovative solutions while leading initiatives to enhance system observability.

Design and implement improvements to monitoring processes, ensuring that systems are effectively tracked for performance and enabling proactive issue resolution. This optimizes performance, minimizes downtime, and supports continuous system health monitoring;

Collaborate with DevOps, development, and cross-functional teams to request, track, and coordinate the implementation of system configurations and observability practices. Ensure the effective adoption of observability solutions in alignment with operational goals;

Design and implement monitoring solutions to ensure system and application performance observability, using tools such as Splunk, Elastic, and other relevant platforms;

Develop and optimize automation scripts for monitoring processes and problem resolution, minimizing response times and enhancing operational reliability;

Collect, analyze, and interpret system performance metrics, identifying bottlenecks and ensuring SLA compliance while optimizing overall performance;

Configure customized dashboards and automated alerts to provide clear, real-time visibility into system status, enabling quick detection and resolution of incidents;

Continuously evaluate observability solutions, proposing and implementing improvements and optimizations to meet evolving needs and align with organizational objectives;

Consolidate data from multiple monitoring tools into a unified view, facilitating efficient IT operations management and informed decision-making;

Proactively monitor critical business services, implementing preventive measures to mitigate risks and avoid disruptions;

Collaborate in the implementation and refinement of system and monitoring tool configurations, ensuring that observability solutions align with operational needs;

Leverage experience with ITIL frameworks and platforms like ServiceNow to align monitoring processes with service management best practices, enhancing incident management, change control, and service delivery.

Employment type: Full-Time

Work site: Hybrid Model

What we are looking for:

Bachelor's or Master's degree in Engineering or a related field;

Extensive experience with monitoring tools, particularly Splunk, Elastic, and ITSI, with the ability to propose effective solutions;

Proven expertise in problem analysis and incident resolution in complex environments;

Familiarity with cloud platforms (Azure, AWS, Google Cloud) and their native monitoring tools;

Experience with ITIL practices and platforms like ServiceNow;

Proficiency in configuration management tools such as Ansible and Terraform for automating and standardizing system and monitoring configurations;

Hands-on experience with APM tools like Dynatrace for monitoring critical application performance;

Strong scripting skills (Python, PowerShell, Shell Script) for automating monitoring tasks and optimizing incident response;

Focus on continuous improvement through the analysis of performance and availability metrics, proactively optimizing operations and ensuring SLA compliance;

Competence in developing real-time dashboards and alerts, providing efficient system visibility;

A proactive approach to optimizing solutions already in place;

Ability to monitor new system configurations, ensuring that changes do not negatively impact performance or observability;

Knowledge of OpenTelemetry for distributed tracing, metrics, and logging in complex systems;

Strong coordination skills with cross-functional teams to integrate monitoring practices and oversee configuration follow-ups;

Experience with centralized observability platforms, consolidating data and applying best practices for comprehensive application observability.

More than academic knowledge and technical skills, we are looking for ambitious people who are enthusiastic about the future and who bring human skills aligned with our purpose.

Equal opportunities for all

Our vision is that each person combines their unique characteristics and experiences to fulfill our mission of creating new energy for the planet. We are an inclusive employer, ensuring all candidates are treated fairly throughout the recruitment process. We welcome and value all people, and we are committed to fostering a sense of belonging for each person who is part of the EDP group.

Need more reasons to apply?

As a top employer, we:

Empower our employees through a positive and innovative work environment that promotes collaboration and agile decision-making;

Respect and value each person, providing a flexible, healthy, and inclusive workplace with a range of attractive benefits;

Provide a meaningful work experience and prepare our people for future challenges through different opportunities for development and internal mobility.

Our efforts have resulted in several distinctions over time, highlighting the EDP group's strong positioning and its dedication and commitment to attracting and retaining the best talent:

Top employer certification by Top Employers Institute

Part of the Bloomberg Gender-Equality Index

Global certification as a family-responsible company by Fundación Másfamília

Top 100 Workplaces by Houston Chronicle

Discover our tips to enhance your performance during the recruitment process, and apply by 25/11/2024 if you think you are the right fit for this opportunity.


Nominal salary: To be agreed

