Senior Site Reliability Engineer

Senior Site Reliability Engineer
Empresa:

Sword Health, Inc



Função de trabalho:

Tecnologia da informação

Detalhes da Vaga

```html
Sword Health is on a mission to free two billion people from pain as the world's first and only end-to-end platform to predict, prevent and treat pain. Delivering a 62% reduction in pain and a 60% reduction in surgery intent, at Sword, we are using technology to save millions for our 2,500+ enterprise clients across three continents. Today, we hold the majority of industry patents, win 70% of competitive evaluations, and have raised more than $300 million from top venture firms like Founders Fund, General Catalyst, and Khosla Ventures.
Recognized as a Forbes Best Startup Employer in 2023, this award highlights our focus on being a destination for the best and brightest talent. Not only have we experienced unprecedented growth since our market debut in 2020, but we've also created a remarkable mission and value-driven environment that is loved by our growing team. With a recent valuation of $2 billion, we are in a phase of hyper growth and expansion, and we're looking for individuals with passion, commitment, and energy to help us scale our impact.
Joining Sword Health means committing to a set of core values, chief amongst them to "do it for the patients" every day, and to always "deliver more than expected" on behalf of our members and clients. This is an opportunity for you to make a significant difference on a massive scale as you work alongside 800+ (and growing!) talented colleagues, spanning two continents. Your charge? To help us build a pain-free world, powered by technology, enhanced by people — accessible to all.
As a Site Reliability Engineer (SRE) at Sword Health, you will play a critical role in maintaining the health and uptime of our services. You will collaborate with development teams to build and operate scalable and resilient systems, troubleshoot issues across the stack, and implement automation to reduce manual work.
What you'll be doing:

Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis.
Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications.
Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency.
Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations.
Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members.
Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting.

What you need to have:

Proficiency in programming languages such as Python, Go, Javascript.
5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure.
Strong understanding of Linux/Unix systems and networking.
Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
Database Experience: Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch).
Team Player: Willingness to collaborate and share knowledge with colleagues to drive collective success.
Ownership: Taking responsibility for your work and demonstrating accountability for outcomes.

What we would love to see:

Innovative Mindset: A passion for exploring new technologies and methodologies to improve reliability and performance.
Proactive Approach: Ability to anticipate potential issues and implement preventive measures.
Continuous Improvement: A dedication to learning and growing in your role, staying updated with industry trends and best practices.

To ensure you feel good solving a big Human problem, we offer:

A stimulating, fast-paced environment with lots of room for creativity;
A bright future at a promising high-tech startup company;
Career development and growth, with a competitive salary;
The opportunity to work with a talented team and to add real value to an innovative solution with the potential to change the future of healthcare;
A flexible environment where you can control your hours (remotely) with unlimited vacation;
Access to our health and well-being program (digital therapist sessions);
Remote or Hybrid work policy.

```
#J-18808-Ljbffr


Fonte: Whatjobs_Ppc

Função de trabalho:

Requisitos

Senior Site Reliability Engineer
Empresa:

Sword Health, Inc



Função de trabalho:

Tecnologia da informação

Atenção Super Heróis! (M/F) Yupi! - Porto

O que procura nos candidatos Na Yupi, não precisamos de capas para salvar o dia - só de pessoas incríveis como TU! Estamos à procura de talentos destemidos...


Desde - Porto

Publicado 19 days ago

Senior Application Support

Seeking a Senior Application Support: Rhino, are you there?At WE ARE META, we focus on finding the perfect match between our Rhinos and our clients.Expand yo...


Desde We Are Meta - Porto

Publicado 19 days ago

Systems Administrator (M/F) Portugal/Porto

We are seeking a skilled Systems Administrator to join our team at Mindera. As a Systems Administrator, you will be responsible for managing and maintaining ...


Desde Mindera Group - Porto

Publicado 19 days ago

Quality Analyst - German Or Greek Speaker

Our client is a multinational company.A consolidated and dynamic Customer Service hub known worldwide.Sobre o nosso clienteOur client is a multinational Cust...


Desde Michael Page - Porto

Publicado 19 days ago

Built at: 2024-10-06T12:01:09.975Z