Company Overview
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve transformational business outcomes.
Financial technology is a high-growth industry as change and innovation continue to disrupt the status-quo and prompt major transformation. Arcesium is at a particularly interesting time in our own growth as we look to leverage our successfully established market position and expand operations in pursuit of strategic new business opportunities. We value intellectual curiosity, proactive ownership, and collaboration with colleagues, and we empower you to meaningfully contribute from day one and accelerate your professional development.
Position Summary
We are looking for an SRE to join our Corporate Technology team. The ideal candidate will be involved in planning, designing, and implementing various applications and infrastructure used by our staff. Strong focus will be on developing and managing applications built with cloud native and serverless technologies leveraging Azure & AWS Services. The ideal candidate is an excellent Site Reliability Engineer with experience in cloud-based tech and a firm understanding of how to solve business needs using emerging technologies with emphasis on building applications that are cost friendly and support zero-touch operations. You'll also need to analyze various reports and statistical data to measure productivity levels and identify root causes for underperforming areas, develop customized reporting to measure and track operational statistics, data and results, oversee weekend activities across various office spaces such as user migrations to newer platforms, software & hardware upgrades and audits etc. The role requires strong collaboration and communication skills, as it involves working with various departments and potentially traveling to the London office.
Responsibilities
Build integrations with third party SaaS applications that will include custom user provisioning, SSO, automation for migrating data and custom integrations with other applications.
Build, enhance and support serverless applications that use AWS Services like Lambda, SQS, SNS, DynamoDB, API Gateway, CloudWatch.
Use MS Azure for managing operations in Windows Compute and Solutioning domain.
Write good code, catch bugs, and style issues in code reviews, ship small features independently.
Participate in all aspects of the software development life cycle for AWS/Azure solutions, including planning, requirements, development, testing, and quality assurance.
Ensure the applications have optimal observability, monitoring and alerts that help identify the problems before they affect business productivity.
Handle operation issues for both Portugal and London office and act as Escalation engineer for both the sites. This role will require travel.
Familiarity with networking fundamentals and Desktop Infrastructure. Proficiency in OS management and network administration, including TCP/IP, DNS, DHCP, VLANs, routing, and switching.
Employ exceptional problem-solving skills to troubleshoot incidents, identify root cause, fix and document problems, and implement preventive measures.
You may also be involved in supporting our existing Corporate Tech applications and infrastructure like – Azure, AD, M365, Slack, Outlook/Exchange, AWS Workspaces/desktop infrastructure and other enterprise SaaS products.
Experience with security best practices and compliance standards (e.g., GDPR, ISO 27001) is essential for managing cloud infrastructure and data.
Qualifications & Must Haves
2+ years of solid SRE skills, with a proven track record in developing quality software solutions and passion for technology.
We are a global team, so excellent communication skills (both verbal and written in English and Portuguese) are critical as well as flexibility to work with team members in other time zones.
Hands on experience in troubleshooting Operational issues and documentation for RCA.
Proficient with writing and reviewing Python and other object-oriented language(s) are a plus.
Excellent analytical and problem-solving skills - In addition to technical expertise, the ideal candidate should possess strong problem-solving abilities, adaptability, and a proactive approach to learning and development.
Experience in at least one programming language with Python, Java, PowerShell preferred in that order.
Experience with Infrastructure as code tools like AWS CloudFormation/AWS Cloud Development Kit/Terraform.
Experience in having written and deployed code that runs in a production service in the Cloud and have demonstrated knowledge in DevOps and DevSecOps.
Hands on experience in several of the following areas – Object oriented programming skills, designing RESTful web services, NoSQL database design, AWS Cloud services, containerized microservices using Docker and Kubernetes, CI/CD pipeline.
Good To Have
Experience in cloud native and/or serverless architecture, Slack apps, and Azure/AWS certifications are a plus.
Experience with CI/CD pipelines, container orchestration tools (e.g., Kubernetes), and monitoring and logging tools (e.g., Prometheus, Grafana) is highly desirable.
Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just a legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well qualified individuals having a wide range of backgrounds and personal characteristics.
#J-18808-Ljbffr