Site Reliability Engineer
Confirmed live in the last 24 hours
Ensono
Job Description
At Ensono, our purpose is to be a relentless ally, disrupting the status quo and enabling our clients to Do Great Things. As a trusted technology adviser and managed services provider, we help clients navigate continuous change and embrace innovation.
We deliver world-class transformational services across hybrid cloud, infrastructure, mainframe, data, IdAM, and cloud-native solutions, simplifying complex business challenges and creating new pathways to success. Headquartered in the USA and backed by private equity, Ensono has a strong track record in the UK and Europe, with growth plans built on trusted partnerships and deep industry expertise.
About the role:
We’re looking for Site Reliability Engineering (SRE) to join our growing team. We support a variety of data solutions that we’ve developed, and we are seeking individuals with data expertise to enhance our SRE team.
This role offers excellent career progression, as our division is in a state of growth—expanding both in-team size and client base. There are numerous opportunities to advance your career with us.
Key responsibilities:
• Act as a technical escalation point for unresolved data platform issues in the SRE Pod/s;
• Monitor, maintain, and troubleshoot databases/data warehouses and related infrastructure;
• Collaborate with the data engineering team to ensure efficient data flow and transformation;
• Develop and maintain accurate technical documentation in the form of operational runbooks;
• Perform standard pre-approved changes within the scope of our client’s Change Management Process (i.e. new users, etc.);
• Use Ensono’s helpdesk and work tracking systems to maintain logs of all support requests and incidents, and improve these processes, both technically and through stakeholder management.
• Participate in the process for, and proactively mitigate risks in a Security management process (Vulnerabilities in Code, Infrastructure, Dependencies) aligned to both Ensono’s and our Clients compliance objectives;
• Engaging with suppliers and 3rd parties for support, requests and opportunities, managing the relationship our clients get the best value for their service
What you will being to Ensono:
• IAC tooling (Terraform preferably, or ARM/bicep and CloudFront) Core CI/CD Tooling (Azure DevOps, GitHub Actions or Gitlab)
• Monitoring Tooling (DataDog, Splunk, NewRelic, Azure Monitor, AWS CloudWatch)
• Demonstrable experience in multiple core technology (Dotnet, Java, AI/Data Engineering, Golang)
• Troubleshooting issues and identifying systemic failings indicated by incidents/failures Implementing fixes and features
• Proposing solutions for reducing toil
• Implementing and refining automation for incident and service request resolution
• Providing leadership in the Incident resolution process, including creating and maintaining documentation, and leading Post-mortem analysis and mitigation planning.
• Designing and Reinforcing Service Requests and Change Management (both technically and through stakeholder management) processes, and improving existing processes.
• Develop and enhance the process for, and Proactively mitigate risks through Security management (Vulnerabilities in Code, Infrastructure, Dependencies)
• Lead discussion for multiple clients in client-facing meetings around the SRE process, identifying areas for increasing SRE footprint and identifying opportunities for small works and consultancy.
• Engaging with: Suppliers and 3rd parties for support, requests and opportunities
• Cross-sale and cross-pollination opportunities within the Ensono organisation
• Cloud provider (AWS, Azure, GCP) ‘DevOps Engineer’-level certification and CKAD certification highly beneficial, or required during probationary period.
What we can offer you:
We are a people-first business, which means people are at the heart of everything we do here. We offer our associates a safe environment where learning, knowledge sharing, and open communication is encouraged. Whether at one of the internal events, such as socials, competency meet-ups, hackathons or as part of one or more of our global
Similar Jobs
New Era Technology
Site Reliability Engineer (SRE)
Axle Informatics
Site Reliability Engineer
Veeam Software
Site Reliability Engineer II
Anduril Industries
Senior Site Reliability Engineer, Production Engineering
Anduril Industries
Senior Site Reliability Engineer, Production Engineering
Yugabyte