Staff Site Reliability Engineer
Redwood Materials
Job Description
About Redwood Materials
Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling — keeping critical minerals in circulation and driving the energy transition. Founded in 2017, we’re delivering low-cost and large-scale energy storage and producing battery materials in the U.S. for the first time, all from batteries we already have.
Essential Duties:
We are seeking a highly skilled and motivated Staff Site Reliability Engineer to collect requirements, design & implement highly available systems & solutions, coordinate work across multiple teams, drive improvements to existing systems, introduce automation & integrations, and ensure appropriate monitoring & alerting are in place for rapid response. In this role, you will collaborate on, assist with, and lead projects, and drive initiatives to ensure Redwood Materials has resilient systems in place as it scales rapidly into a global enterprise.
Responsibilities will include:
- Collect business & technical requirements and work with cross-functional teams to establish SLOs
- Design effective on-premises & hybrid systems & solutions with high availability & scalability, utilizing platform technologies including vSphere, Kubernetes, Linux, and Windows.
- Coordinate work across IT, Software, Industrial Controls, Engineering & Business teams to implement complete systems & ensure business needs are met.
- Identify opportunities to automate deployment & management of IT infrastructure & systems to reduce manual efforts and speed recovery.
- Develop integrations that streamline use & visibility of data across components to deliver complete, efficient systems providing excellent utility & ease of use.
- Support deployed systems by responding to incidents, leading fast triage, troubleshooting issues, and participating in an on-call rotation.
- Lead post-incident reviews and drive improvements to eliminate repeat failure modes
Desired Qualifications:
- Bachelor’s degree in information technology or any related field.
- 2+ years in an SRE-related role and 5+ years in an IT systems-related role.
- Experience administering IT Infrastructure such as VMware, Active Directory, Windows Server, Linux, Networking, Cloud Infrastructure (AWS, Azure), Load balancing & Monitoring
- Expertise in scripting, coding, automation, and integration with tools and formats such as Python, Ansible, Chef, Puppet, REST, YAML, JSON, etc.
- Experience working with SCADA, OT, MES, or other industrial software & systems is preferred.
- Experience with DR playbooks, capacity modeling, and cost/performance optimization in hybrid environments
- Self-motivated, hands-on mindset, with a willingness to contribute at all levels.
- A passion for sustainability and making the world a better place!
Physical Requirements:
- Ability to perform the essential job functions safely and successfully, consistent with the ADA and FMLA.