Member of Technical Staff - Production Engineering
Pure Storage
Job Description
We’re in an unbelievably exciting area of tech and are fundamentally reshaping the data storage industry. Here, you lead with innovative thinking, grow along with us, and join the smartest team in the industry.
This type of work—work that changes the world—is what the tech industry was founded on. So, if you're ready to seize the endless opportunities and leave your mark, come join us.
THE ROLE
As a core member of the Forensics team, you will be the lead detective for Everpure’s most complex technical challenges. Your mission is to perform deep-dive debugging and root-cause analysis on issues that span our large-scale storage platforms, with a specific focus on firmware integration within the broader storage stack. Working at the intersection of engineering, hardware, and platform teams, you will resolve high-stakes escalations and architect the reliability features that protect the data of the world’s largest hyperscale customers.
WHAT YOU’LL DO
- Lead High-Impact Investigations: Drive root-cause analysis for critical escalations by correlating evidence across platform logs, system metrics, and hardware telemetry to pinpoint the source of failure.
- Debug Across the Stack: Analyze complex failures that range from high-level operating system layers down to low-level device firmware and hardware interactions.
- Engineer Triage Tooling: Design and refine automated tools for log analysis and health monitoring to accelerate issue detection and reduce "Time to Resolution" for the entire engineering organization.
- Collaborate in "War Rooms": Participate in cross-functional debug meetings and daily war rooms, providing the technical evidence and reproduction steps needed to drive fixes across firmware and software teams.
- Architect System Guardrails: Contribute to the design of safety checks, observability features, and "safety rails" that proactively prevent known failure modes in our storage solutions.
- Support Global Rollouts: Ensure the reliability of new firmware builds and hardware SKUs by validating their integration at scale before they reach the customer.
WHAT YOU BRING
- Deep Systems Forensics Expertise: Extensive experience in systems, storage, or low-level software engineering (Kernel, Embedded, or Networking) with a "detective" mindset for solving non-deterministic bugs.
- Multi-Layer Debugging Mastery: Proven ability to navigate and resolve issues across OS, platform software, and NVMe/Flash sto