Staff Software Engineer, Order Management
Confirmed live in the last 24 hours
Stitch Fix
Compensation
$130,900 - $218,000/year
Job Description
About Stitch Fix, Inc.
Stitch Fix (NASDAQ: SFIX) is the leading online personal styling service that helps people discover the styles they will love that fit perfectly so they always look - and feel - their best. Few things are more personal than getting dressed, but finding clothing that fits and looks great can be a challenge. Stitch Fix solves that problem. By pairing expert stylists with best-in-class AI and recommendation algorithms, the company leverages its assortment of exclusive and national brands to meet each client's individual tastes and needs, making it convenient for clients to express their personal style without having to spend hours in stores or sifting through endless choices online. Stitch Fix, which was founded in 2011, is headquartered in San Francisco.
About the Role
We are looking for a Staff Software Engineer to provide technical leadership for our Order Management domain. This role is responsible for shaping the architecture, reliability, and long-term evolution of the systems that power checkout, order lifecycle, fulfillment orchestration, and integrations with critical third-party partners.
You will work across multiple teams, partnering closely with Product Managers, Engineering Managers, and other senior engineers to define technical strategy, drive complex initiatives, and ensure our order platform is resilient, scalable, and operable under real-world conditions such as peak traffic, partner outages, and partial failures.
This is a hands-on leadership role. You will design and build systems, lead technical decisions, guide incident response, and raise the bar for quality, reliability, and clarity across the domain.
You're excited about this opportunity because you will…
- Own and evolve the technical direction of the Order Management domain, including checkout, order state, fulfillment coordination, and third-party integrations.
- Design and implement resilient distributed systems with clear failure modes, graceful degradation, safe rollout and rollback strategies, and strong observability.
- Establish and standardize architectural patterns for partner integrations, including timeouts, retries, circuit breaking, fallbacks, idempotency, reconciliation, and consistency guarantees.
- Lead domain-wide initiatives from problem framing through production rollout and long-term hardening, ensuring solutions are not only shipped but remain reliable and maintainable.
- Drive high-severity and cross-service incident response, including communication, technical decision making, root-cause analysis, and systemic remediation.
- Define and uphold domain standards for testing, release safety, on-call readiness, runbooks, SLIs/SLOs, and operational excellence.
- Produce clear technical designs, RFCs, and decision records that create alignment and a durable paper trail for future teams.
- Partner with Product and Engineering leadership on roadmap planning, sequencing, and investment tradeoffs, framing technical decisions in terms of customer impact, risk, and business outcomes.
- Mentor and coach senior and mid-level engineers, raising the quality bar for system design, operational thinking, and technical decision-making across the organization.
We’re excited about you because…
- You have 8+ years of experience building and operating large-scale backend or distributed systems.
- You have strong software engineering experience across one or more modern programming languages; our current stack includes Golang, Ruby on Rails, PostgreSQL, AWS, Kafka, Temporal, though prior experience with these specific tools is not required.
- You have experience in e-commerce, order management, payments, inventory, logistics, or other high-throughput, transactional domains (preferred, but not required).
- You have strong production experience in cloud-hosted systems (e.g., GCP, AWS, Azure), with the ability to reason about capacity, failure scenarios, and operational behavior beyond just the application layer.
- You have hands-on experience with relational databases such as PostgreSQL, with solid understanding of data modeling, transactional semantics, performance tuning, and operational concerns (backups, failover, migrations).
- You have a proven ability to design and operate business-critical, stateful systems with high availability and strict correctness requirements.
Similar Jobs
DraftKings
Senior Software Engineer, iOS
Western Union
Senior Manager, Solution Engineering (Devops)
Broadcom
Staff Software Engineer (ICB4)
Cadence Design Systems
Sr Principal Software Engineer
Fidelity Investments
Principal Systems Engineer
Fidelity Investments