Back to Search






Mid-Level
Network Development Engineer, Office Network Reliability Engineering
Confirmed live in the last 24 hours
ADCI - Karnataka - A66
Bengaluru, KA, IND
On-site
Posted April 1, 2026
Job Description
Join Amazon's Office Network Reliability Engineering team and help keep our global office networks running smoothly for 540,000 Amazonians across 400+ locations. In this role, you'll combine expert incident resolution with systematic capability building and proactive reliability engineering—designing automation systems, self-service tools, and operational processes that scale our ability to detect, respond to, and prevent network incidents before they impact productivity.
Key job responsibilities
- Provide Tier 3 escalation support on a rotating on-call schedule for your regional hub, diagnosing and resolving complex office network incidents including multi-site outages, routing protocol failures, wireless infrastructure degradation, and circuit performance problems while maintaining clear communication with operations teams
- Build capability and reduce escalations by conducting structured learning sessions after high-severity incidents, identifying gaps in training, permissions, tooling, or technical barriers, and developing automation and self-service tools that enable operations teams to independently handle incidents
- Deliver knowledge transfer and training to operations engineers across your regional hub, covering complex failure patterns, diagnostic techniques, and resolution approaches based on real escalation data and monthly operational reviews
- Execute proactive reliability engineering by conducting Network Availability Risk assessments, driving Operating System Compliance programs, implementing Configuration Compliance initiatives, and participating in Network Infrastructure Validation reviews to identify and remediate technical debt, vulnerabilities, and architectural risks before they cause incidents
- Contribute to platform and tooling development by developing and integrating alarming systems, automation scripts, and monitoring improvements that enhance observability and operational efficiency across the office network infrastructure
A day in the life
As a Network Development Engineer on the Office Network Reliability Engineering team, you'll operate at the intersection of immediate problem-solving and long-term system improvement. Your day might begin by reviewing overnight escalations during handoff, identifying a pattern in wireless controller failures that points to a configuration gap. You'll document the root cause and draft an automated remediation script that enables our Operations Management Center team to self-heal this failure type going forward.
Later, you might receive an escalation about a multi-site network issue affecting three offices in your region, with Amazonians unable to access internal systems. You'll take ownership of the escalation, engage with carriers, isolate the fault to a circuit configuration issue, and restore service. You'll then document the resolution and schedule a lessons learned session to identify why our operations team didn't have the tooling or permissions to address this independently.
In the afternoon, you might join a Network Infrastructure Validation review for a new campus design, making recommendations on alerting coverage and pre-built runbooks before the design moves to production. You'll close your shift by updating documentation, handing off to the next regional team, and reviewing action items from recent lessons learned sessions. No two days are the same—you'll work in an environment where Amazon's scale means developing durable, scalable solutions that have direct and visible impact on hundreds of thousands of people.
About the team
We are a globally distributed team of network engineers operating on a 24/7/365 follow-the-sun model across three regional hubs: EMEA, APAC, and AMER. Our mission is to make the office network invisible to the 540,000 Amazonians who depend on it every day. We partner closely with the Operations Management Center, Office Infrastructure Excellence, AWS Enterprise Networking, and onsite IT support teams to ensure highly available, reliable, and performant networks across all corporate offices.
Our vision centers on building systems and processes that scale Amazon's ability to prevent and resolve network incidents. We're investing in automation platforms, monitoring improvements, and lifecycle automation to reduce the burden on our operations teams and enable them to handle increasingly complex scenarios independently. When you join us, you'll be part of a team that values engineering excellence, intellectual curiosity, and partnership—where you'll have significant autonomy to develop innovative solutions that go beyond standard industry patterns.
- 4+ years of major internet routing protocols experience
- 4+ years of network engineering and deployments for large-scale networks in a corporate environment, including hands-on physical infrastructure installati
Key job responsibilities
- Provide Tier 3 escalation support on a rotating on-call schedule for your regional hub, diagnosing and resolving complex office network incidents including multi-site outages, routing protocol failures, wireless infrastructure degradation, and circuit performance problems while maintaining clear communication with operations teams
- Build capability and reduce escalations by conducting structured learning sessions after high-severity incidents, identifying gaps in training, permissions, tooling, or technical barriers, and developing automation and self-service tools that enable operations teams to independently handle incidents
- Deliver knowledge transfer and training to operations engineers across your regional hub, covering complex failure patterns, diagnostic techniques, and resolution approaches based on real escalation data and monthly operational reviews
- Execute proactive reliability engineering by conducting Network Availability Risk assessments, driving Operating System Compliance programs, implementing Configuration Compliance initiatives, and participating in Network Infrastructure Validation reviews to identify and remediate technical debt, vulnerabilities, and architectural risks before they cause incidents
- Contribute to platform and tooling development by developing and integrating alarming systems, automation scripts, and monitoring improvements that enhance observability and operational efficiency across the office network infrastructure
A day in the life
As a Network Development Engineer on the Office Network Reliability Engineering team, you'll operate at the intersection of immediate problem-solving and long-term system improvement. Your day might begin by reviewing overnight escalations during handoff, identifying a pattern in wireless controller failures that points to a configuration gap. You'll document the root cause and draft an automated remediation script that enables our Operations Management Center team to self-heal this failure type going forward.
Later, you might receive an escalation about a multi-site network issue affecting three offices in your region, with Amazonians unable to access internal systems. You'll take ownership of the escalation, engage with carriers, isolate the fault to a circuit configuration issue, and restore service. You'll then document the resolution and schedule a lessons learned session to identify why our operations team didn't have the tooling or permissions to address this independently.
In the afternoon, you might join a Network Infrastructure Validation review for a new campus design, making recommendations on alerting coverage and pre-built runbooks before the design moves to production. You'll close your shift by updating documentation, handing off to the next regional team, and reviewing action items from recent lessons learned sessions. No two days are the same—you'll work in an environment where Amazon's scale means developing durable, scalable solutions that have direct and visible impact on hundreds of thousands of people.
About the team
We are a globally distributed team of network engineers operating on a 24/7/365 follow-the-sun model across three regional hubs: EMEA, APAC, and AMER. Our mission is to make the office network invisible to the 540,000 Amazonians who depend on it every day. We partner closely with the Operations Management Center, Office Infrastructure Excellence, AWS Enterprise Networking, and onsite IT support teams to ensure highly available, reliable, and performant networks across all corporate offices.
Our vision centers on building systems and processes that scale Amazon's ability to prevent and resolve network incidents. We're investing in automation platforms, monitoring improvements, and lifecycle automation to reduce the burden on our operations teams and enable them to handle increasingly complex scenarios independently. When you join us, you'll be part of a team that values engineering excellence, intellectual curiosity, and partnership—where you'll have significant autonomy to develop innovative solutions that go beyond standard industry patterns.
Basic Qualifications
- Associate's degree or above- 4+ years of major internet routing protocols experience
- 4+ years of network engineering and deployments for large-scale networks in a corporate environment, including hands-on physical infrastructure installati
pythonjavagoawsaiiosdevopsdataproductdesign
Similar Jobs
Lightning AI
Senior Application Security Engineer, AI and Machine Learning
SeniorSan Francisco, Calif...
Glean
Designated AI Support Engineer
Mid-LevelRemote - US
Glean
Designated AI Support Engineer
Mid-LevelNew York, NY
Glean
Designated AI Support Engineer
Mid-LevelSan Francisco Bay Ar...
Glean
AI Support Engineer (PST shift hours)
Mid-LevelBangalore, India
Glean
AI Support Engineer (EST shift hours)
Mid-LevelBangalore, India