Amazon is where women work


Home    Amazon    Jobs    Job

Job is no longer available

Network Reliability Engineer

Amazon

Dublin, Ireland

DESCRIPTION

Do you aspire to be a world class Network Engineer, and learn from experts in the field? Would you like the opportunity to support a world leader in both online retail and cloud computing? The AWS Network Reliability and Optimization (NRO) team is looking for highly motivated individuals to support the operation of our global network infrastructure, and drive the reliability and performance of our next-generation network as it continues to grow.

The NRO team is primarily responsible for maintaining and sustaining the health and performance of Amazon’s Retail and Web Services network. This team plays a key role in responding to operational changes in the network, investigating complex failure modes and determining root cause, and working with internal and external stakeholders to return the network to a fully operational state. The team is also responsible for identifying, owning and driving initiatives to remove potential risks to the availability of the network and for defining solutions to optimize its performance. Additionally, the NRO team engages early with internal AWS Network Engineering teams in the design, development and testing of new technologies and solutions, ensuring that all infrastructure that gets deployed to the network meets a high quality operational bar.

Responsibilities
Operational Excellence
· Working within a "Follow the Sun" operational organisation, participating on a rota basis to provide reactive response for complex or ambiguous events across the global network, ensuring rapid mitigation and resolution
· Develop expertise across assigned network fabrics / layers and provide escalation support to other AWS operational teams to respond to impacting events
· Identify and define solutions that address technical debt or other availability risks within the network and take ownership for projects that remove those risks
· Measure and identify areas of non-compliance across the network and develop and implement solutions that return the network to defined standards
· Engage in deep dive investigations into the root cause for complex or ambiguous events and help define and implement follow up actions that prevent re-occurrence
· Engage with internal Engineering teams early in the design of new technologies and solutions and take ownership for the operational quality of new infrastructure that gets deployed to the network
Change Management
· Focus on the needs of the customer in all change scenarios
· Determine requirements, write, review and execute changes safely
· Drive standards across the network, and ensure that we are fully compliant to standards and policies
· Work with invested teams to scope change work and deliver to a deadline
Automation
· Look for automation opportunities in troubleshooting and administration tasks
· Work with System Developers to scope software solutions that enhance the availability and performance of the network

Continued Improvement
· Create and review documentation for processes and troubleshooting best practices and deliver informal training to other opertional teams
· Identify simple, sustainable and repeatable solutions and processes
· Provide interview support for open roles within the operations organization

Amazon is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Vet

Visit www.amazon.jobs for more information

BASIC QUALIFICATIONS

· 3-5 years of experience in a Network Engineering / Operations role
· Experience troubleshooting switching and routing platforms
· Applied knowledge and troubleshooting experience of OSPF, BGP, TCP, ARP and Ethernet Protocols
· Strong written and verbal communication skills
· Highly motivated to learn
· Ability to interact efficiently with peers and customers
· Able to work occasional weekends and holidays (on a rota basis)
· Authorized to work in the U.S. without sponsorship

PREFERRED QUALIFICATIONS

· Prior experience supporting a fast paced, global network environment
· Demonstrated ability to work well under pressure
· Experience working with customers to diagnose a problem, and work toward resolution
· Basic understanding of the UNIX/LINUX operating environments with the ability to navigate, manipulate and understand the file structure.
· Understanding of ACLs – application, usage and troubleshooting
· Experience scripting with Python and/or Perl
· Strong logical thinking skills, with the ability to adapt as new information becomes available
· Demonstrated ability to deliver projects to deadline with minimal supervision
· Effective prioritization and time management skills


Share this page: