Job Description
Join our team of Incident Managers/Tech Ops Engineers and help us maintain high availability on the Amazon Retail Website. As a front-line defense, you'll drive down event duration using your operational experience, knowledge of best practices, and incident management tools.
Key responsibilities include:
- Become a technology evangelist and use your knowledge to solve business problems.
- Reduce mean time to resolution for all incident types.
- Design and/or build world-class listening systems.
- Adapt and improve operations management systems and processes.
- Participate in Agile sprints and create standard operating procedures.
- Identify and troubleshoot recurring platform issues.
- Automate tasks by writing and maintaining scripts.
- Respond to customer requests within the SLA.
- Participate in follow-the-sun rotations.
- Mentor peers and participate in the interview process.
We offer a stimulating work environment, mentoring programs, regular tech talks, and well-defined career paths for motivated engineers.
BASIC QUALIFICATIONS:
- Bachelor's degree in Computer Science or a related field.
- Relevant experience in a large-scale online technical operations environment.
- Strong incident management skills.
- Scripting experience in at least one interpreted language.
- Experience with Linux and a working knowledge of networking fundamentals.
- Experience driving collaborative projects.
- Experience with Agile/Scrum or a related workflow.
PREFERRED QUALIFICATIONS:
- Confidence to drive large conference calls.
- Understanding of routing protocols.