Software Development Engineer, AWS Incident Tooling - Amazon
  • Dublin, Leinster, Ireland
  • via BeBee.com
Job Description

Amazon Web Services is the largest consumer cloud offering in the world, powering cutting-edge science, rapidly growing start-ups and industry-leading companies.

The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet.

We work closely with the teams who own the largest AWS products, building systems to detect and mitigate operational issues before they impact customers.

We are looking for a knowledgeable and experienced software development engineer to help us succeed in this mission.

As a Software Development Engineer at AWS Incident Response Systems, you will join the team in designing and implementing systems which automate fault containment, problem diagnosis, and issue resolution across multiple hugely distributed, always-on architectures.

These systems will take metric and dependency data from multiple sources and analyse it, correlating it with customer impact to determine the root cause of an issue without human intervention.

They will create engagements and facilitate communication and coordination of the response and mitigation.

As the scale and complexity of AWS grows, this is the best way that we can offer our customers a stable and reliable cloud computing platform.

We succeed when these systems detect, diagnose, and repair operational defects without customer impact or human intervention.

You will work with teams across AWS to drive adoption of the software that has been built by the team, and influence systems development practices for new and existing products.

You will define availability goals for service teams across AWS, and strategies to make these goals attainable with minimal effort.

Your goal will be to remove human error from the day-to-day operations of the massive, always-on, distributed systems which make up AWS.

Within your first year on the AWS Incident Response Systems team, you will have met with senior technical leaders from across AWS and designed and implemented at least one new system. You will also have dived deep into the causes of at least one historic customer-impacting event and determined how to prevent a similar event from ever happening again.

As your career continues to develop, you will influence the growth and direction not only of the Incident Response Systems team, but of the AWS group as a whole.

If this sounds like the right challenge for you, then please apply today.

Key job responsibilities
Write well-tested, maintainable code.
Design, contribute to, and maintain systems which solve customer problems.
Work with team-mates to improve code quality, system architecture and team processes.
Learn about the incident management processes supported by the team's systems to identify improvement opportunities.

A day in the life

As a Software Development Engineer on the AWS Incident Response Systems team, you will spend time each day writing code, reviewing code, creating documentation and responding to operational issues in the team's systems.

You will have conversations with technical leaders which will help you grow your career. You will get to know your customers and figure out better ways to solve their problems. You will contribute to the long-term direction of your team and of the Incident Response Systems organisation.

About the team

Our team has a wide and diverse set of backgrounds and experiences - we have engineers and managers who have been involved in Incident Response at Amazon for many years, people with a more traditional software engineering background, and a range in between.

This breadth of experience makes for a vibrant and creative team, and we collaborate to build high-quality software systems which solve our customers' problems.

  • Experience (non-internship) in professional software development
  • Experience designing or architecting new and existing systems (design patterns, reliability and scaling)
  • Experience programming with at least one software programming language
  • Bachelor's degree in computer science or equivalent
  • Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice to learn more about how we collect, use and transfer the personal data of our candidates.

Amazon is committed to a diverse and inclusive workplace.

Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

For individuals with disabilities who would like to request an accommodation, please visit
