Assist in establishing SRE practices for our organization including creating and maintaining new Service Level Objectives, establishing requirements for, and performing Production Readiness Reviews
Engage with engineering teams and leaders to mature our site reliability practices
Work with our Center of Excellence teams to build out a toolkit of solutions for our application teams to integrate into their processes.
Evolve problem statements into actionable items that enable engineers to deliver business value as quickly as possible, in a safe and repeatable way.
Lead projects/technical initiatives for multiple organizations
Demonstrate ownership of initiatives and drive them through to completion.Define and codify organizational standards relative to resiliency, configuration, and scalability.
Bachelor’s degree in Computer Science or related field and 10+ years' experience OR MS w/ 8+ years' experience OR 14+ years' experience of equivalent combination of industry related professional experience and education
Ability to learn new software, method and practices and bringing them to our developers
Commitment to Infrastructure as Code
Git or general version control experience
Ability to work with engineering teams and leadership to engineer requirements
Working in a continuous integration, testing, and delivery SDLC employing automation
Eager to dig into problems and bring proposed solutions to group discussion
Open to feedback and able to creatively adapt multiple ideas into a solution
Minimum 6+ years' experience with DevOps engineering or SRE
Experience with containers and serverless and how to deploy and operate both
Minimum 6+ years' experience with monitoring and observabilityMinimum 6+ years' experience with configuration management