Site Reliability Engineer
Boston, MA (or remote, with possibly 70% travel to Boston during the first 2 months)
Focus on scale, automation, and high-availability platforms for our members around the globe
Introduce modern cloud best practices to the redesign of complex, mission-critical platforms
On behalf of our client, we are searching for a well-seasoned Site Reliability Engineer who knows how to lead a team to success while still enjoying being an individual contributor. The ask is to quickly modernize and automate the way we build and deploy software in support of operations around the globe.
To bring modern cloud best practices to the team, we are looking for an experienced Site Reliability Engineer to lead this undertaking. You’ll join a group of diverse, passionate engineers from all manner of backgrounds, responsible for the development, security, and operations of our global runtime platform. Together, we will tackle critical problems that impact everyone.
As a Site Reliability Engineer you will:
Lead engineering teams to design, build, and deploy automation software needed to improve the availability, scalability, and efficiency of the global platform
Maintain a comprehensive and holistic system view while addressing risks and concerns regarding infrastructure reliability to ensure mission success
Build solutions that ensure the client is supported in unique environments and that meet specific stakeholder needs
Work on cutting-edge security solutions for critical infrastructure with the largest attack surfaces you’ll find anywhere
Automate processes that have never before been automated, both for technology delivery and instrumentation workflows that span classified and unclassified networks
And we think you’d be the perfect addition to our team if you:
Have deployed scalable containerized applications (Docker/Kubernetes, AWS ECS, Mesos)
Have experience with Java or other JVM-based languages
Have worked with multiple cloud providers (AWS, Azure, GCP, OpenStack, etc.)
Bring hands-on experience with PCF (Pivotal Cloud Foundry) and/or OpenShift, and CI/CD technologies (Jenkins, TravisCI, CircleCI, or Pivotal Concourse)
Are passionate about automation, and believe nothing should ever be done manually twice
Bonus points: Hands-on experience with AWS GovCloud
As a Site Reliability Engineer, you will join a passionate group of industry and technology leaders solving real-world problems in a truly mission-critical, flexible environment that encourages you to take risks. You’ll be welcomed into an intelligent and diverse group that’s bringing modern software development methodologies to a realm that impacts everybody, every day.