Sr. Site Reliability Engineer

Spartan Technologies, Inc. - Irvine, CA

Job Requirements:

Include on Resume: Citizenship Status, Current Location

Include in the Notes Section: Expected FTE Salary for Direct Hire

Are you ok with H1B's? Yes

Do candidates need to be local to interview?

No, but strongly prefer candidates in Irvine or Atlanta area.

If not, will they be required onsite when Cox returns? See above.

Target Years of Exp: 5+, job is posted at 8+, but can make exceptions for good experience.

Top 5 Must Haves:

Specific mention of DevOps and/or SRE experience, strong coding/scripting ability, expert-level AWS experience, terraform/cloud formation/ansible/similar.


At Cox, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. As we expand our customer deployments, we are currently seeking an experienced SRE to deliver insights from massive scale data in real time. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Objectives of this Role:

Run the production environment by monitoring availability and taking a holistic view of system health

Build software and systems to manage platform infrastructure and applications

Improve reliability, quality, and time-to-market of our suite of software solutions

A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve

Provide primary operational support and engineering for multiple large distributed software applications

Daily and Monthly Responsibilities:

Balance feature development speed and reliability with well-defined service level objectives

Create sustainable systems and services through automation and uplifts

Partner with development teams to improve services through rigorous testing and release procedures

Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

Participate in system design consulting, platform management, and capacity planning


Bachelor’s Degree; preferably in Computer Science or Computer Engineering and 5 years’ experience OR 9 years relevant experience.

5+ years experience working as a site reliability engineer

Hands-on experience with various software languages: Python, C++, .NET, BASH

Working knowledge of cloud computing Services: AWS preferred


AWS Certification in Architecture and-or DevOps and-or SysOps

Experience with Infrastructure tooling: Terraform(must have), Cloud Formation, Powershell

CICD Tools: Jenkins, CodeBuild – Must Haves, GitFlow

Source Control: GitHub Preferred

Posted On: Thursday, October 7, 2021

Apply to this job
  • *