Sr. Site Reliability Engineer

Spartan Technologies, Inc. - New Hyde Park, NY

Job Description

Overview:

Run the production environment by monitoring availability and taking a holistic view of system health

Build software and systems to manage platform infrastructure and applications

Improve reliability, quality, and time-to-market of our suite of software solutions

A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve

Provide primary operational support and engineering for multiple large, distributed software applications

Responsibilities:

Balance feature development speed and reliability with well-defined service level objectives

Create sustainable systems and services through automation and uplifts

Partner with development teams to improve services through rigorous testing and release procedures

Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

Participate in system design consulting, platform management, and capacity planning

Minimum Qualifications:

Bachelor’s Degree; preferably in Computer Science or Computer Engineering and 5 years’ experience OR 9 years relevant experience.

5+ years’ experience working as a site reliability engineer

Hands-on experience with various software languages: Python, C++, .NET, BASH

Working knowledge of cloud computing Services: AWS preferred

Preferred Qualifications:

AWS Certification in Architecture and-or DevOps and-or SysOps

Experience with Infrastructure tooling: Terraform (must have), Cloud Formation, Powershell

CICD Tools: Jenkins, CodeBuild – Must Haves, GitFlow

Source Control: GitHub Preferred



Posted On: Tuesday, August 31, 2021



Apply to this job
  • *
  • *