Staff Site Reliability Engineer - remote

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

B.S. in Computer Science or equivalent experience, Minimum 4 years managing AWS infrastructure, At least 7 years in a senior or technical lead role in site reliability or systems engineering, Expertise in containerization services and observability tooling..

Key responsibilities:

  • Design and implement AWS infrastructure components like VPCs and EC2.
  • Lead architecture and automation of cloud-based infrastructure management.
  • Provide guidance on reliability and performance of SaaS environments.
  • Architect monitoring and alerting systems using tools like Datadog and CloudWatch.

CyberArk logo
CyberArk
1001 - 5000 Employees
See all jobs

Job description

Company Description

About CyberArk:
CyberArk (NASDAQ: CYBR), is the global leader in Identity Security. Centered on privileged access management, CyberArk provides the most comprehensive security offering for any identity – human or machine – across business applications, distributed workforces, hybrid cloud workloads and throughout the DevOps lifecycle. The world’s leading organizations trust CyberArk to help secure their most critical assets. To learn more about CyberArk, visit our CyberArk blogs or follow us on X, LinkedIn or Facebook.

Job Description

CyberArk is seeking a Staff Site Reliability Engineer looking to bring their knowledge, excitement, and energy to the team. If you have worked in the cloud solving scale problems, bringing visibility into your platform and accomplishing true CI/CD pipelines we want you on the team! Driven and excited to innovate is what we need all while allowing you to grow professionally and creating strong relationships that will last a lifetime. 

Responsibilities: 

  • Design Implementation of AWS infrastructure components such as VPCs, EC2, EKS, S3, tagging schemes, CloudFormation, etc. 
  • Lead architecture, designs and feature analysis of deployment and management automation of cloud-based infrastructure and software 
  • Provide guidance to Site Reliability and DevOps Engineers on managing the reliability and performance of SaaS environments as well as on building automation to prevent problem reoccurrence 
  • Architecting and guiding the team with the use of configuration management tools in both Windows and Linux - CloudFormation, Helm, Terraform, Salt, Ansible 
  • Ensuring cloud-based architectures meet availability and recoverability requirements 
  • Architecture and implementation of cloud-based monitoring, alerting and reporting – Datadog, Logz.io, CloudWatch, Catchpoint, ELK,  
  • Support and guidance on tooling that helps to enable teams for greater output and reliability. 
  • Deep understanding of the latest tech solutions, trends, and ability to dive into the details of the architecture as needed. 
  • Work with the Team Leads within the group to identify areas of improvement, prepare architecture road maps, and advocate to the Product Management group. 

#LI-HA1

Qualifications
  • B.S. in Computer Science or equivalent experience 
  • Minimum 4 years of experience managing AWS infrastructure 
  • Minimum of 7 years in a senior, architect or a technical lead role of site reliability, systems engineering or software development 
  • A deep understanding of Site Reliability, infrastructure and Cloud Platform 
  • Expert understanding/experience of containerization services such as Docker/Kubernetes 
  • Expert in observability tooling such as Datadog, NewRelic, Logstash, Elasticsearch 
  • Solid understanding/experience of web services, databases and relating infrastructure/architectures 
  • Solid understanding of backup/restore best practices 
  • Strong level of expertise programming writing configuration management languages 
  • Strong level of expertise programming in Python / Java or equivalent language 
  • Excellent Troubleshooting Skills 
  • Experience supporting an enterprise-level SaaS environment 
  • Security Experience a plus 
  • Experience with AI/ML models to improve system performance and reliability a plus. 

 

Additional Information

CyberArk is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, creed, sex, sexual orientation, gender identity, national origin, disability, or protected Veteran status. 

We are unable to sponsor or take over sponsorship of employment Visa at this time.

The salary range for this position is $141,000 – $176,000/year, plus commissions or discretionary bonus, which will be based on the employee’s performance. Base pay may also vary considerably depending on job-related knowledge, skills, and experience. The compensation package includes a wide range of medical, dental, vision, financial, and other benefits. 
 

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Troubleshooting (Problem Solving)

Site Reliability Engineer (SRE) Related jobs