Principal Infrastructure / Site Reliability Engineer

Remote: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

4+ years of experience with DevOps tooling, hands-on experience with cloud-based systems, strong Linux system administration skills, and proficiency in Git, Azure DevOps, Terraform, and Ansible.

Key responsibilities:

  • Develop and optimize cloud-based SAP systems
  • Utilize IaC solutions for infrastructure management
Brighttier Scaleup https://brighttier.com/
11 - 50 Employees

Job description

As a Site Reliability Engineer, you have a proven track record of supporting operational environments whilst using DevOps tooling to automate their management and improve reliability.

You will work closely with our Operations teams, Architects, and other Site Reliability and DevOps engineers to help meet the day-to-day business needs of Lemongrass’ core business, SAP on Cloud.

The team has a key responsibility for using DevOps tools and capabilities, both to aid current build activities and to support the long-term running of the applications. Examples include CI/CD pipelines, Infrastructure-as-Code, Configuration-as-Code, test automation, operational monitoring, and cost control.

Responsibilities:
  • Develop, maintain, and optimize cloud-based SAP systems for our clients, ensuring optimal performance, reliability, and efficiency.
  • Utilize industry-leading tools such as Git, Azure DevOps, Terraform, and Ansible to manage and deploy infrastructure as code (IaC) solutions.
  • Adhere to best practices for managing systems and services across various cloud environments.
  • Ensure high levels of system and infrastructure availability.
  • Work with development teams to identify and implement automation in appropriate areas.
  • Handle code deployments in all environments.
  • Maintain system standards and security in line with operating system and cloud best practices.
  • Work with other teams, especially SAP Basis, on Incident and Problem Resolution and fulfillment of Service and Change Requests.
  • Adhere to Enterprise-level Operational Procedures, such as Change Control. 
  • Lead emergency responses.
 
Qualifications:
  • Ability to lead small teams to deliver customer outcomes
  • Demonstrated ability to work collaboratively as part of globally diverse teams
  • Ability to work independently with little supervision
  • Experience working directly with clients and project teams to implement processes and technology that support various technical and business functions
  • Demonstrated involvement in projects that span the lifecycle from planning and design, through deployment, configuration, and migration
  • Minimum of 5 years of experience working in Operations environments that utilize DevOps tooling.
  • Hands-on experience with cloud-hosted operating systems such as RHEL, OEL, SUSE, and Windows.
  • Proficiency in leveraging DevOps tools such as Git, Azure DevOps, Terraform, and Ansible.
  • Strong Linux System Administration skills (preferably with a focus on SUSE or RHEL).
  • Extensive experience with Cloud Infrastructure and SAP workloads.
  • Excellent problem-solving skills and attention to detail.
  • Proficiency in ITIL procedures such as Incident Management, Problem Management and Change Management.
  • Strong communication and collaboration abilities.

Desired:
  • Ability to automate complex tasks with tools such as Terraform / Ansible / Bash / PowerShell
  • Knowledge of Agile methodologies
  • Experience working across Development and Support
  • Relevant certification in Cloud technologies, SAP, or DevOps would be advantageous
  • ITIL certification is advantageous

The selected applicant will be subject to a background investigation, which will be conducted, and its results used, in compliance with applicable law.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Collaboration
  • Communication
  • Problem Solving
