
Site Reliability Engineer

Remote: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • 5+ years in Site Reliability Engineering
  • Proficiency in monitoring and IaC tools
  • Experience with container orchestration
  • Strong understanding of incident management

Key responsibilities:

  • Continuously monitor applications for reliability
  • Resolve reliability issues in production environments
  • Oversee change management and release processes
  • Collaborate with development teams to enhance productivity
Encora Inc. (http://www.encora.com/)
5001 - 10000 Employees

Job description

Important Information:

  • Years of Experience: 5+ years of experience in Site Reliability Engineering or a related field.
  • Job Mode: Full-Time
  • Work Mode: Remote

Job Summary: The Site Reliability Engineer (SRE) at Encora will drive the resilience and scalability of our software systems by combining software engineering and operations expertise. This role focuses on building robust, reliable applications, with an emphasis on automation and continuous monitoring to achieve high performance and system reliability.

Responsibilities and Duties:

  • Application Monitoring: Continuously monitor applications using automated tools to ensure optimal reliability.
  • Emergency Response: Act promptly in production environments to resolve reliability issues, conducting thorough root cause analysis during ongoing incidents.
  • Change Management: Oversee change management and release processes, ensuring smooth deployments and production environment stability.
  • Collaboration: Partner with development teams to address system-related issues and eliminate toil through automation, enhancing team productivity.
  • Reliability and Scalability: Ensure systems are both reliable and scalable, focusing on high-performance standards and operational efficiency.

Qualifications and Skills:

  • Proficiency in modern monitoring tools, project tracking, and version management.
  • Experience with Infrastructure as Code (IaC) tools and release management tooling.
  • Familiarity with incident alert tools and container orchestration platforms.

Role-specific Requirements:

  • Proven experience managing applications in production environments.
  • Strong understanding of incident management and root cause analysis processes.
  • Ability to streamline processes, especially in change and release management.

Technologies:

  • Monitoring tools: Azure Monitor, Application Insights, Prometheus, Grafana.
  • Version control and project tracking: JIRA, SVN, GitHub.
  • Infrastructure as Code: Terraform, ARM/Bicep, Pulumi.
  • Release management: Argo CD, Harness, Octopus Deploy.
  • Incident alert tools: PagerDuty, Opsgenie.
  • Container orchestration: Kubernetes, AKS.

Skillset Competencies:

  • Problem-solving and analytical skills.
  • Strong collaboration abilities with cross-functional teams.
  • Ability to implement and improve automation processes.
  • Capacity to work effectively under pressure in high-stakes situations.

About Encora: Encora is the preferred digital engineering and modernization partner of some of the world's leading enterprises and digital-native companies. With over 9,000 experts in 47+ offices and innovation labs worldwide, Encora's technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.

At Encora, we hire professionals based solely on their skills and qualifications and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Analytical Skills
  • Calmness Under Pressure
  • Collaboration
  • Problem Solving
