SRE Engineer

Remote: 
Full Remote
Contract: 
Work from: 
Israel

Offer summary

Qualifications:

Minimum 5 years of experience with large-scale distributed systems., Hands-on experience with infrastructure services like caching, message queues, and load balancers., Proficiency in monitoring tools such as Prometheus, Grafana, or similar., Experience with containerization (Docker) and orchestration (Kubernetes), and cloud platforms..

Key responsibilities:

  • Ensure systems meet uptime and performance SLAs and SLOs.
  • Participate in on-call rotations, post-mortems, and root cause analysis.
  • Implement redundancy, failover, and high availability strategies.
  • Collaborate on building and improving CI/CD pipelines.

DriveNets logo
DriveNets Scaleup https://www.drivenets.com/
201 - 500 Employees
See all jobs

Job description

Description

Key Responsibilities:

·        Ensure critical systems meet uptime and performance SLAs (Service Level Agreements) and SLOs (Service Level Objectives)

·        Participate in on-call rotations, lead post-mortems, and drive root cause analysis

·        Implement redundancy, failover, and high availability strategies to keep services running smoothly.

·        Build and maintain robust monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, Datadog)

·        Ensure the security of infrastructure and pipelines by implementing best practices for access control, encryption, and vulnerability management.

·        Collaborate with DevOps/Dev teams to build, maintain, and improve CI/CD pipelines

·        Have fun with a great team while tackling hard challenges.


Requirements

·        5 years of experience designing, deploying, maintaining, and troubleshooting large-scale distributed systems.

·        Hands-on experience with infrastructure services such as caching systems, message queues, distributed storage, and load balancers.

·        Proven experience in building and maintaining monitoring solutions using tools like Prometheus, Grafana, or equivalent platforms.

·        5 years of hands-on experience with containerization technologies like Docker and orchestration tools like Kubernetes.

·        At least 3 years of experience working with cloud platforms

·        Understanding of network security principles (e.g., segmentation, firewalls, VPNs, zero trust)

·        Familiarity with securing cloud resources: encryption, security groups, secrets management, etc

·        Cloud certifications – Advantage

·        Bachelor's degree (Computer Science, Computer Engineering, Data science) - Advantage


Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Problem Solving

Related jobs