Senior Site Reliability Engineer

fully flexible
Remote: Full Remote

Offer summary

Qualifications:

  • 5+ years of experience as a Senior Site Reliability Engineer preferred.
  • Expert knowledge of incident response, observability, and reliability tools in a cloud-native environment.
  • Experience with cloud design patterns and developing application stacks on cloud services, particularly Python and TypeScript.
  • Eagerness to share ideas and openness to collaboration.

Key responsibilities:

  • Build, deploy, and maintain observability platforms for metrics gathering.
  • Lead software and system design initiatives using cloud-native design patterns.
  • Partner with teams to improve incident response processes and write effective post-mortems.
  • Automate team and business processes to reduce toil and improve productivity.

BenchSci Scaleup https://www.benchsci.com/
201 - 500 Employees

Job description

We are looking for a Senior Site Reliability Engineer to join the Site Reliability Engineering team in our growing Platform Infrastructure group! Reporting to the Engineering Manager - Infrastructure, you'll apply your technical and domain expertise to solve complex technical and business challenges; respond to and assist with production incidents in collaboration with product teams; participate in design discussions, code reviews, and project-related team meetings; and work with other engineers to develop innovative solutions that meet business needs for functionality, performance, observability, scalability, and reliability.

You Will:
  • Build, deploy, and maintain observability platforms to enable teams to self-serve their metrics gathering and dashboarding needs
  • Lead software and system design initiatives by leveraging cloud-native design patterns and injecting your cloud expertise into the entire development lifecycle
  • Partner with other teams to iterate on and improve BenchSci’s Incident Response processes
  • Help other teams respond to, mitigate, and remediate production incidents
  • Help other teams write effective post-mortems and improve our reliability culture and processes
  • Work with your team, Staff Engineers, and Engineering Managers to help promote SRE best practices
  • Help reduce toil and improve developer productivity by automating our team and business processes
  • Partner with engineering and product stakeholders and other cross-functional teams to devise and refine requirements
  • Communicate cross-cutting decisions to all potentially impacted teams

You Have:
  • 5+ years of experience working as a Senior Site Reliability Engineer preferred
  • Expert knowledge of incident response, observability, and reliability tools and techniques in a cloud-native environment (Google Cloud is preferred, but AWS experience is also valuable)
  • Experience with cloud design patterns (Google Cloud is considered an asset) and developing specialized application stacks on cloud services (Python backend, TypeScript frontend)
  • Experience working in Python and JavaScript/TypeScript codebases
  • Eagerness to share your own ideas, and openness to those of others

Benefits and Perks:
  • An engaging remote-first culture
  • A great compensation package that includes BenchSci equity options
  • A robust vacation policy plus an additional vacation day every year
  • Company closures for 14 more days throughout the year
  • Flex time for sick days, personal days, and religious holidays
  • Comprehensive health and dental benefits
  • Annual learning & development budget
  • A one-time home office set-up budget to use upon joining BenchSci
  • An annual lifestyle spending account allowance
  • Generous parental leave benefits with a top-up plan or paid time off options
  • The ability to save for your retirement coupled with a company match!

About BenchSci:
BenchSci's mission is to exponentially increase the speed and quality of life-saving research and development. We empower scientists to run more successful experiments with the world's most advanced biomedical artificial intelligence software platform.
Backed by Generation Investment Management, TCV, Inovia, F-Prime, Golden Ventures, and Google's AI fund, Gradient Ventures, we provide an indispensable tool for scientists that accelerates research at 16 of the top 20 pharmaceutical companies and over 4,300 leading academic centers. We're a certified Great Place to Work® and a top-ranked company on Glassdoor.

Our Culture:
BenchSci relentlessly builds on its strong foundation of culture. We put team members first, knowing that they're the organization's beating heart. We invest as much in our people as in our products. Our culture fosters transparency, collaboration, and continuous learning.
We value each other's differences and always look for opportunities to embed equity into the fabric of our work. We foster diversity, autonomy, and personal growth, and provide resources to support motivated self-leaders in continuous improvement.
You will work with high-impact, highly skilled, and intelligent experts motivated to drive impact and fulfill a meaningful mission. We empower you to unleash your full potential, do your best work, and thrive. Here you will be challenged to stretch yourself to achieve the seemingly impossible. Learn more about our culture.

Diversity, Equity and Inclusion: We're committed to creating an inclusive environment where people from all backgrounds can thrive. We believe that improving diversity, equity and inclusion is our collective responsibility, and this belief guides our DEI journey. Learn more about our DEI initiatives.

Accessibility Accommodations: Should you require any accommodation, we will work with you to meet your needs. Please reach out to talent@benchsci.com.

Required profile

Experience

Spoken language(s):
English

Other Skills

  • Collaboration
  • Communication
  • Problem Solving