Lead Site Reliability Engineer

extra holidays - extra parental leave
Remote: Full Remote

Offer summary

Qualifications:

  • Bachelor's degree in a quantitative or business field such as statistics, mathematics, engineering, or computer science.
  • 5-7 years of experience in site reliability engineering, particularly with Microsoft 365.
  • Advanced proficiency in PowerShell scripting and Graph APIs, with intermediate skills in Power Apps/Automate.
  • Strong understanding of incident management processes and tools.

Key responsibilities:

  • Develop and create monitoring and observability dashboards within Splunk, Dynatrace, and other platforms.
  • Lead projects focused on building and maintaining observability and monitoring for applications.
  • Conduct post-incident reviews and document findings for future decision-making.
  • Coach and mentor the team, while training them on new systems and best practices.

Centene Corporation XLarge https://www.centene.com/
10001 Employees

Job description

You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world.  As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility.
 

Position Purpose:
We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment.

  • Leads the team in identifying problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents
  • Leads end-to-end projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility
  • Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools
  • Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization
  • Leads post-incident reviews and documents findings to inform future decision-making
  • Drives implementation of approved proposals that optimize the Software Development Life Cycle (SDLC) and boost service reliability
  • Leads functional and development teams in investigating and documenting issues, and leads the internal team in developing solutions to mitigate them
  • Leads root-cause and problem-solving initiatives
  • Understands and adapts new technologies, tools, methods, and processes from Microsoft and the industry
  • Coaches and mentors the team; designs and implements key performance indicators
  • Contributes to engineering and organization success by welcoming related, different, and new requests; helping others accomplish job results
  • Trains the engineering team on new systems, protocols, and best practices
  • Drives and coaches others through reviews of design, code, and test cases
  • Performs other duties as assigned
  • Complies with all policies and standards

Education/Experience:
A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science).
Requires 5 – 7 years of related experience.

Or equivalent experience acquired through accomplishments of applicable knowledge, duties, scope and skill reflective of the level of this position.

Technical Skills:

  • 5-7 or more years of experience in site reliability engineering, with a focus on Microsoft 365.
  • Microsoft 365: In-depth knowledge of Microsoft 365 services, architecture, and administration.
  • PowerShell Scripting: Advanced skills in writing and debugging scripts for automation and administration tasks.
  • Graph APIs: Advanced proficiency in utilizing Graph APIs for integration and automation.
  • Power Apps/Automate: Intermediate skills in creating and managing workflows and applications.
  • Monitoring and Observability: Experience in developing and creating dashboards in Splunk, Dynatrace, and other monitoring platforms.
  • Incident Management: Strong understanding of incident management processes and tools.


Soft Skills:

  • Intermediate - Seeks to acquire knowledge in area of specialty
  • Intermediate - Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions
  • Intermediate - Ability to work independently
  • Intermediate - Demonstrated analytical skills
  • Intermediate - Demonstrated project management skills
  • Intermediate - Demonstrates a high level of accuracy, even under pressure
  • Intermediate - Demonstrates excellent judgment and decision making skills
  • Intermediate - Ability to communicate and make recommendations to upper management
  • Intermediate - Ability to drive multiple projects to successful completion
  • Intermediate - Possesses technical aptitude

Pay Range: $100,900.00 - $186,800.00 per year

Centene offers a comprehensive benefits package including: competitive pay, health insurance, 401(k) and stock purchase plans, tuition reimbursement, paid time off plus holidays, and a flexible approach to work with remote, hybrid, field, or office work schedules. Actual pay will be adjusted based on an individual's skills, experience, education, and other job-related factors permitted by law. Total compensation may also include additional forms of incentives.

Centene is an equal opportunity employer that is committed to diversity, and values the ways in which we are different. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other characteristic protected by applicable law.


Qualified applicants with arrest or conviction records will be considered in accordance with the LA County Ordinance and the California Fair Chance Act.

Required profile

Spoken language(s):
English

Other Skills

  • Decision Making
  • Communication
  • Analytical Skills
