
Incident Commander

EXTRA HOLIDAYS - EXTRA PARENTAL LEAVE
Remote: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

  • Experience in an incident management role
  • Understanding of containerization, such as Docker and Kubernetes
  • Experience with AWS, GCP, or on-premise environments
  • Bachelor’s degree in Computer Science or Engineering
  • Familiarity with Postgres, MySQL, and Elasticsearch is a plus

Key responsibilities:

  • Manage all SRE incidents and classifications
  • Enhance collaboration during real-time incident management
  • Develop and improve Practices and Frameworks
  • Lead communication with stakeholders regarding incidents
  • Drive continuous service improvements and initiatives
theScore Leisure & Entertainment SME https://scoremediaandgaming.com/
501 - 1000 Employees

Job description


Your missions

theScore, a wholly-owned subsidiary of PENN Entertainment, empowers millions of sports fans through its digital media and sports betting products. Its media app ‘theScore’ is one of the most popular in North America, delivering fans highly personalized live scores, news, stats, and betting information from their favorite teams, leagues, and players. theScore’s sports betting app ‘theScore Bet Sportsbook & Casino’ delivers an immersive and holistic mobile sports betting and iCasino experience. theScore Bet is currently live in the Company's home province of Ontario. theScore also creates and distributes innovative digital content through its web, social and esports platforms.

About the Role & Team
As part of the theScore team, you will be working with a team of smart, friendly, and dedicated Engineers, Product Managers and Designers determined to deliver some of the best apps the market has to offer. We want you to be challenged and to get the full experience of what it’s like to work at theScore! We are looking for an Incident Commander to join our site reliability team, work cross-functionally across engineering, be the front line for incidents, and work with Release Engineering to help prevent new events.

This is a management position responsible for all SRE incidents, which includes P1, P2, P3 and P4. The role covers classifying and documenting all incidents, and supporting, assisting, and driving each incident through investigation, hierarchical and technical escalation, diagnosis, recovery, and root cause analysis. It also involves driving improvements to our service delivery and release processes based on disruption reports.

About the work

  • Drive and enhance collaboration with other Command Support members and Commanders, Customer Support, Application teams, the Release Engineering leader, and cross-functional teams to lead real-time incident management.
  • Provide leadership in developing Practices, Frameworks, Process Flows, Templates, and Process Guides.
  • Continuously improve and enhance the internal framework, methodology, processes, and tools.
  • Develop and maintain key practice capabilities.
  • Collaborate with SRE teams and Infrastructure teams to identify requirements.
  • Recommend innovative solutions that enable the organization to deliver on its objectives and goals.
  • Promote opportunities for Continuous Service Improvements.
  • Lead SRE communications to stakeholders via email, Slack, and Microsoft Teams in a timely manner.
  • Lead initiatives to promote Jira Release Ticket management, quality, and alignment with Incident Management communication in support of SLAs.
  • Other duties as required.

About You

  • Experience in a similar role or incident management role.
  • Experience with and understanding of containerization (Docker and Kubernetes preferred).
  • Comfortable working within Linux environments.
  • Experience working with AWS, GCP, and/or on-premise environments.
  • Ability to work independently and learn quickly with little supervision.
  • Ability to handle multiple projects simultaneously.
  • Willingness to drop everything and take on an ad-hoc task.
  • You’re the type of individual who is extremely tech-savvy and passionate about learning new technologies and tools.
  • A bachelor’s degree in computer science or engineering, and/or similar experience.
  • Nice to have: Postgres, MySQL, Elasticsearch, Kafka, Redis, Helmfile, Terragrunt, Prometheus, and any web programming.

What We Offer

  • Competitive compensation package.
  • Fun, relaxed work environment.
  • Education and conference reimbursements.
  • Parental leave top-up.
  • Opportunities for career progression and mentoring others.

Candidates residing in Ontario requiring special accommodation can email accessibilityoffice@thescore.com

theScore is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability or age.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry: Leisure & Entertainment

Soft Skills

  • Collaboration
  • Leadership
  • Communication
