Match score not available

Senior Site Reliability Engineer

Remote: 
Full Remote
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

4+ years experience in infrastructure engineering, focusing on Cloud Infrastructure, Proficiency in Terraform, IaC, and programming languages like Python, Go or Java, Familiarity with network architecture, CI/CD pipelines, monitoring systems, and incident management, Strong troubleshooting and disaster recovery skills with the ability to optimize system performance, Excellent communication and mentoring skills.

Key responsabilities:

  • Design, develop and operate reliable infrastructure for online services
  • Automate deployment processes and develop systems for reliability and scalability optimization
  • Lead Incident Management, troubleshooting, and maintenance of live issues
  • Create tooling, data sources, monitoring dashboards, and alerting for online services
  • Stay updated on SRE technologies and industry trends
People Can Fly Studio logo
People Can Fly Studio Gaming SME https://www.peoplecanfly.com/
501 - 1000 Employees
See more People Can Fly Studio offers

Job description

Company Description

People Can Fly is one of the leading independent AAA games development studios with an international team of hundreds of talented individuals working from offices located in Poland, UK, Ireland, US, and Canada and from all over the world thanks to our remote work programs.

Founded in 2002, we made our mark on the shooter genre with titles such as Painkiller, Bulletstorm, Gears of War: Judgment, and Outriders. We are one of the most experienced Unreal Engine studios in the industry and we are expanding it with in-house solutions called PCF Framework.

Our creative teams are currently working on several exciting titles: Gemini is our new project being developed with Square Enix; Maverick is a Triple-A game developed in collaboration with Microsoft Corporation; Bifrost & Victoria are projects we're growing in the self-publishing model. We are also busy working on a VR and undisclosed projects, more information on those to come later.

With over 20 years of experience, PCF sets out to explore new horizons. We aim to combine our expertise with the creativity of the best and most forward-thinking talents in the industry to work together on the new generation of action games for the global gaming community.

If you decide to accompany us on this journey, you’ll have a chance to perfect your craft and expand your knowledge, working alongside leaders in the industry to bring a brand-new unique experience to the players worldwide.

Job Description
  • Design, develop, deploy and operate reliable and scalable infrastructure for the online services platform
  • Collaborate with cross-functional teams to translate business requirements into technical solutions, balancing user needs with technical constraints.
  • Automate deployment of the online services platform to cloud providers, including provisioning for various stages like development, testing, and external publishers.
  • Develop and implement systems to maximise reliability, scalability, and uptime while also optimising for cost,
  • Design and develop systems and tooling that support efficient maintenance, updates, and recovery
  • Create tooling, data sources, monitoring dashboards, and alerting for all online services products, with a particular focus on real time service health
  • Lead Incident Management of live issues, as well as troubleshooting, break-fix and resolution of those issues
  • Create, review and maintain essential operational documentation such as run books, post-mortem reports, and root cause analysis 
  • Assist leads with recruiting, onboarding, development and mentorship of engineers.
  • Stay updated on emerging SRE technologies and industry trends, evaluating their potential impact on our development processes and strategies.

Qualifications
  • 4+ years of extensive experience in infrastructure engineering, with a specific focus on Cloud Infrastructure
  • Strong knowledge of, and experience with, writing and optimising Terraform.
  • Strong knowledge of, and experience with Infrastructure-as-Code (IaC) and related best practices
  • Strong in at least one programming language (Python, Go, Kotlin, Java or similar) as well as with scripting and automation in general
  • Good grasp of network architecture and security  best practices.
  • Familiarity with CI/CD pipelines and tools like Github Actions, Jenkins
  • Proficient with Source Control and Code Review tools (Git/Github, Perforce/Swarm etc.).
  • Experience setting up monitoring and alerting systems
  • Experience with Incident Management and troubleshooting live issues
  • Ability to analyse and improve system performance, strong troubleshooting skills across various technology layers.
  • Knowledge in designing and implementing disaster recovery strategies.
  • Strong mentoring skills.
  • Strong verbal and written communication skills in English.

Nice to have:

  • Experience in the Video Games Industry
  • Unreal Engine knowledge (C++ in particular)
  • Experience in content distribution, ad-tech, news, mobile gaming, or finance domains
  • Additional language proficiency
  • Additional project management and bug tracking software knowledge

Additional Information

What we offer:

  • Private medical healthcare including dental treatment for PCF members and their families (Signal Iduna).
  • MultiSport card for you and your family members or friends.
  • Free library with a wide range of games and books you have unlimited access to.
  • In-company Polish and English language classes.
  • Fresh fruit, snacks, and beverages for everyone in the office.
  • Flexible working hours.
  • Free virtual health and mental wellbeing sessions are included in the plan for members and their dependents.
  • Personal development opportunities and ability to work in a global environment.
  • Work in a creative team with people full of passion for what they do.

We are committed to an inclusive and diverse work culture. PCF is an equal opportunity employer. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, genetic information, marital status or any legally protected status.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Gaming
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Mentorship
  • Verbal Communication Skills

Site Reliability Engineer (SRE) Related jobs