Match score not available

Senior DevOps Engineer - AWS

extra holidays - fully flexible
Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor’s degree in computer science or related field, Experience in Splunk and SignalFx, Background in Amazon Web Services including RDS, Proven experience in managing large production platforms, Strong coding ability in Python or similar languages.

Key responsabilities:

  • Plan and manage all aspects of the production environment
  • Define strategies for observability and identify areas for improvement
  • Respond to incidents and implement platform improvements
  • Maintain service health and scale systems sustainably through automation
  • Support service launch activities and manage data governance processes
3Pillar logo
3Pillar Large http://www.3PillarGlobal.com/
1001 - 5000 Employees
See more 3Pillar offers

Job description

🚀 Join Our Mission at 3Pillar: Elevate Your Impact! 🚀

As a Senior DevOps Engineer, you are responsible for ensuring that our platform is stable and healthy. We break down barriers to run our products by fostering developer-run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software-run principles that include operational design, automation, capacity planning, and monitoring that leads to fault-tolerant, scalable products.


Desired Capabilities:
  • Strong attention to detail
  • Excellent communication skills
  • Ability to work well in a team
  • Analytical and problem-solving skills
  • Time management and organizational skills
  • Ability to learn quickly
  • Adaptability and flexibility
  • Proven ability to lead and mentor junior members of the QA team.


  • Key Responsibilities:
  • Plan, manage, and oversee all aspects of the production environment for all merchant loyalty use cases
  • Define strategies for all facets of observability
  • Identify areas of improvement in production
  • Ability to understand MTTR, SLO, SLI definitions and apply them to services.
  • Respond to Incidents and improvise platform based on feedback and measure the reduction of incidents over time.
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective services and infrastructure.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Practice sustainable incident response and blameless postmortems.
  • Ensures that batch production scheduling and process are accurate and timely.
  • Able to create and execute queries to big data platforms and relational data tables to identify process issues or to perform mass updates, preferred.
  • Ability to isolate problems between hardware and software.
  • Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns
  • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Work with a global team spread across tech hubs in multiple geographies and time zones


  • Minimum Qualifications:
  • Bachelor’s degree in computer science, software engineering, or a similar field.
  • Experience in Splunk and SignalFx
  • Experience with Amazon Web Services including RDS
  • Relevant data DevOps, SRE, or general systems engineering experience.
  • Experience in managing large production platforms.
  • Experience architecting and implementing data governance processes and tooling (data catalogues, lineage tools, role-based access control, PII handling)
  • Strong coding ability in Python or other languages like Java, C#, Golang, C, C++, Perl Ruby etc.


  • Additional Experience Desired:
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Ability to help debug and optimize code and automate routine tasks.
  • Ability to support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems.
  • Appetite for change and pushing the boundaries of what can be done with automation.
  • Experience in working across development, operations, and product teams to prioritize needs and build relationships is a must.
  • Good Handle on Change Management and Release Management aspects of Software.


  • What is it like working for 3Pillar Global?
  • At 3Pillar, we offer a world of opportunity:

  • Imagine a flexible work environment – whether it's the office, your home, or a blend of both. From interviews to onboarding, we embody a remote-first approach. 
  • You will be part of a global team, learning from top talent around the world and across cultures, speaking English every day. Our global workforce enables our team to leverage global resources to accomplish our work in efficient and effective teams. 
  • We’re big on your well-being – as a company, we spend a whole trimester in our annual cycle focused on well-being. Whether it is taking advantage of fitness offerings, mental health plans (country-dependent), or simply leveraging generous time off, we want all of our team members to operate at their best.
  • Our professional services model enables us to accelerate career growth and development opportunities - across projects, offerings, and industries.
  • We are an equal-opportunity employer. It goes without saying that we live by values like Intrinsic Dignity and Open Collaboration to create cutting-edge technology AND reinforce our commitment to diversity - globally and locally. 
  • Required profile

    Experience

    Level of experience: Senior (5-10 years)
    Industry :
    Spoken language(s):
    EnglishEnglish
    Check out the description to know which languages are mandatory.

    Other Skills

    • Teamwork
    • Analytical Thinking
    • Problem Solving
    • Time Management
    • Organizational Skills
    • Mentorship
    • Verbal Communication Skills
    • Adaptability

    Cloud DevOps Engineer Related jobs