Lead Platform Engineer

Remote: Full Remote

Offer summary

Qualifications:

  • 5-7+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering roles with AWS expertise.
  • Strong proficiency in Infrastructure as Code, particularly with Terraform.
  • Experience with CI/CD pipelines and monitoring tools like Datadog and SonarQube.
  • Bachelor's degree in Computer Science, Engineering, or a related technical field.

Key responsibilities:

  • Design, implement, and manage scalable cloud infrastructure on AWS using Infrastructure as Code principles.
  • Develop and optimize CI/CD pipelines for applications and infrastructure.
  • Implement monitoring and alerting systems to ensure system health and performance.
  • Provide technical leadership and mentorship on DevOps and cloud architecture best practices.

finally Fintech: Finance + Technology Scaleup https://www.finally.com/
51 - 200 Employees

Job description

About finally

finally is one of America’s fastest-growing fintech companies, focused on being the premier financial automation platform for SMBs. Our product suite integrates Credit & Banking, Billing & Invoicing, Bookkeeping, and Taxes, tied together by artificial intelligence to support small and medium-sized businesses. finally aims to declutter financial operations so that businesses can focus on what truly matters – their growth.

We’re headquartered in sunny South Florida, and we raised $200 million in 2024 alone to bolster our growth, innovate, and continue serving our customers. Our company has more than 250 people today across 3 offices. We’re proud to serve as the official corporate card and spend management platform for iconic sports franchises like the Florida Panthers, Miami Heat, and Chicago Bulls.

The Opportunity

We are seeking an experienced Lead Platform Engineer / Senior DevOps Engineer to take ownership of our AWS cloud infrastructure and champion DevOps best practices across finally.com. You will be responsible for designing, building, automating, and maintaining our critical platform services, ensuring high availability, performance, and security. In this key role, you will apply your expertise in Infrastructure as Code (Terraform), CI/CD (Argo CD), monitoring (Datadog), and code quality (SonarQube) to empower our development teams and support our rapidly scaling applications and new data initiatives.

What You'll Do

  • Design, implement, and manage scalable, secure, and resilient cloud infrastructure on AWS using Infrastructure as Code principles, primarily with Terraform.

  • Develop, maintain, and optimize CI/CD pipelines for our applications (Python, React, TypeScript, Postgres) and infrastructure, using tools like Argo CD, GitHub Actions, or Jenkins.

  • Implement and manage comprehensive monitoring, logging, and alerting systems using Datadog to ensure system health, performance, and proactive issue resolution.

  • Integrate and manage code quality and security scanning tools like SonarQube within our development lifecycle.

  • Automate infrastructure provisioning, configuration management, software deployments, and operational tasks—exploring AI-driven approaches where applicable—to improve efficiency and reliability.

  • Collaborate closely with software engineering and data engineering teams to streamline deployment processes, leveraging modern tooling and intelligent automation (including AI-assisted technologies) to optimize application performance, enhance developer productivity, and ensure infrastructure meets their evolving needs.

  • Champion and enforce security best practices across the AWS environment, including IAM, network security, and vulnerability management.

  • Provide technical leadership, mentorship, and guidance on DevOps, cloud architecture, and automation best practices to the engineering organization.

  • Develop and maintain clear documentation for infrastructure design, configurations, processes, and incident response playbooks.

  • Participate in on-call rotation and respond to production incidents as needed.

What You'll Need

  • 5-7+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering roles, with significant hands-on experience with AWS.

  • Strong proficiency in Infrastructure as Code, particularly with Terraform.

  • Experience designing, building, and managing CI/CD pipelines (e.g., Argo CD, GitHub Actions, Jenkins, GitLab CI).

  • Hands-on experience with containerization technologies (Docker) and orchestration (Kubernetes is a plus, though it is not explicitly part of the current stack).

  • Expertise with monitoring and observability tools, preferably Datadog.

  • Experience with code quality tools like SonarQube.

  • Proficiency in scripting languages such as Python or Bash.

  • Deep understanding of AWS services (EC2, S3, RDS, VPC, IAM, Route 53, etc.) and cloud architecture best practices.

  • Solid understanding of networking, security principles, and best practices in a cloud environment.

  • Excellent troubleshooting and problem-solving skills.

  • Strong communication and collaboration skills, with experience working effectively with development teams.

  • Bachelor's degree in Computer Science, Engineering, or a related technical field.

Bonus Points

  • Familiarity with and enthusiasm for applying AI-powered tools and techniques to enhance developer productivity, CI/CD processes, or operational efficiency.

  • Experience managing Postgres databases, whether on RDS or self-managed.

  • Knowledge of GitOps principles and tools.

  • Experience in a fast-paced startup or FinTech environment.

  • Relevant AWS certifications (e.g., AWS Certified DevOps Engineer, Solutions Architect).

Tech Stack You'll Work With

  • Cloud: AWS (EC2, S3, RDS, VPC, IAM, etc.)

  • Infrastructure as Code: Terraform

  • CI/CD: Argo CD, GitHub Actions (or similar), SonarQube

  • Monitoring & Logging: Datadog

  • Containerization: Docker (Kubernetes experience beneficial)

  • Applications: Python, React, TypeScript, Postgres

  • Scripting: Python, Bash

Benefits

  • Health insurance

  • Dental insurance

  • Employee stock purchase plan

  • Paid time off

  • Paid training

  • Vision insurance

Required profile

Experience

Industry: Fintech (Finance + Technology)
Spoken language(s):
English

Other Skills

  • Troubleshooting (Problem Solving)
  • Collaboration
  • Communication
