Senior Systems Engineer - Autonomous Vehicle Infrastructure

Remote: 
Full Remote

Offer summary

Qualifications:

  • BS/MS in Computer Science, Engineering, or a related STEM field, or equivalent experience.
  • 8+ years of professional experience in a related field.
  • Strong programming skills in Go and Python, with expertise in micro-services development.
  • Advanced knowledge of Kubernetes and Infrastructure as Code (IaC) practices.

Key responsibilities:

  • Develop and maintain tooling and automation to enhance developer productivity.
  • Lead the development of infrastructure automation frameworks and CI/CD pipelines.
  • Engage with engineering users to improve their experience with cloud solutions.
  • Troubleshoot complex production issues and enhance cloud infrastructure reliability.

NVIDIA XLarge http://www.nvidia.com
10001 Employees

Job description

The autonomous vehicle (AV) infrastructure group builds foundational infrastructure and tools to enable NVIDIA's AV program. We are seeking a motivated Senior Engineer to join our team in building and scaling our cloud-native infrastructure, which powers hundreds of micro-services and large-scale HPC clusters (15k+ GPUs). You'll play a critical role in driving infrastructure innovation across our organization. Ideal candidates will have strong software development skills as well as operational (SRE) skills.

What you'll be doing:

  • Develop, operate and maintain tooling and automation to enhance developer productivity and operational efficiency for the org

  • Lead the development of infrastructure automation frameworks and CI/CD pipelines, ensuring robust, scalable, and secure deployment of cloud-native applications

  • Engage directly with engineering users to understand their needs and improve their experience by recommending robust, scalable cloud solutions

  • Contribute to the design and architecture of the cloud infrastructure, traffic and networking components to meet the evolving needs of our internal developer platform

  • Play a pivotal role in improving the reliability and performance of cloud infrastructure and services

  • Troubleshoot complex production issues

What we need to see:

  • BS/MS in Computer Science, Engineering, or a related STEM field (or equivalent experience)

  • 8+ years of professional experience in a related field

  • Strong programming fundamentals with expertise in Go and Python

  • Experience developing and operating micro-services at scale

  • Good understanding of SRE best practices, alerting, and observability

  • Advanced Kubernetes workload management expertise, including traffic management, deployment strategies, observability, and security

  • Strong Infrastructure as Code (IaC) fundamentals with experience in developing infrastructure CI/CD pipelines, automation frameworks, and IaC libraries

Ways to stand out from the crowd:

  • Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills

  • Excellent written and verbal interpersonal skills

  • Contributions to open-source projects

  • Previous experience building sophisticated tooling and SRE automation on large GPU/CPU clusters

  • Deep AWS expertise across core services (VPC, IAM, EC2, S3, RDS, CloudFront, EKS) with proven experience in designing and managing scalable cloud infrastructure

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required profile

Spoken language(s):
English

Other Skills

  • Problem Solving
  • Non-Verbal Communication
  • Social Skills
