Match score not available

Solutions Architect, Deep Learning - Grace Based Platforms

extra holidays - fully flexible
Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

MS/PhD in relevant technical field, 5+ years experience with Python/C++, Knowledge of modern NLP and LLM, Experience with DL training libraries.

Key responsabilities:

  • Work directly with key customers
  • Develop solutions based on NVIDIA’s AI technologies

NVIDIA logo
NVIDIA XLarge http://www.nvidia.com
10001 Employees
See all jobs

Job description

NVIDIA’s Worldwide Field Operations (WWFO) team is looking for a Deep Learning focused Solution Architect with good understanding of ARM based architectures. In particular, a candidate that will support our key customers in adoption of our Grace-Hopper, Grace-Blackwell systems for DNN training and inference (e.g. understanding of model compression techniques, model compilation or model serving). We seek candidates with understanding of modern Natural Language Processing (NLP) and Large Language Models (LLM).

In our Solutions Architecture team, we work with the most exciting computing hardware and software, driving the latest breakthroughs in artificial intelligence! We need individuals who can enable customer productivity and develop lasting relationships with our technology partners, making NVIDIA an integral part of end-user solutions. We are looking for someone always passionate about artificial intelligence, someone who can maintain understanding of a fast paced field, someone able to coordinate efforts between corporate marketing, industry business development and engineering. We will be working with SOTA NLP, LLM, VLM models that are fundamentally changing the way people interact with technology! Solutions Architects, are the first line of technical expertise between NVIDIA and our customers. Your duties will vary from working on proof-of-concept demonstrations, to driving relationships with key executives and managers in order to promote adoption of NVIDIA based AI technology. Dynamically engaging with developers, scientific researchers, data scientists, IT managers and senior leaders is a significant part of the Solutions Architect role and will give you experience with a range of partners and concerns.

What you will be doing:

  • Work directly with key customers to understand their technology and provide the best solutions.

  • Develop and demonstrate solutions based on NVIDIA’s key AI technologies.

  • Perform in-depth analysis and optimization to ensure the best performance on GPU architecture systems (in particular Grace/ARM based systems). This includes support in optimization of both development and deployment NLP/LLM pipelines.

  • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations.

  • Build industry expertise and become a contributor in integrating NVIDIA technology into Enterprise Computing architectures.

What we need to see:

  • Excellent verbal, written communication, and technical presentation skills in English.

  • MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields.

  • 5+ years work or research experience with Python/ C++ / other software development

  • Work experience and knowledge of modern NLP including good understanding of transformer and state space model architectures. This can include either expertise in training or optimisation/compression/operation of DNNs.

  • Understanding of key libraries used for NLP/LLM training (such as Megatron-LN, NeMo, DeepSpeed etc.) and/or deployment (e.g. TensorRT-LLM, vLLM, Triton Inference Server).

  • Person excited to work with multiple levels and teams across organizations (Engineering, Product, Sales and Marketing team). Capable of working in a constantly evolving environment without losing focus.

  • Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very complex projects. Self-starter with demeanor for growth, passion for continuous learning and sharing findings across the team.

Ways to Stand Out from The Crowd:

  • Experience running/debugging large scale distributed DL training.

  • Experience working with larger transformer-based architectures for NLP, CV, ASR or other.

  • Background with applying NLP technology and its deployment to production.

  • Experience using DevOps technologies such as Docker, Kubernetes, Singularity, etc.

  • Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Time Management
  • Communication
  • Problem Solving

Solutions Architect Related jobs