Senior AI Software Engineer, GenAI Framework
Company: NVIDIA Corporation
Location: Redmond
Posted on: June 2, 2025
Job Description:
Locations: Santa Clara, CA, US; Redmond, WA, US
Time type: Full time
Job requisition ID: JR1997692

We are now looking
for AI Software Engineers for NeMo! NVIDIA NeMo is an open-source,
scalable and cloud-native framework built for researchers and
developers working on Large Language Models (LLM), Multimodal (MM),
and Speech AI. NeMo provides end-to-end model training, including
data curation, alignment, customization, evaluation, deployment and
tooling to optimize performance and user experience.

In this
critical role, you will expand NeMo Framework's capabilities,
enabling users to develop, train, and optimize models by designing
and implementing new features and optimizations, defining robust
APIs, meticulously analyzing and tuning performance, and expanding
our toolkits and libraries to be more comprehensive and coherent.
You will collaborate with internal partners, users, and members of
the open source community to analyze, define and implement highly
optimized solutions.

What you'll be doing:
- Design and develop the GenAI open source NeMo Framework and
Megatron Core.
- Solve large-scale, end-to-end AI training and inference
challenges, spanning the full model lifecycle from initial data
curation and pre-processing, orchestration and running of model
training and tuning, to model deployment.
- Work at the intersection of deep learning applications,
libraries, frameworks, and the entire software stack.
- Tune and optimize the performance of deep learning framework
and software components.
- Research, prototype and develop robust and scalable AI tools
and pipelines.

What we need to see:
- MS, PhD or equivalent experience in Computer Science, AI,
Applied Math, or related field and 5+ years of industry
experience.
- Experience with AI Frameworks (e.g. PyTorch, JAX), and/or
inference and deployment environments (e.g. TRT, ONNX,
Triton).
- Proficient in Python programming, software design, debugging,
performance analysis, test design and documentation.
- Consistent record of working effectively across multiple
engineering initiatives and improving AI libraries with new
innovations.
- Strong understanding of deep learning fundamentals and their
practical application.

Ways to stand out from the crowd:
- Expertise in large-scale AI training, with a deep understanding
of core compute system concepts (such as latency/throughput
bottlenecks, pipelining, and multiprocessing) and demonstrated
excellence in related performance analysis and tuning.
- Prior experience with Generative AI techniques applied to LLM
and MM learning (Text, Image, Video, Speech).
- Knowledge of GPU/CPU architecture and related numerical
software.
- Experience with cloud computing (e.g. end-to-end pipelines for
AI training and inference on CSPs such as AWS, Azure, GCP, or OCI).
- Contributions to open source deep learning frameworks.

NVIDIA is
widely considered to be one of the technology world's most
desirable employers. We have some of the most forward-thinking and
hardworking people on the planet working with us. If you're
creative and autonomous, we want to hear from you!

The base salary
range is 148,000 USD - 287,500 USD. Your base salary will be
determined based on your location, experience, and the pay of
employees in similar positions. You will also be eligible for equity
and benefits. NVIDIA accepts applications on an ongoing
basis.

NVIDIA is committed to fostering a diverse work environment
and proud to be an equal opportunity employer. As we highly value
diversity in our current and future employees, we do not
discriminate (including in our hiring and promotion practices) on
the basis of race, religion, color, national origin, gender, gender
expression, sexual orientation, age, marital status, veteran
status, disability status or any other characteristic protected by
law.