Engineering Manager, Deep Learning Inference

Remote Full-time
Job Description: • Lead, mentor, and scale a high-performing engineering team focused on deep learning inference and GPU-accelerated software • Drive the strategy, roadmap, and execution of NVIDIA’s inference frameworks engineering • Partner with internal compiler, libraries, and research teams to deliver end-to-end optimized inference pipelines • Oversee performance tuning, profiling, and optimization of large-scale models • Guide engineers in adopting best practices for CUDA, Triton, CUTLASS, and multi-GPU communications • Represent the team in roadmap and planning discussions • Foster a culture of technical excellence, open collaboration, and continuous innovation Requirements: • MS, PhD, or equivalent experience in Computer Science, Electrical/Computer Engineering, or a related field • 6+ years of software development experience • 3+ years in technical leadership or engineering management • Strong background in C/C++ software design and development • Proficiency in Python is a plus • Hands-on experience with GPU programming (CUDA, Triton, CUTLASS) • Proven record of deploying or optimizing deep learning models in production environments • Experience leading teams using Agile or collaborative software development practices Benefits: • Health insurance • Comprehensive benefits package Apply tot his job
Apply Now →

Similar Jobs

Data Engineer - Healthcare

Remote Full-time

Technical Project Manager – Robotics Hardware

Remote Full-time

(Remote) Director of Applied Science - Healthcare AI

Remote Full-time

[Remote] URGENT HIRING | Healthcare Customer Service Advocate - Remote

Remote Full-time

Rust Developer (Train AI Models Part Time!)

Remote Full-time

[Remote] Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Remote Full-time

Quality Assurance Engineer (AWS Lex and Google Dialogflow)

Remote Full-time

Quality Assurance Engineer – PT, up to 20 hours per week

Remote Full-time

VP, Quality Assurance, AI Tools

Remote Full-time

[Remote] Cloud Solution Architect - Cloud & AI Infrastructure

Remote Full-time

Faculty Advisor - Nursing

Remote Full-time

Desk adjuster - Senior Adjuster(Remote, 1099)

Remote Full-time

Experienced Remote Data Entry Clerk – Accurate Data Management and Entry for a Dynamic Team at blithequark

Remote Full-time

Project Manager (FEMA Emergency Management Training Program) at INTECON

Remote Full-time

Experienced Full Stack Technical Customer Service Specialist – Cloud Application Support and Account Management for US Government and Enterprise Clients at Blithequark

Remote Full-time

**Experienced Full Stack Live Chat Operator – Remote Customer Support for blithequark**

Remote Full-time

Customer Service Representative – FEMA Training Support

Remote Full-time

Experienced Senior Data Scientist – Advanced Data Analysis, Predictive Analytics, and Machine Learning Expertise for Strategic Business Insights and Decision Making at arenaflex

Remote Full-time

Cost Estimator

Remote Full-time

Family Development and Service Coordinator

Remote Full-time
← Back to Home