C++ Software Engineer - GPU Performance

Zoox
Foster City, CA
On-site
Full-time
USD 168,000-239,000 per year
Posted 10 days ago
Software

Job Description

Zoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require expert fine-tuning of both the compute hardware architecture and the algorithms and middleware that run on it, to achieve maximum throughput at optimal power levels.

The Software Performance team's mission is to analyze and optimize system performance and to provide guidance to the software and hardware teams in order to meet the required specifications.

As a GPU performance software engineer within the Software Performance team, you will instrument, monitor, analyze, and optimize GPU-based algorithms that are performance-critical for our solution. GPU usage ranges from traditional computer vision and deep learning architectures to complex geometric reasoning and multi-agent decision-making. Your work will strongly influence design decisions for future compute platforms and resource allocation.

Qualifications

  • BS in computer science or related field and 3+ years of experience.
  • Strong knowledge of CUDA as applied to recent GPU microarchitectures (e.g., Ampere, Blackwell) and experience debugging/optimizing GPU kernels using tools like Nsight.
  • Strong knowledge of C++, experience with large codebases, and comfort in Linux development environments.
  • Experience in development, debugging, and profiling of complex multiprocess systems (e.g., robotic systems, game engines).

Bonus Qualifications

  • Experience with GPU kernel development in a real-time environment, including PTX-level programming, CPU SIMD instructions (e.g., AVX intrinsics), and custom CUDA layers with frameworks like TensorRT & XLA.
  • Hands-on work with ML model optimization (post-training quantization, layer pruning, etc.) or hand-tuning GPU kernels (in OpenGL, CUDA, ROCm, or similar).
  • Proficiency with SQL, Databricks, Looker, or other business intelligence tools.

