AI Performance Engineer

AI Performance Engineer

Devire

Hexjobs Insights

Role: AI Performance Engineer. Responsibilities include optimizing AI models on Ascend NPUs, performance analysis, developing critical operators, and team leadership. Requires deep understanding of GPU/NPU architecture and relevant experience.

Schlüsselwörter

GPU architecture
NPU architecture
deep learning frameworks
performance analysis
software engineering
optimization
performance tuning
model optimization
Python

Vorteile

  • Full time office work in R&D Lab
  • Contract of employment with annual bonus
  • Private healthcare package
  • Sport Cards for various subscriptions
  • Benefit Platform with diverse options
  • Special discounts for employees
  • Office massages

What we offer

  • Full time office work in our R&D Lab
  • Contract of employment with annual bonus
  • Private healthcare package. We offer premium private healthcare package for our employees.
  • Sport Cards. Our employees can choose from many options within sport subscriptions and sport associations.
  • Benefit Platform. You can choose your benefits on our Benefit Platform e.g.: cinema/theater tickets and discounts, shopping cards and many more.
  • Special discounts for employees. We cooperate with various local companies to offer unique promotions only for our Employees.
  • Office massages. It is a 15-minute chair-based massage. The massage therapist focuses on your back, neck, shoulders and arms.

Requirements

  1. Deep understanding of GPU or NPU architecture, including execution units, memory hierarchy, interconnects, and thread scheduling, as well as performance bottleneck analysis methodologies.
  2. Familiarity with mainstream deep learning frameworks such as PyTorch, TensorFlow, or JAX.
  3. Hands-on experience in deep learning operator/kernel development and performance tuning, with the ability to implement and optimize complex operators.
  4. Proficiency with performance analysis and profiling tools (e.g., Nsight Compute, nvprof, torch.profiler), and ability to conduct quantitative analysis and performance modeling.
  5. Strong system design and software engineering skills, with the ability to balance performance, maintainability, and generality in complex systems.

 Education

  1. Master’s or Ph.D. degree in Computer Architecture, Compiler Design, High Performance Computing, or a related field.

Responsibilities

  1. Lead performance optimization of AI models on Ascend NPUs, including performance analysis, bottleneck identification, and optimization implementation for both training and inference workloads.
  2. Analyze performance bottlenecks of multimodal models and large language models (LLMs) on the Ascend platform, covering operators, kernels, memory access patterns, and scheduling; design and implement optimization strategies.
  3. Develop and optimize critical operators/kernels, continuously improving execution efficiency, memory access patterns, parallelization strategies, and hardware resource utilization.
  4. Research and apply advanced techniques such as auto-tuning, operator fusion, graph optimization, and scheduling optimization in real-world production scenarios.
  5. Build and lead an NPU performance optimization team; communicate findings to cross-functional teams and leadership, and contribute to the evolution of next-generation Ascend NPU architecture.
Aufrufe: 9
Veröffentlichtvor 15 Tagen
Läuft abin 15 Tagen

Ähnliche Jobs, die für Sie von Interesse sein könnten

Basierend auf "AI Performance Engineer"

Keine Angebote gefunden, versuchen Sie, Ihre Suchkriterien zu ändern.