Course Outline

Overview of CANN Optimization Capabilities

  • How inference performance is handled in CANN
  • Optimization goals for edge and embedded AI systems
  • Understanding AI Core utilization and memory allocation

Using Graph Engine for Analysis

  • Introduction to the Graph Engine and execution pipeline
  • Visualizing operator graphs and runtime metrics
  • Modifying computational graphs for optimization

Profiling Tools and Performance Metrics

  • Using CANN Profiling Tool (profiler) for workload analysis
  • Analyzing kernel execution time and bottlenecks
  • Memory access profiling and tiling strategies

Custom Operator Development with TIK

  • Overview of TIK and operator programming model
  • Implementing a custom operator using TIK DSL
  • Testing and benchmarking operator performance

Advanced Operator Optimization with TVM

  • Intro to TVM integration with CANN
  • Auto-tuning strategies for computational graphs
  • When and how to switch between TVM and TIK

Memory Optimization Techniques

  • Managing memory layout and buffer placement
  • Techniques to reduce on-chip memory consumption
  • Best practices for asynchronous execution and reuse

Real-World Deployment and Case Studies

  • Case study: performance tuning for smart city camera pipeline
  • Case study: optimizing autonomous vehicle inference stack
  • Guidelines for iterative profiling and continuous improvement

Summary and Next Steps

Requirements

  • Strong understanding of deep learning model architectures and training workflows
  • Experience with model deployment using CANN, TensorFlow, or PyTorch
  • Familiarity with Linux CLI, shell scripting, and Python programming

Audience

  • AI performance engineers
  • Inference optimization specialists
  • Developers working with edge AI or real-time systems
 14 Hours

Delivery Options

Private Group Training

Our identity is rooted in delivering exactly what our clients need.

  • Pre-course call with your trainer
  • Customisation of the learning experience to achieve your goals -
    • Bespoke outlines
    • Practical hands-on exercises containing data / scenarios recognisable to the learners
  • Training scheduled on a date of your choice
  • Delivered online, onsite/classroom or hybrid by experts sharing real world experience

Private Group Prices RRP from £3800 online delivery, based on a group of 2 delegates, £1200 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.

Contact us for an exact quote and to hear our latest promotions


Public Training

Please see our public courses

Provisional Upcoming Courses (Contact Us For More Information)

Related Categories