AI Inference and Deployment with CloudMatrix Training Course

CloudMatrix serves as Huawei's comprehensive AI development and deployment platform, engineered to facilitate scalable, production-ready inference pipelines.

This live training, led by an instructor (available online or on-site), targets beginner to intermediate AI professionals aiming to deploy and monitor AI models using the CloudMatrix platform, leveraging its integration with CANN and MindSpore.

Upon completion of this training, participants will be equipped to:

Utilize CloudMatrix for model packaging, deployment, and serving.
Convert and optimize models specifically for Ascend chipsets.
Establish pipelines for both real-time and batch inference tasks.
Monitor deployments and fine-tune performance within production environments.

Course Format

Interactive lectures and discussions.
Practical application of CloudMatrix through real-world deployment scenarios.
Guided exercises centered on conversion, optimization, and scaling.

Customization Options

For customized training tailored to your specific AI infrastructure or cloud environment, please contact us to arrange.

This course is available as onsite live training in Kenya or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction to Huawei CloudMatrix

CloudMatrix ecosystem and deployment flow
Supported models, formats, and deployment modes
Typical use cases and supported chipsets

Preparing Models for Deployment

Model export from training tools (MindSpore, TensorFlow, PyTorch)
Using ATC (Ascend Tensor Compiler) for format conversion
Static vs dynamic shape models

Deploying to CloudMatrix

Service creation and model registration
Deploying inference services via UI or CLI
Routing, authentication, and access control

Serving Inference Requests

Batch vs real-time inference flows
Data preprocessing and postprocessing pipelines
Calling CloudMatrix services from external applications

Monitoring and Performance Tuning

Deployment logs and request tracking
Resource scaling and load balancing
Latency tuning and throughput optimization

Integration with Enterprise Tools

Connecting CloudMatrix with OBS and ModelArts
Using workflows and model versioning
CI/CD for model deployment and rollback

End-to-End Inference Pipeline

Deploying a complete image classification pipeline
Benchmarking and validating accuracy
Simulating failover and system alerts

Summary and Next Steps

Requirements

A foundational understanding of AI model training workflows
Practical experience with Python-based ML frameworks
Basic familiarity with cloud deployment concepts

Audience

AI operations teams
Machine learning engineers
Cloud deployment specialists working with Huawei infrastructure

21 Hours

Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793

Testimonials (2)

The extensive selection of tools presented

Miruna Buzduga - Aeronamic Eastern Europe

Course - AI Enablement Training for Engineers

Step by step training with a lot of exercises. It was like a workshop and I am very glad about that.

AI Inference and Deployment with CloudMatrix Training Course

Course Outline

Requirements

Testimonials (2)

Miruna Buzduga - Aeronamic Eastern Europe

Course - AI Enablement Training for Engineers

Ireneusz - Inter Cars S.A.

Course - Intelligent Applications Fundamentals

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

AI Inference and Deployment with CloudMatrix Training Course

Course Outline

Requirements

Testimonials (2)

Miruna Buzduga - Aeronamic Eastern Europe

Course - AI Enablement Training for Engineers

Ireneusz - Inter Cars S.A.

Course - Intelligent Applications Fundamentals

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Engineering Fundamentals

AI Enablement Training for Engineers

Building Intelligent Applications with AI and ML

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Intelligent Applications Fundamentals

Intelligent Applications Advanced

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

AI Engineering

Huawei Ascend

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites