Get in Touch

Course Outline

Hunyuan Multimodal Foundations and Lab Setup

  • Understanding Hunyuan's multimodal capabilities for image, 3D, and video use cases
  • Identifying practical business scenarios for creative, product, and content teams
  • Preparing the lab environment, sample assets, and model access
  • Executing initial generation tasks and reviewing outputs

Prompt Design and Workflow Patterns

  • Structuring prompts for consistent multimodal results
  • Working with text prompts, reference images, and basic input settings
  • Selecting appropriate workflows for image, video, or 3D generation
  • Iterating on prompts based on output quality and business intent

Image Generation and Review Labs

  • Creating marketing, product, and concept images from prompts
  • Refining visual style, composition, and content consistency
  • Reviewing outputs for usefulness, quality, and brand alignment
  • Organising image outputs for approval and downstream use

Video Generation Labs

  • Creating short video outputs from prompts and prepared inputs
  • Controlling style, scene intent, and output variation
  • Reviewing videos for clarity, continuity, and practical application
  • Preparing video outputs for demonstration or content workflows

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs
  • Checking geometry, texture quality, and asset usability
  • Exporting assets for visualization, prototyping, or content pipelines
  • Comparing when 3D generation is appropriate versus image or video workflows

Integration, Governance, and Next Steps

  • Delivering generated assets through simple apps, services, or APIs
  • Connecting multimodal outputs to product, content, and review workflows
  • Applying practical checks for quality, brand safety, copyright, and responsible use
  • Planning pilot use cases and next steps for internal adoption

Requirements

  • Fundamental understanding of AI and generative AI concepts
  • Experience with web applications, APIs, or standard developer tools
  • Basic proficiency in Python or scripting

Audience

  • Developers creating AI-enabled product features
  • Technical product managers and solution architects
  • Innovation, media, and digital teams working with image, video, or 3D content
 14 Hours

Related Categories