Get in Touch

Course Outline

Introduction to Programming Big Data with R (bpdR)

  • Configuring your environment for pbdR usage
  • Exploring the scope and tools available in pbdR
  • Commonly used packages for Big Data alongside pbdR

Message Passing Interface (MPI)

  • Utilizing pbdR MPI 5
  • Implementing parallel processing
  • Point-to-point communication
  • Handling Send Matrices
  • Summing Matrices
  • Collective communication
  • Summing Matrices with Reduce
  • Scatter / Gather operations
  • Other MPI communication methods

Distributed Matrices

  • Creating a distributed diagonal matrix
  • Performing SVD on a distributed matrix
  • Building a distributed matrix in parallel

Statistics Applications

  • Monte Carlo Integration
  • Reading Datasets
  • Reading data on all processes
  • Broadcasting from a single process
  • Reading partitioned data
  • Distributed Regression
  • Distributed Bootstrap
 21 Hours

Testimonials (2)

Related Categories