EFA Now Supports NVIDIA GPUDirect RDMA

We are excited to announce that Elastic Fabric Adapter (EFA) now supports NVIDIA GPUDirect Remote Direct Memory Access (RDMA). GPUDirect RDMA support on EFA will be available on Amazon Elastic Compute Cloud (Amazon EC2) P4d instances, the next generation of GPU-based instances on AWS. P4d provides the highest performance for machine learning (ML) training and high performance computing (HPC) in the cloud for applications such as natural language processing, object detection and classification, seismic analysis, and computational drug discovery. GPUDirect RDMA support on EFA enables network interface cards (NICs) to directly access GPU memory. This avoids extra memory copies, which makes remote GPU-to-GPU communication across NVIDIA GPU-based Amazon EC2 instances faster and reduces orchestration overhead on CPUs and user applications. As a result, customers running applications that use the NVIDIA Collective Communications Library (NCCL) on P4d will be able to further accelerate their multi-node, tightly coupled workloads.
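As a rough illustration of what this can look like for an NCCL job, here is a minimal Python sketch that prepares the environment for a training process. It is not part of the announcement. It assumes the EFA software stack and the AWS OFI NCCL plugin are already installed on the instance; the helper name `efa_gpudirect_env` is hypothetical, and whether these variables are needed at all, or are already the default, depends on the installed versions.

```python
import os


def efa_gpudirect_env(base=None):
    """Return a copy of `base` (default: os.environ) with EFA/NCCL settings added.

    Hypothetical helper for illustration only; the settings below are
    assumptions about a typical EFA + NCCL setup, not values taken from
    the announcement.
    """
    env = dict(os.environ if base is None else base)
    env["FI_PROVIDER"] = "efa"           # ask libfabric to use the EFA provider
    env["FI_EFA_USE_DEVICE_RDMA"] = "1"  # request GPUDirect RDMA from the EFA provider
    env["NCCL_DEBUG"] = "INFO"           # make NCCL log which network transport it picked
    return env
```

The returned dictionary can be passed as the `env` argument of `subprocess.run` when launching the training command. With `NCCL_DEBUG=INFO` set, the NCCL startup log is the place to confirm which transport was actually selected.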
