How to Configure NVLink on a VPS
NVIDIA’s NVLink is a high-speed interconnect technology designed to enable fast communication between GPUs and CPUs in a system, allowing for much higher data transfer rates than traditional PCIe connections. It’s particularly beneficial in high-performance computing (HPC), AI research, and deep learning tasks, where large volumes of data need to be processed quickly between GPUs. Configuring NVLink can significantly enhance performance for workloads that require multi-GPU setups. This guide will walk you through the steps to configure NVLink on your machine.
What is NVLink?
NVLink is an NVIDIA proprietary interconnect technology that enables data transfer between GPUs or between GPUs and CPUs at high bandwidths. It allows faster communication than standard PCI Express (PCIe) connections by using multiple lanes of data. This direct communication minimizes the need for data to go through the CPU, which can reduce bottlenecks and improve overall system performance.
Key advantages of NVLink:
- Faster Communication: NVLink provides up to 25 GB/s per direction per link (50 GB/s bidirectional in NVLink 2.0), well above the roughly 16 GB/s per direction of a full PCIe 3.0 x16 slot.
- Scalability: NVLink enables scaling across multiple GPUs for complex computations such as AI model training or scientific simulations.
- Unified Memory Access: NVLink facilitates shared memory access between GPUs, allowing them to use each other’s memory and work on large datasets together.
Prerequisites for NVLink Configuration
Before setting up NVLink, make sure you have the following:
- Supported GPUs: NVLink is only available on certain NVIDIA GPUs, such as NVIDIA Tesla and Quadro cards and some GeForce RTX models (e.g., the RTX 2080, 2080 Ti, and 3090; note that the RTX 3080 has no NVLink connector). Verify that your GPUs support NVLink by checking NVIDIA’s documentation for the specific model you are using.
- NVLink Bridge: An NVLink bridge is required to physically connect the GPUs. The number of bridges you need will depend on the number of GPUs and the configuration you want. Some GPUs can use multiple NVLinks for higher bandwidth.
- NVLink-Enabled System: Ensure your system’s motherboard and BIOS support NVLink. The system must have enough PCIe slots and appropriate spacing to physically fit the GPUs and NVLink bridge.
- NVIDIA Drivers and CUDA: You will need to install the latest NVIDIA drivers and CUDA toolkit to fully utilize NVLink. These are essential for enabling communication between the GPUs.
Step-by-Step Guide to Configuring NVLink
Step 1: Install the GPUs
Before configuring NVLink, install your supported NVIDIA GPUs into the PCIe slots of your system. Make sure the GPUs are securely seated in their respective slots. If your motherboard supports multiple GPUs, be mindful of the slot spacing, as it must accommodate the NVLink bridge.
- Power down the machine and unplug it.
- Open the system’s case.
- Insert the GPUs into the PCIe slots and secure them in place.
- Connect any necessary power cables from the power supply unit (PSU) to the GPUs.
- Close the system case and power the system back on.
Step 2: Install the NVLink Bridge
Once the GPUs are installed, you can connect them using the NVLink bridge. Follow these steps:
- Identify the NVLink connectors on the top of the GPUs. These connectors are designed specifically for the NVLink bridge.
- Align the NVLink bridge with the connectors and carefully press it into place. Ensure that the bridge is seated firmly on both GPUs.
- If your configuration uses multiple NVLink bridges (for some GPUs with two connectors), repeat this process for each bridge.
Step 3: Install NVIDIA Drivers
To enable NVLink, you need the latest NVIDIA drivers installed on your system. Follow these steps to install the drivers:
- Download the Latest NVIDIA Drivers: Visit the NVIDIA Driver Downloads page, select your GPU model, and download the appropriate drivers for your operating system.
- Install the Drivers: For Linux:
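On Ubuntu or Debian, for example, the driver can be installed from the distribution repositories. The package version shown below (550) is only illustrative; use the version recommended for your GPU:

```shell
# Ubuntu/Debian example; the driver version (550) is illustrative
sudo apt update
sudo apt install -y nvidia-driver-550

# Alternatively, run the .run installer downloaded from NVIDIA:
# chmod +x NVIDIA-Linux-x86_64-<version>.run
# sudo ./NVIDIA-Linux-x86_64-<version>.run
```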
For Windows, run the downloaded driver installer and follow the on-screen instructions.
- Reboot the System: After installing the drivers, reboot your system to ensure that the changes take effect.
Step 4: Install CUDA Toolkit
NVLink is often used in combination with CUDA, especially in deep learning and HPC tasks. To fully leverage NVLink, you need the CUDA Toolkit installed on your system.
- Download CUDA Toolkit: Visit the CUDA Toolkit download page and download the appropriate version for your operating system.
- Install CUDA: For Linux:
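On Ubuntu, for example, CUDA can be installed from NVIDIA's apt repository. The distribution string (ubuntu2204) and toolkit version (12-4) below are examples; the CUDA download page generates the exact commands for your distribution and version:

```shell
# Ubuntu 22.04 example; distribution and version strings are illustrative
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt update
sudo apt install -y cuda-toolkit-12-4
```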
For Windows, run the CUDA installer and follow the prompts.
- Verify CUDA Installation: After installation, verify that CUDA is correctly installed by running the following command:
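For example, using the CUDA compiler driver:

```shell
# Prints the installed CUDA release, e.g. "release 12.4"
nvcc --version
```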
This should return the version of CUDA installed on your system.
Step 5: Enable NVLink
Now that your GPUs are installed, and the drivers and CUDA are set up, you can enable NVLink.
- Check GPU Configuration: Verify that the GPUs are installed and recognized by the system using the following command (Linux):
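For example, either of the following works; nvidia-smi requires the driver to be installed:

```shell
# List NVIDIA devices visible on the PCIe bus
lspci | grep -i nvidia

# Summary of GPUs detected by the NVIDIA driver
nvidia-smi
```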
This should list all GPUs installed in your system and show their respective PCIe and NVLink status.
- Enable NVLink with nvidia-smi: NVLink is activated automatically by the driver once the bridge is detected; the nvidia-smi tool lets you verify and monitor it. To check whether NVLink is active, run:
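For example, using the standard nvidia-smi subcommands:

```shell
# Per-link NVLink status for each GPU
nvidia-smi nvlink --status

# Topology matrix: NVLink-connected GPU pairs appear as NV1, NV2, ...
nvidia-smi topo -m
```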
If NVLink is properly configured, the output will show active links between the GPUs. In the nvidia-smi topology matrix, NVLink-connected GPU pairs are labeled NV1, NV2, and so on, where the number indicates how many links join the pair.
- Benchmark NVLink: You can run tests or benchmarks to measure NVLink’s performance. Tools such as cuBLAS or NCCL (NVIDIA Collective Communications Library) can be used to benchmark the data transfer rates between GPUs over NVLink.
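As a sketch, NVIDIA's open-source nccl-tests suite measures effective inter-GPU bandwidth; adjust -g to your GPU count, and you may need to point make at your CUDA install (e.g. CUDA_HOME=/usr/local/cuda):

```shell
# Build NVIDIA's NCCL benchmark suite
git clone https://github.com/NVIDIA/nccl-tests.git
cd nccl-tests
make

# All-reduce across 2 GPUs, message sizes from 8 B to 1 GiB
./build/all_reduce_perf -b 8 -e 1G -f 2 -g 2
```

With NVLink active, the reported bus bandwidth should be well above what the same test achieves over PCIe alone.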
Step 6: Configure Applications to Use NVLink
For your applications to take advantage of NVLink, you need to configure them to utilize the high-bandwidth interconnect between GPUs. Many deep learning frameworks like TensorFlow, PyTorch, and MXNet automatically detect NVLink and use it for multi-GPU operations.
For example, in TensorFlow, you can check the available GPUs and their NVLink connectivity using:
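A minimal check (TensorFlow 2.x): note that TensorFlow does not report NVLink topology directly; it relies on NCCL, which selects NVLink automatically when available.

```python
import tensorflow as tf

# List GPUs visible to TensorFlow; with working drivers this should
# show one entry per physical GPU
gpus = tf.config.list_physical_devices('GPU')
print(gpus)

# Device details (name, compute capability) for the first GPU, if present
if gpus:
    print(tf.config.experimental.get_device_details(gpus[0]))
```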
Under the hood, these frameworks rely on NCCL (NVIDIA Collective Communications Library), which is optimized for NVLink and handles the data transfers in multi-GPU environments.
Step 7: Monitor NVLink Status
Once NVLink is set up, it’s essential to monitor its status to ensure it’s functioning correctly. The nvidia-smi tool provides comprehensive monitoring of NVLink traffic, bandwidth utilization, and GPU performance. You can use:
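For example, the nvlink subcommand exposes throughput and error counters:

```shell
# Cumulative NVLink data throughput counters (TX/RX) per link
nvidia-smi nvlink -gt d

# NVLink error counters per link
nvidia-smi nvlink -e
```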
This command shows detailed statistics about the NVLink connection, including throughput and any errors encountered during data transfers.
Conclusion
Configuring NVLink on a machine can significantly improve the performance of multi-GPU setups, especially for workloads that involve high-speed data transfer between GPUs, such as deep learning, AI training, and HPC tasks. By following the steps outlined in this guide, you can install and configure NVLink for optimal performance in your system. Make sure to properly install your GPUs, connect them with the appropriate NVLink bridge, and install the latest drivers and CUDA toolkit for full support of NVLink. Finally, monitor the performance and status of NVLink to ensure it’s working as intended, enabling you to fully leverage its high-speed interconnect capabilities.