admin管理员组

文章数量:1620390

Windows 系统上安装 CUDA Toolkit 10.1

  • 1. NVCUDA.DLL - NVIDIA CUDA 10.1.135 driver - NVIDIA 驱动程序版本
  • 2. CUDA Toolkit
  • 3. CUDA Toolkit 10.1 original Archive
  • 4. cuda_10.1.105_418.96_win10.exe
  • 5. nvcc -V
  • 6. C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1
  • 7. 环境变量 -> 系统变量
  • 8. demo_suite
  • References

1. NVCUDA.DLL - NVIDIA CUDA 10.1.135 driver - NVIDIA 驱动程序版本

NVIDIA 控制面板 -> 帮助 -> 系统信息 -> 组件

2. CUDA Toolkit

CUDA Toolkit
https://developer.nvidia/cuda-toolkit

CUDA Toolkit Download
https://developer.nvidia/cuda-downloads

CUDA Toolkit Archive
https://developer.nvidia/cuda-toolkit-archive

3. CUDA Toolkit 10.1 original Archive

CUDA Toolkit 10.1 original Archive
https://developer.nvidia/cuda-10.1-download-archive-base

CUDA Toolkit 10.1 (Feb 2019), Online Documentation
https://docs.nvidia/cuda/archive/10.1/

The checksums for the installer and patches can be found in Installer Checksums.
https://developer.download.nvidia/compute/cuda/10.1/Prod/docs/sidebar/md5sum.txt

For further information, see the Installation Guide for Microsoft Windows and the CUDA Quick Start Guide.
https://docs.nvidia/cuda/cuda-installation-guide-microsoft-windows/index.html
https://docs.nvidia/cuda/cuda-quick-start-guide/index.html

4. cuda_10.1.105_418.96_win10.exe

1. Double click cuda_10.1.105_418.96_win10.exe

C:\Users\cheng\AppData\Local\Temp\CUDA

2. NVIDIA 软件许可协议

3. 自定义

4. 选择安装位置

5. NVIDIA 安装程序失败

出现 “NVIDIA 安装程序失败” 的提示,关闭安装界面。重新安装,在 “自定义安装选项” 页面,取消勾选 Visual Studio Integration,即可正常安装。

6. 自定义安装选项 - 取消勾选 Visual Studio Integration

7. CUDA Samples

C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

5. nvcc -V

Microsoft Windows [版本 10.0.17763.1158]
(c) 2018 Microsoft Corporation。保留所有权利。

C:\Users\cheng>nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>

6. C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

7. 环境变量 -> 系统变量

CUDA_PATH
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

CUDA_PATH_V10_1
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

NVCUDASAMPLES_ROOT
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

NVCUDASAMPLES10_1_ROOT
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

8. demo_suite

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite

Shift + 鼠标右键 -> 在此处打开 Powershell 窗口

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> dir


    目录: C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite


Mode                LastWriteTime         Length Name
----                -------------         ------ ----
d-----         2020/5/4     21:16                nbody_data_files
d-----         2020/5/4     21:16                oceanFFT_data_files
d-----         2020/5/4     21:16                randomFog_data_files
-a----         2019/2/9     13:57         250880 bandwidthTest.exe
-a----         2019/2/9     13:57         279552 busGrind.exe
-a----         2019/2/9     13:57         200192 deviceQuery.exe
-a----         2019/2/9     13:57         311296 freeglut.dll
-a----         2019/2/9     13:57         229376 glew64.dll
-a----         2019/2/9     13:57        1265664 nbody.exe
-a----         2019/2/9     13:57         552448 oceanFFT.exe
-a----         2019/2/9     13:57         409088 randomFog.exe
-a----         2019/2/9     13:57         224256 vectorAdd.exe


PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>

.\bandwidthTest.exe

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> .\bandwidthTest.exe
[CUDA Bandwidth Test] - Starting...
Running on...

 Device 0: GeForce GTX 1060 with Max-Q Design
 Quick Mode

 Host to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     6343.7

 Device to Host Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     6409.8

 Device to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     146651.3

Result = PASS

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>

.\deviceQuery.exe

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> .\deviceQuery.exe
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite\deviceQuery.exe Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 1060 with Max-Q Design"
  CUDA Driver Version / Runtime Version          10.1 / 10.1
  CUDA Capability Major/Minor version number:    6.1
  Total amount of global memory:                 6144 MBytes (6442450944 bytes)
  (10) Multiprocessors, (128) CUDA Cores/MP:     1280 CUDA Cores
  GPU Max Clock rate:                            1342 MHz (1.34 GHz)
  Memory Clock rate:                             4004 Mhz
  Memory Bus Width:                              192-bit
  L2 Cache Size:                                 1572864 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
  Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers
  Total amount of constant memory:               zu bytes
  Total amount of shared memory per block:       zu bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          zu bytes
  Texture alignment:                             zu bytes
  Concurrent copy and kernel execution:          Yes with 5 copy engine(s)
  Run time limit on kernels:                     Yes
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)
  Device supports Unified Addressing (UVA):      Yes
  Device supports Compute Preemption:            Yes
  Supports Cooperative Kernel Launch:            No
  Supports MultiDevice Co-op Kernel Launch:      No
  Device PCI Domain ID / Bus ID / location ID:   0 / 1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.1, CUDA Runtime Version = 10.1, NumDevs = 1, Device0 = GeForce GTX 1060 with Max-Q Design
Result = PASS
PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>

References

[1] Yongqiang Cheng, https://yongqiang.blog.csdn/
[2] CUDA Toolkit, https://developer.nvidia/cuda-toolkit
[3] GPU-Accelerated Libraries for AI and HPC, https://developer.nvidia/gpu-accelerated-libraries
[4] NVIDIA Developer Tools Overview, https://developer.nvidia/tools-overview
[5] CUDA Toolkit Documentation, https://docs.nvidia/cuda/
[6] CUDA Installation Guide for Microsoft Windows, https://docs.nvidia/cuda/cuda-installation-guide-microsoft-windows/index.html
[7] CUDA Quick Start Guide, https://docs.nvidia/cuda/cuda-quick-start-guide/index.html
[8] CUDA Code Samples, https://developer.nvidia/cuda-code-samples
[9] An Even Easier Introduction to CUDA, https://devblogs.nvidia/even-easier-introduction-cuda/
[10] NVIDIA Developer Blog, https://devblogs.nvidia/
[11] Accelerated Computing - Training, https://developer.nvidia/accelerated-computing-training
[12] NVIDIA Developer Forums, https://forums.developer.nvidia/
[13] Display Driver Uninstaller (DDU), https://www.wagnardsoft/

本文标签: 系统WindowsCUDAToolkit