由于Jetson TK1上不能跑大數(shù)據(jù)集的訓(xùn)練任務(wù),所以還需要性能更強(qiáng)悍的顯卡來(lái)進(jìn)行這種訓(xùn)練任務(wù)。所以需要在筆記本上安裝ubuntu系統(tǒng)以及cuda和cudnn工具。
筆記本:聯(lián)想小新-700
CPU:Intel? Core? i5-6300HQ CPU @ 2.30GHz × 4
GPU:GeForce GTX 950M/PCIe/SSE2
ubuntu系統(tǒng):16.04
CUDA:8.0
CUDNN:cudnn-8.0-linux-x64-v6.0.tgz
具體的安裝過程可以參考Ubuntu16.04+CUDA8.0+caffe配置
- 首先安裝nvidia顯卡驅(qū)動(dòng) 可在官網(wǎng)上下載對(duì)應(yīng)的驅(qū)動(dòng)程序包安裝 也可以直接在系統(tǒng)設(shè)置-軟件與程序-驅(qū)動(dòng)中選擇nvidia的顯卡驅(qū)動(dòng)進(jìn)行更新(一般都是tested),這里我們采用后一種方法直接更新,然后reboot即可
tips:在ubuntu下更新完nvidia的驅(qū)動(dòng)后 需要再次進(jìn)入boot設(shè)置 否則找不到boot點(diǎn) - reboot完成之后,可以在關(guān)于這臺(tái)計(jì)算機(jī)中查看是否驅(qū)動(dòng)安裝成功 如果成功的話會(huì)顯示當(dāng)前采用的是何種nvidia顯卡,另外也可以采用命令查詢:
sudo nvidia-smi
命令行會(huì)輸出nvidia顯卡的信息:
Wed Apr 19 11:04:15 2017
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 375.39 Driver Version: 375.39 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 950M Off | 0000:01:00.0 Off | N/A |
| N/A 40C P8 N/A / N/A | 105MiB / 2002MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 1026 G /usr/lib/xorg/Xorg 64MiB |
| 0 1722 G compiz 34MiB |
| 0 1953 G fcitx-qimpanel 6MiB |
+-----------------------------------------------------------------------------+
- 更新軟件包及安裝所需軟件
sudo apt-get install libprotobuf-dev libleveldb-dev libsnappy-dev libopencv-dev libhdf5-serial-dev protobuf-compiler
sudo apt-get install --no-install-recommends libboost-all-dev
sudo apt-get install libopenblas-dev liblapack-dev libatlas-base-dev
sudo apt-get install libgflags-dev libgoogle-glog-dev liblmdb-dev
- 接著我們安裝cuda8.0 已經(jīng)下載好了cuda8.0的.run文件
cuda_8.0.61_375.26_linux.run執(zhí)行:
sudo sh cuda_8.0.61_375.26_linux.run
一串的文件信息之后輸入accept即可,然后對(duì)于是否安裝nvidia顯卡選擇否,其他都選擇yes或者默認(rèn)路徑即可。然后設(shè)置環(huán)境變量
打開~/.bashrc文件:
sudo vim ~/.bashrc
將以下內(nèi)容寫入到~/.bashrc尾部:
export PATH=/usr/local/cuda-8.0/bin${PATH:+:${PATH}}
export LD_LIBRARY_PATH=/usr/local/cuda8.0/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
(4)測(cè)試CUDA的sammples
cd /usr/local/cuda-8.0/samples/1_Utilities/deviceQuery #由自己電腦目錄決定
make #此處需要root權(quán)限
sudo ./deviceQuery
輸出為一系列的gpu信息即為安裝成功:
Detected 1 CUDA Capable device(s)
Device 0: "GeForce GTX 950M"
CUDA Driver Version / Runtime Version 8.0 / 8.0
CUDA Capability Major/Minor version number: 5.0
Total amount of global memory: 2003 MBytes (2100232192 bytes)
( 5) Multiprocessors, (128) CUDA Cores/MP: 640 CUDA Cores
GPU Max Clock rate: 1124 MHz (1.12 GHz)
Memory Clock rate: 1001 Mhz
Memory Bus Width: 128-bit
L2 Cache Size: 2097152 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
Maximum Layered 1D Texture Size, (num) layers 1D=(16384), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(16384, 16384), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 1 copy engine(s)
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 8.0, CUDA Runtime Version = 8.0, NumDevs = 1, Device0 = GeForce GTX 950M
Result = PASS
- 接著配置cudnn加速工具 選擇對(duì)應(yīng)于cuda8.0的cudnn-v6.0最新版本
cudnn-8.0-linux-x64-v6.0.tgz解壓可得到一個(gè)cuda文件夾里面包含include和lib64兩個(gè)文件夾,然后按照教材copy即可。
#include文件夾
sudo cp cudnn.h /usr/local/cuda/include/ #復(fù)制頭文件
#lib64文件夾
sudo cp lib* /usr/local/cuda/lib64/ #復(fù)制動(dòng)態(tài)鏈接庫(kù)
cd /usr/local/cuda/lib64/sudo rm -rf libcudnn.so libcudnn.so.6 #刪除原有動(dòng)態(tài)文件
sudo ln -s libcudnn.so.6.0.20 libcudnn.so.6 #生成軟銜接
sudo ln -s libcudnn.so.6 libcudnn.so #生成軟鏈接
- 下載opencv 編譯 -- 需要編譯一段時(shí)間
opencv3.2-source 解壓此文件即可
#建立build文件夾#
cd ~/opencv
mkdir build
cd build
#環(huán)境配置#
cmake -D CMAKE_BUILD_TYPE=Release -D CMAKE_INSTALL_PREFIX=/usr/local ..
#編譯#
make -j 8
#安裝#
sudo make install
遇到的問題fata error: LAPACKE_H_PATH-NOTFOUND when building OpenCV 3.2,解決方法參考fata error: LAPACKE_H_PATH-NOTFOUND when building OpenCV 3.2
sudo apt-get install liblapacke-dev checkinstall
- 編譯caffe
make clean
make -j 8 all
make -j 8 runtest
遇到的問題就是"libcudart.so.8.0 cannot open shared object file: No such file or directory"只需要執(zhí)行
#注意自己CUDA的版本號(hào)!
sudo cp /usr/local/cuda-8.0/lib64/libcudart.so.8.0 /usr/local/lib/libcudart.so.8.0 && sudo ldconfig
sudo cp /usr/local/cuda-8.0/lib64/libcublas.so.8.0 /usr/local/lib/libcublas.so.8.0 && sudo ldconfig
sudo cp /usr/local/cuda-8.0/lib64/libcurand.so.8.0 /usr/local/lib/libcurand.so.8.0 && sudo ldconfig
還遇到了問題"libcudnn.so.6 cannot open shared object file: No such file or directory"這是因?yàn)閏opy的cudnn中l(wèi)ib文件夾內(nèi)的該文件沒有copy到/usr/local/lib下:
sudo cp /usr/local/cuda/lib64/libcudnn.so.6 /usr/local/lib/libcudnn.so.6 && sudo ldconfig
- 編譯的過程中遇到一個(gè)問題:
/sbin/ldconfig.real: /usr/lib/nvidia-375/libEGL.so.1 is not a symbolic link
/sbin/ldconfig.real: /usr/lib32/nvidia-375/libEGL.so.1 is not a symbolic link
解決方法參考:libEGL.so.1 不是符號(hào)連接
sudo mv /usr/lib/nvidia-375/libEGL.so.1 /usr/lib/nvidia-375/libEGL.so.1.org
sudo mv /usr/lib32/nvidia-375/libEGL.so.1 /usr/lib32/nvidia-375/libEGL.so.1.org
sudo ln -s /usr/lib/nvidia-375/libEGL.so.375.39 /usr/lib/nvidia-375/libEGL.so.1
sudo ln -s /usr/lib32/nvidia-375/libEGL.so.375.39 /usr/lib32/nvidia-375/libEGL.so.1