Skip to content

upgrade to cuda9(Volta) GPU arch#118

Open
dzhwinter wants to merge 1 commit intobaidu-research:masterfrom
dzhwinter:master
Open

upgrade to cuda9(Volta) GPU arch#118
dzhwinter wants to merge 1 commit intobaidu-research:masterfrom
dzhwinter:master

Conversation

@dzhwinter
Copy link
Collaborator

Currently, I fix the cuda9 error and some issues in #117. However, our branch has diverged with original repo, this PR cherry-picked from my repo, now it can be safely merged. Please review this PR in your spare time, thanks!

@dzhwinter dzhwinter mentioned this pull request Apr 28, 2018
@t-vi
Copy link

t-vi commented Jul 4, 2018

@dzhwinter @wangkuiyi Say, would you be able to shed some light on the mask (0 if I see this correctly) you are using? in most other venues, it seems to be that shfl is replaced by shfl sync with all mask bits set. I would be most grateful!

@Diamondfan
Copy link

Seems not work for me with cuda9.2 and V100. With "./test_cpu: error while loading shared libraries: libwarpctc.so: cannot open shared object file: No such file or directory". Any suggestion?

@ThomasDelteil
Copy link

ThomasDelteil commented Oct 31, 2018

can this PR be merged? WarpCTC is failing on V100 otherwise.

edit: With this branch I am getting:

./test_gpu
Running GPU tests
Some or all tests fail

@aonotas
Copy link

aonotas commented Apr 8, 2019

@Diamondfan
maybe you need to set

$ cd build
$ export LD_LIBRARY_PATH=./:$LD_LIBRARY_PATH
$ ./test_gpu

I also get same error as @ThomasDelteil.

$ ./test_gpu
Running GPU tests
Some or all tests fail

I use cuda-9.0 and Volta-100.

@IamMegatron2025
Copy link

IamMegatron2025 commented Apr 22, 2019

use cuda10, and modify the code in CMakeLists.txt:

IF (CUDA_VERSION GREATER 7.6)
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_60,code=sm_60")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_61,code=sm_61")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_62,code=sm_62")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_70,code=sm_70")
ENDIF()

it works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants