1.5 KiB
Introduction
This sample demonstrates linking a libnvvm-generated module with an existing CUDA C library. The LLVM C++ API is used to generate an LLVM IR module that conforms to the NVVM IR specification and contains a call to an externally- defined function, and this module is compiled to PTX with libnvvm. The JIT linker (part of the CUDA Driver API) is then used to assemble the PTX and link it with the math library, creating a linked CUBIN image. This image is then executed on the first CUDA device on the system.
Files
-
cuda-c-linking.cpp - Main source file demonstrating the generated of a PTX file using libnvvm and linking it with a CUDA C device library
-
math-funcs - CUDA C device library source file
-
CMakeLists.txt - CMake build script
Building
This sample is built as part of the libnvvm samples tree. Please see the README file at the root of the libnvvm samples tree for build instructions.
Usage
Once built, the sample can be executed by running the "cuda-c-linking" binary.
Linux:
$ cd $SAMPLES_INSTALL_DIR
$ ./cuda-c-linking
Windows:
$ cd %SAMPLES_INSTALL_DIR%
$ cuda-c-linking.exe
For inspection purposes, the following command-line options are available:
- -save-ptx - Write generated PTX kernel to cuda-c-linking.kernel.ptx
- -save-ir - Write generated LLVM IR to cuda-c-linking.kernel.ll
- -save-cubin - Write linked CUBIN image to cuda-c-linking.linked.cubin