mirror of
https://github.com/NVIDIA/cuda-samples.git
synced 2025-01-19 19:35:49 +08:00
65 lines
3.1 KiB
Markdown
65 lines
3.1 KiB
Markdown
# simpleCUFFT_callback - Simple CUFFT Callbacks
|
|
|
|
## Description
|
|
|
|
Example of using CUFFT. In this example, CUFFT is used to compute the 1D-convolution of some signal with some filter by transforming both into frequency domain, multiplying them together, and transforming the signal back to time domain. The difference between this example and the Simple CUFFT example is that the multiplication step is done by the CUFFT kernel with a user-supplied CUFFT callback routine, rather than by a separate kernel call.
|
|
|
|
## Key Concepts
|
|
|
|
Image Processing, CUFFT Library
|
|
|
|
## Supported SM Architectures
|
|
|
|
[SM 3.5 ](https://developer.nvidia.com/cuda-gpus) [SM 5.0 ](https://developer.nvidia.com/cuda-gpus) [SM 5.3 ](https://developer.nvidia.com/cuda-gpus) [SM 6.0 ](https://developer.nvidia.com/cuda-gpus) [SM 6.1 ](https://developer.nvidia.com/cuda-gpus) [SM 7.0 ](https://developer.nvidia.com/cuda-gpus) [SM 7.2 ](https://developer.nvidia.com/cuda-gpus) [SM 7.5 ](https://developer.nvidia.com/cuda-gpus) [SM 8.0 ](https://developer.nvidia.com/cuda-gpus) [SM 8.6 ](https://developer.nvidia.com/cuda-gpus) [SM 8.7 ](https://developer.nvidia.com/cuda-gpus)
|
|
|
|
## Supported OSes
|
|
|
|
Linux
|
|
|
|
## Supported CPU Architecture
|
|
|
|
x86_64, ppc64le, aarch64
|
|
|
|
## CUDA APIs involved
|
|
|
|
### [CUDA Runtime API](http://docs.nvidia.com/cuda/cuda-runtime-api/index.html)
|
|
cudaFree, cudaMalloc, cudaMemcpy, cudaMemcpyFromSymbol, cudaGetDeviceProperties, cudaGetDevice
|
|
|
|
## Dependencies needed to build/run
|
|
[callback](../../README.md#callback), [CUFFT](../../README.md#cufft)
|
|
|
|
## Prerequisites
|
|
|
|
Download and install the [CUDA Toolkit 11.6](https://developer.nvidia.com/cuda-downloads) for your corresponding platform.
|
|
Make sure the dependencies mentioned in [Dependencies]() section above are installed.
|
|
|
|
## Build and Run
|
|
|
|
### Linux
|
|
The Linux samples are built using makefiles. To use the makefiles, change the current directory to the sample directory you wish to build, and run make:
|
|
```
|
|
$ cd <sample_dir>
|
|
$ make
|
|
```
|
|
The samples makefiles can take advantage of certain options:
|
|
* **TARGET_ARCH=<arch>** - cross-compile targeting a specific architecture. Allowed architectures are x86_64, ppc64le, aarch64.
|
|
By default, TARGET_ARCH is set to HOST_ARCH. On a x86_64 machine, not setting TARGET_ARCH is the equivalent of setting TARGET_ARCH=x86_64.<br/>
|
|
`$ make TARGET_ARCH=x86_64` <br/> `$ make TARGET_ARCH=ppc64le` <br/> `$ make TARGET_ARCH=aarch64` <br/>
|
|
See [here](http://docs.nvidia.com/cuda/cuda-samples/index.html#cross-samples) for more details.
|
|
* **dbg=1** - build with debug symbols
|
|
```
|
|
$ make dbg=1
|
|
```
|
|
* **SMS="A B ..."** - override the SM architectures for which the sample will be built, where `"A B ..."` is a space-delimited list of SM architectures. For example, to generate SASS for SM 50 and SM 60, use `SMS="50 60"`.
|
|
```
|
|
$ make SMS="50 60"
|
|
```
|
|
|
|
* **HOST_COMPILER=<host_compiler>** - override the default g++ host compiler. See the [Linux Installation Guide](http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#system-requirements) for a list of supported host compilers.
|
|
```
|
|
$ make HOST_COMPILER=g++
|
|
```
|
|
|
|
## References (for more details)
|
|
|