mirror of
				https://github.com/NVIDIA/cuda-samples.git
				synced 2025-10-21 04:10:08 +08:00 
			
		
		
		
	
		
			
				
	
	
		
			72 lines
		
	
	
		
			3.6 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			72 lines
		
	
	
		
			3.6 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| # dct8x8 - DCT8x8
 | |
| 
 | |
| ## Description
 | |
| 
 | |
| This sample demonstrates how Discrete Cosine Transform (DCT) for blocks of 8 by 8 pixels can be performed using CUDA: a naive implementation by definition and a more traditional approach used in many libraries. As opposed to implementing DCT in a fragment shader, CUDA allows for an easier and more efficient implementation.
 | |
| 
 | |
| ## Key Concepts
 | |
| 
 | |
| Image Processing, Video Compression
 | |
| 
 | |
| ## Supported SM Architectures
 | |
| 
 | |
| [SM 5.0 ](https://developer.nvidia.com/cuda-gpus)  [SM 5.2 ](https://developer.nvidia.com/cuda-gpus)  [SM 5.3 ](https://developer.nvidia.com/cuda-gpus)  [SM 6.0 ](https://developer.nvidia.com/cuda-gpus)  [SM 6.1 ](https://developer.nvidia.com/cuda-gpus)  [SM 7.0 ](https://developer.nvidia.com/cuda-gpus)  [SM 7.2 ](https://developer.nvidia.com/cuda-gpus)  [SM 7.5 ](https://developer.nvidia.com/cuda-gpus)  [SM 8.0 ](https://developer.nvidia.com/cuda-gpus)  [SM 8.6 ](https://developer.nvidia.com/cuda-gpus)  [SM 8.7 ](https://developer.nvidia.com/cuda-gpus)  [SM 8.9 ](https://developer.nvidia.com/cuda-gpus)  [SM 9.0 ](https://developer.nvidia.com/cuda-gpus)
 | |
| 
 | |
| ## Supported OSes
 | |
| 
 | |
| Linux, Windows
 | |
| 
 | |
| ## Supported CPU Architecture
 | |
| 
 | |
| x86_64, ppc64le, armv7l
 | |
| 
 | |
| ## CUDA APIs involved
 | |
| 
 | |
| ### [CUDA Runtime API](http://docs.nvidia.com/cuda/cuda-runtime-api/index.html)
 | |
| cudaMallocArray, cudaFreeArray, cudaFree, cudaMallocPitch, cudaDestroyTextureObject, cudaDeviceSynchronize, cudaCreateTextureObject
 | |
| 
 | |
| ## Prerequisites
 | |
| 
 | |
| Download and install the [CUDA Toolkit 12.2](https://developer.nvidia.com/cuda-downloads) for your corresponding platform.
 | |
| 
 | |
| ## Build and Run
 | |
| 
 | |
| ### Windows
 | |
| The Windows samples are built using the Visual Studio IDE. Solution files (.sln) are provided for each supported version of Visual Studio, using the format:
 | |
| ```
 | |
| *_vs<version>.sln - for Visual Studio <version>
 | |
| ```
 | |
| Each individual sample has its own set of solution files in its directory:
 | |
| 
 | |
| To build/examine all the samples at once, the complete solution files should be used. To build/examine a single sample, the individual sample solution files should be used.
 | |
| > **Note:** Some samples require that the Microsoft DirectX SDK (June 2010 or newer) be installed and that the VC++ directory paths are properly set up (**Tools > Options...**). Check DirectX Dependencies section for details."
 | |
| 
 | |
| ### Linux
 | |
| The Linux samples are built using makefiles. To use the makefiles, change the current directory to the sample directory you wish to build, and run make:
 | |
| ```
 | |
| $ cd <sample_dir>
 | |
| $ make
 | |
| ```
 | |
| The samples makefiles can take advantage of certain options:
 | |
| *  **TARGET_ARCH=<arch>** - cross-compile targeting a specific architecture. Allowed architectures are x86_64, ppc64le, armv7l.
 | |
|     By default, TARGET_ARCH is set to HOST_ARCH. On a x86_64 machine, not setting TARGET_ARCH is the equivalent of setting TARGET_ARCH=x86_64.<br/>
 | |
| `$ make TARGET_ARCH=x86_64` <br/> `$ make TARGET_ARCH=ppc64le` <br/> `$ make TARGET_ARCH=armv7l` <br/>
 | |
|     See [here](http://docs.nvidia.com/cuda/cuda-samples/index.html#cross-samples) for more details.
 | |
| *   **dbg=1** - build with debug symbols
 | |
|     ```
 | |
|     $ make dbg=1
 | |
|     ```
 | |
| *   **SMS="A B ..."** - override the SM architectures for which the sample will be built, where `"A B ..."` is a space-delimited list of SM architectures. For example, to generate SASS for SM 50 and SM 60, use `SMS="50 60"`.
 | |
|     ```
 | |
|     $ make SMS="50 60"
 | |
|     ```
 | |
| 
 | |
| *  **HOST_COMPILER=<host_compiler>** - override the default g++ host compiler. See the [Linux Installation Guide](http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#system-requirements) for a list of supported host compilers.
 | |
| ```
 | |
|     $ make HOST_COMPILER=g++
 | |
| ```
 | |
| 
 | |
| ## References (for more details)
 | |
| 
 | |
| [whitepaper](./doc/dct8x8.pdf)
 |