Dheemanth b7c5481c55
Release v13.3 of the CUDA samples with CUDA 13.3 Toolkit (#435)
This is the release of the CUDA 13.3 samples, which include additions for CUDA Tile C++, and updated CCCL and Python samples.
2026-05-27 16:50:59 -05:00

22 lines
671 B
Markdown

# tileTranspose
## Description
This sample demonstrates how to transpose a 2D matrix using CUDA Tile
C++. Each block handles an n x m sized chunk of the source matrix. The
block loads a chunk, transposes it locally, and stores it to the
correct position in the result matrix. A cuda::tiles::partition_view
is used to model the chunking of the source and result matrices.
## Expected Output
```
Success! Matrix transpose matches expected results.
```
## Prerequisites
- [CUDA Toolkit](https://developer.nvidia.com/cuda-downloads) version 13.3 or later.
- [CUDA Driver](https://www.nvidia.com/en-us/drivers/) version 580 or later.
- Host compiler with C++20 support.