cuda-samples

mirror of https://github.com/NVIDIA/cuda-samples.git synced 2025-08-23 23:40:31 +08:00

Author	SHA1	Message	Date
Matt Davis	1b36fefcd5	Test the compute capability minor number prior to using the result value.	2025-08-04 18:00:26 +00:00
Rob Armstrong	a5267b83a5	Merge branch 'shawnz_bug_fix' into 'master' Bug 5412815: Fix the issue of cudaTensorCoreGemm.cu See merge request cuda-samples/cuda-samples!125	2025-07-28 10:18:27 -07:00
Matt Davis	903c840a8c	Update the README mentioning testing requires a compatible driver.	2025-07-25 12:40:16 +00:00
Matt Davis	24e7c5b428	Update compute capability tests to look for >=sm75.	2025-07-25 10:28:53 +00:00
shawnz	b38ed29c95	Bug 5412815: Fix the issue of cudaTensorCoreGemm.cu	2025-07-25 15:16:12 +08:00
shawnz	2369d4c649	Fix the FindNVSCI.cmake syntax error and the typo in README.md	2025-07-10 17:13:31 +08:00
Rob Armstrong	8433f89993	Globally remove VSCode tasks.json referencing obsolete build steps	2025-07-07 08:31:22 -07:00
Rob Armstrong	45314d7ff8	README formatting changes	2025-06-24 12:44:08 -07:00
shawnz	0d61031846	Bug 5355361: Update README.md of 7_libNVVM for forward compatibility sample build	2025-06-24 17:04:28 +08:00
shawnz	e674cc36fe	Bug 5339530: Set socket creating folder to /tmp for QNX	2025-06-13 16:34:40 +08:00
shawnz	ce28796d6c	Bug 5189457: Disable -no-pie for hpc	2025-06-11 16:25:22 +08:00
shawnz	9424c15848	Bug 5331767: Specify sm list of cdp samples for QNX	2025-06-10 15:33:07 +08:00
shawnz	9075c50a3d	Bug 5323034 and 5323144: Disable .rsp for linking as qcc doesn't support lib path with double quotes in .rsp on QNX	2025-06-10 15:21:29 +08:00
shawnz	de5fa98e6e	Bug 5323118: Remove the -lpthread and -lrt which are not supported on QNX	2025-06-10 15:18:04 +08:00
shawnz	6c9e9d3cd2	Bug 5323163: Get correct cuda include path for finding header files	2025-06-10 15:15:56 +08:00
shawnz	7f5390cec3	Bug 5323124: Waive simpleAWBarrier on QNX	2025-06-10 15:14:33 +08:00
shawnz	5f6d46dfea	Bug 5323018: Update the CMakeLists.txt and Common/helper_multiprocess.cpp of ptxjit and memMapIPCDrv for QNX cross build	2025-06-09 19:53:25 +08:00
shawnz	49307463b5	Bug 5133216: Add QNX tooltrain and cross build support	2025-05-29 16:57:28 +08:00
Rob Armstrong	9c0d5aaae6	Merge branch 'shawnz_bug_fix' into 'master' Bug 5305842 and 5305854: Update CMakeLists for SBSA CUDA Toolkit support on aarch64-linux platforms See merge request cuda-samples/cuda-samples!115	2025-05-28 08:26:16 -07:00
shawnz	0d08748ffa	Bug 5305842: Update CMakeLists.txt for cdp samples for sbsa CUDA Toolkit supporting on aarch64-linux platforms	2025-05-28 11:42:49 +08:00
Rob Armstrong	27f47634a0	Merge branch 'shawnz_bug_fix' into 'master' Bug 5300528: Add MPI_C_LIBRARIES for user defined MPI path See merge request cuda-samples/cuda-samples!114	2025-05-23 07:57:54 -07:00
shawnz	40d297dfe7	Bug 5300528: Add MPI_C_LIBRARIES for user defined MPI path	2025-05-23 14:47:55 +08:00
Rob Armstrong	20b90b5063	Merge public 12.9 changes into 13.0 dev	2025-05-22 11:45:07 -07:00
Rob Armstrong	8a9e2c830c	Update 1_Utilities/README.md to redirect bandwidthTest to NVBandwidth (#371 )	2025-05-22 11:43:14 -07:00
Rob Armstrong	f47f06f077	Merge branch 'shawnz_bug_fix' into 'master' Update the CMAKE_CUDA_ARCHITECTURES from 50 to 75 for Samples/7_libNVVM/cuda-c-linking See merge request cuda-samples/cuda-samples!113	2025-05-22 08:16:44 -07:00
shawnz	0c78a3e0de	Update the CMAKE_CUDA_ARCHITECTURES from 50 to 75 for Samples/7_libNVVM/cuda-c-linking	2025-05-22 15:52:51 +08:00
Rob Armstrong	8219570c15	Merge branch 'shawnz_bug_fix' into 'master' Bug fix for 5280038, 5277193, 5281036 and 5294720 See merge request cuda-samples/cuda-samples!112	2025-05-21 10:51:12 -07:00
Rob Armstrong	fc88988f23	Merge public changes to internal ToT	2025-05-21 09:28:35 -07:00
shawnz	1141dd7af4	Bug 5281036: Limit the register number of debug version for cdpAdvancedQuicksort	2025-05-20 14:47:44 +08:00
shawnz	da3b7a2b3c	Update the vulkanImageCUDA/vulkanImageCUDA.cu for Windows headers	2025-05-19 17:43:08 +08:00
shawnz	5987a9e9fa	Update transpose for code format check	2025-05-19 17:38:42 +08:00
shawnz	107f3f537f	Update the include files sequence for vulkan samples on Windows	2025-05-19 17:38:22 +08:00
shawnz	c1b03b9f81	Bug 5277193: Remove the CUFFT_LICENSE_ERROR checking as it is deprecated since CUDA13.0	2025-05-16 11:21:07 +08:00
shawnz	ee0ee417c9	Update the sample list for API changes in CHANGELOG	2025-05-14 16:02:17 +08:00
shawnz	ebc1078379	Bug 5280038: Update cuda-c-linking as per CUDA 13.0 API change	2025-05-14 15:54:41 +08:00
Rob Armstrong	2861e78272	Merge branch 'peggyt_bug_fix' into 'master' Bug 5056055: limit register usage to 128 per thread in debug mode See merge request cuda-samples/cuda-samples!110	2025-05-12 08:55:45 -07:00
shawnz	8f33cc6094	Bug 5274280: Enable 8_Platform_Specific/Tegra/EGLSync_CUDAEvent_Interop	2025-05-12 15:02:31 +08:00
shawnz	2ec9cf394a	Bug 5272236: Update the include file copy path as path changes on 13.0	2025-05-12 15:00:52 +08:00
Peggy Tian	770e433a9e	Bug 5056055: limit register usage to 128 per thread in debug mode to comply with the maximum number of 32-bit registers per SM	2025-05-12 06:04:22 +00:00
Rob Armstrong	c6af90553e	Merge branch 'master' into 'master' Bug 5236593: Increase the pending kernel launch limit to 4096 See merge request cuda-samples/cuda-samples!109	2025-05-07 09:47:51 -07:00
Peggy Tian	611008fa86	Bug 5236593: Increase the pending kernel launch limit to 4096	2025-05-07 17:38:52 +08:00
shawnz	bf628887f1	Bug 5217339: Replace SM 101 with 110 for Thor	2025-05-07 14:10:35 +08:00
Rob Armstrong	330dd8472f	Merge external PR #363	2025-05-05 08:45:01 -07:00
Francesco Rizzi	b530f1cf42	Fix bug in 6_Performance/transpose: copy sharedmem kernel (#363 ) Update kernel loop bounds handling, main loop data copy to avoid incorrect reuse of output results. --------- Authored-by: Francesco Rizzi <francesco.rizzi@ng-analytics.com>	2025-05-05 08:43:23 -07:00
Rob Armstrong	f2645c5df8	Final merge of 12.9 changes into cuda_a_dev	2025-05-01 09:55:03 -07:00
Rob Armstrong	c70d79cf3b	Final 12.9 README updates	2025-05-01 09:39:06 -07:00
Rob Armstrong	9ac81370fa	Update 12.9 changes from 'master' into 'cuda_a_dev'	2025-04-30 09:48:21 -07:00
Rob Armstrong	14b1bfdcc4	Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit"	2025-04-30 09:46:45 -07:00
shawnz	b27b55ec70	Bug 5241914: Fix the error message for cuSolverDn_LinearSolver	2025-04-27 16:57:02 +08:00
shawnz	49159f3739	Bug 5164417 and 5097376: Fix the OpenMP issue finding issue for MSVC and Glang	2025-04-27 16:50:12 +08:00

1 2 3 4 5 ...

350 Commits