shawnz | b013387a39 | Update code format | 2025-04-03 11:23:26 +08:00 |
shawnz | ad9908e32b | Bug4914019 & 4191696: Append pid in shmName for MIG multiple thread scenario | 2025-04-02 11:20:09 +08:00 |
Rob Armstrong | ceab6e8bcc | Apply consistent code formatting across the repo. Add clang-format and pre-commit hooks. | 2025-03-27 10:30:07 -07:00 |
Rob Armstrong | c0ab53f986 | Update all sample CMakeLists.txt to include ENABLE_CUDA_DEBUG flag to enable cuda-gdb | 2025-03-26 10:08:59 -07:00 |
Rob Armstrong | b87c243bbb | Add -lineinfo flag to all targets to include line information for developer tools | 2025-03-26 09:44:20 -07:00 |
Rob Armstrong | e214cd29aa | Update gencode arguments for separate kernel fatbin builds | 2025-03-26 09:28:37 -07:00 |
Rob Armstrong | 9370f11e69 | graphConditionalNodes: Additional tweaks to launch dimension initialization (#348) | 2025-03-05 18:18:37 -08:00 |
Rob Armstrong | 8d901e745d | graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346) | 2025-03-05 08:33:35 -08:00 |
Rob Armstrong | bcad2c9e61 | graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup (#344) | 2025-03-03 17:50:22 -08:00 |
XSShawnZeng | c357dd1e6b | Fixing issue #321: A potential bug in memMapIPCDrv/memMapIpc.cpp (#334) | 2025-02-21 09:14:25 -08:00 |
Rob Armstrong | 940a4c7a91 | memMapIpc: Resolve build-time warnings and minor potential issues (#329) | 2025-02-19 15:52:20 -08:00 |
* Fix compute performance calculation type casting in gpuGetMaxGflopsDeviceIdDRV() for #109
* 3_CUDA_Features/memMapIPCDrv: Increase procIdx buffer size to prevent potential buffer overflow
* memMapIPCDrv: Fix memory leaks and improve header inclusion
  - Remove redundant string.h header
  - Add memory cleanup for dynamically allocated JIT options and log buffer
  - Fix printf format specifier for unsigned long long
shawnz | fb6fcb0110 | Enhancement for finding GLFW on WIN and copy .dll files to executable dir for some samples | 2025-02-14 22:37:51 +08:00 |
Rob Armstrong | 7a9bd38ecc | Update OpenGL sample build settings on Windows | 2025-02-10 23:12:38 -08:00 |
Rob Armstrong | 95308ffc23 | Add missing build targets to general samples | 2025-02-08 13:04:26 -05:00 |
Rob Armstrong | 56852fbb50 | Add explicit system libraries needed on some Linux distributions | 2025-02-07 15:43:17 -05:00 |
Rob Armstrong | e8a041d783 | Ensure -Wno-deprecated-gpu-targets is passed to PTX and fatbin compilation steps | 2025-02-04 12:07:51 -08:00 |
Rob Armstrong | c631850c15 | Remove compute capability 8.7 build target for non-Tegra builds | 2025-02-04 12:01:38 -08:00 |
Rob Armstrong | 9539ca5fa3 | Remove compute capability 7.2 build target for non-Tegra builds | 2025-02-04 11:58:25 -08:00 |
Rob Armstrong | 0f4bdfad99 | CMake: Add '-Wno-deprecated-gpu-targets' to suppress warning messages during build about Maxwell, Pascal, and Volta | 2025-02-04 11:51:17 -08:00 |
shawnz | ab0b386ac4 | Update tegra SMs] | 2025-01-23 16:20:34 +08:00 |
shawnz | 08fae276b4 | Add Tegra SMs in CMakeLists.txt general samples | 2025-01-23 11:02:56 +08:00 |
Rob Armstrong | b203467419 | Update CUDA architectures list | 2025-01-22 17:49:44 -08:00 |
XSShawnZeng | 95128a1cf3 | Update CMakeLists.txt | 2025-01-20 10:05:45 +08:00 |
XSShawnZeng | 3848a7c63c | Update CMakeLists.txt | 2025-01-20 10:05:18 +08:00 |
XSShawnZeng | d9633314f8 | Update CMakeLists.txt | 2025-01-20 10:04:50 +08:00 |
XSShawnZeng | 576c24f97e | Update CMakeLists.txt | 2025-01-20 10:04:24 +08:00 |
XSShawnZeng | 970a6a1151 | Update CMakeLists.txt | 2025-01-20 10:04:02 +08:00 |
shawnz | 76e2d2052c | Update separate SM list for cdp samples and update the watershedSegmentationNPP for build failed of Bug 4668487 | 2025-01-17 12:04:27 +08:00 |
Jonathan Bentz | e633580eff | a couple more typos in comments | 2025-01-14 18:12:09 -06:00 |
Jonathan Bentz | df9ea79df4 | typo errors in a few CMakelists files | 2025-01-14 18:09:26 -06:00 |
Rob Armstrong | f8fbd04007 | Update CMake module search path | 2025-01-14 09:14:29 -08:00 |
Rob Armstrong | ce045e2ae9 | Merge branch 'cmake_transition' into aarch64_guarding | 2025-01-09 10:03:16 -08:00 |
Rob Armstrong | f753e86e7a | Update all samples to build position-independent code | 2025-01-09 09:59:36 -08:00 |
Jonathan Bentz | a1cf9e4183 | Changes for building for aarch64, specifically jetson orin nano | 2025-01-08 17:05:45 -06:00 |
Jonathan Bentz | f9dd4323af | adding code to Linux-only samples to guard against building on Windows | 2024-12-18 14:36:12 -06:00 |
Rob Armstrong | bfd956bc5e | Remove README references to PPC processors (no longer supported) | 2024-12-18 10:54:37 -08:00 |
Rob Armstrong | 0f5821a8c8 | Remove outdated build instructions from README.md | 2024-12-18 10:52:24 -08:00 |
Rob Armstrong | abb97e1dfb | Update copy_directory_if_different to copy_directory for CMake 3.20 compatibility | 2024-12-17 10:22:02 -08:00 |
Rob Armstrong | d54d4d7419 | Update supported architecture list for some samples | 2024-12-16 16:35:23 -08:00 |
Rob Armstrong | 03719b7623 | Change remaining build targets to specify active SM variants | 2024-12-16 16:17:14 -08:00 |
Rob Armstrong | 090f957854 | Refactor CMakeLists.txt under 3_CUDA_Features | 2024-12-16 14:37:14 -08:00 |
Rob Armstrong | 2f826e305a | Update ptxjit | 2024-12-12 11:34:01 -08:00 |
Rob Armstrong | 9bebdf7ef4 | Update jacobiCudaGraphs, memMapIPCDrv, newdelete, simpleCudaGraphs, tf32TensorCoreGemm, warpAggregatedAtomicsCG | 2024-12-12 11:29:48 -08:00 |
Rob Armstrong | 6fd8228242 | Update graphMemoryFootprint, graphMemoryNodes, immaTensorCoreGemm | 2024-12-12 11:19:17 -08:00 |
Rob Armstrong | 76210c84f7 | Update cudaCompressibleMemory, cudaTensorCoreGemm, dmmaTensorCoreGemm, globalToShmemAsyncCopy, graphConditionalNodes | 2024-12-12 11:17:35 -08:00 |
Rob Armstrong | 62d32b38d7 | Update bindlessTexture, CDP samples | 2024-12-12 11:11:51 -08:00 |
Rob Armstrong | dd73281bc6 | Update StreamPriorities, bf16TensorCoreGemm, binaryPartitionCG | 2024-12-12 11:02:13 -08:00 |
Rob Armstrong | 82bcada84c | Remove now-unnecessary Visual Studio project files | 2024-12-11 16:25:06 +00:00 |
Rob Nertney | 9c688d7ff7 | Updating samples for CUDA 12.5 | 2024-07-25 16:30:13 +00:00 |
Rob Nertney | 5f97d7d0df | Updating graphConditionalNodes orphan directory | 2024-04-10 19:44:42 +00:00 |