Rutwik Choughule
|
bf8c6dd043
|
update lib path for conda
|
2022-01-14 02:31:40 +05:30 |
|
Rutwik Choughule
|
2e41896e1b
|
add and update samples for CUDA 11.6
|
2022-01-13 11:35:24 +05:30 |
|
Rutwik Choughule
|
11de19f00c
|
update command line argument parsing
fix ignoring single letter arguments
|
2021-11-30 23:42:19 +05:30 |
|
Rutwik Choughule
|
471dd47f84
|
update simpleVulkan
fix typo in importCudaExternalMemory()
|
2021-11-26 18:47:48 +05:30 |
|
Rutwik Choughule
|
0563025cde
|
update simpleVulkan sample
|
2021-11-23 18:51:00 +05:30 |
|
Rutwik Choughule
|
3a05f29b94
|
update cuDLA samples
fix missing DPRINTF
fix Makefile
|
2021-11-23 14:34:52 +05:30 |
|
Rutwik Choughule
|
e64c65a0d3
|
update sample conjugateGradientMultiDeviceCG
remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice()
|
2021-11-18 10:16:22 +05:30 |
|
Rutwik Choughule
|
af0e1af181
|
remove duplicate sample
|
2021-11-18 10:15:10 +05:30 |
|
Rutwik Choughule
|
01789304f0
|
update sample bf16TensorCoreGemm to add explicit casting
|
2021-11-01 13:22:44 +05:30 |
|
Rutwik Choughule
|
1f76a2d110
|
add and update samples for CUDA 11.5
|
2021-10-21 16:34:49 +05:30 |
|
Rutwik Choughule
|
3342d604fe
|
update sample conjugateGradientMultiDeviceCG to remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice()
|
2021-08-09 22:52:15 +05:30 |
|
Rutwik Choughule
|
ba04faaf73
|
add and update samples with CUDA 11.4 update 1 support
|
2021-08-03 19:24:45 +05:30 |
|
Rutwik Choughule
|
8789eb6266
|
add and update samples with CUDA 11.4 update 1 support
|
2021-08-03 19:02:58 +05:30 |
|
Rutwik Choughule
|
e950012e72
|
add and update samples with CUDA 11.4 support
|
2021-06-30 11:26:41 +05:30 |
|
Rutwik Choughule
|
95b7cea7bc
|
Merge branch 'master' of https://github.com/NVIDIA/cuda-samples
|
2021-06-10 18:37:53 +05:30 |
|
Rutwik Choughule
|
2aeaf51b11
|
cudaNvSciNvMedia plane offset correction
|
2021-06-10 17:33:24 +05:30 |
|
Rutwik Choughule
|
ba5a483c6e
|
update vulkan samples with SPIR-V shaders
|
2021-06-10 17:30:25 +05:30 |
|
Rutwik Choughule
|
7a5b3e6c8c
|
update vulkan samples with SPIR-V shaders
|
2021-06-02 17:17:21 +05:30 |
|
Rutwik Choughule
|
0787bc0489
|
correction in include path
|
2021-05-04 16:36:42 +05:30 |
|
Rutwik Choughule
|
5c3ec60fae
|
correction in include path
|
2021-05-03 14:17:55 +05:30 |
|
Rutwik Choughule
|
568b39bd5b
|
add and update samples with CUDA 11.3 support
|
2021-04-16 11:54:26 +05:30 |
|
Mahesh Doijade
|
067cb65523
|
-- Add partitioned cuda pipeline prod-cons gemm kernel
-- Add cudaCompressibleMemory sample to use copy engine vs SM writes
depending on arch
|
2021-03-03 22:53:02 +05:30 |
|
Mahesh Doijade
|
b882fa00ee
|
-- Add freeglut and glew64 libs for simpleGL
-- delete freeimage libs and headers, user needs to install it for
building samples depending on it
|
2020-12-11 17:34:13 +05:30 |
|
Mahesh Doijade
|
1cd3264681
|
Add and update samples with CUDA 11.2 support
|
2020-12-10 01:05:32 +05:30 |
|
Mahesh Doijade
|
92b0568792
|
Add cudaNvSciNvMedia sample with/without nvsci* APIs, it takes RGBA image as input and produces YUV via nvmedia
this YUV is consumed by cuda which converts it to grayscale image which is written to file as output
|
2020-11-24 16:28:04 +05:30 |
|
Mahesh Doijade
|
c4e2869a2b
|
add multi-warp cooperative groups based reduction kernel in reduction sample
|
2020-09-24 16:49:58 +05:30 |
|
Mahesh Doijade
|
dd2dba3489
|
helper_string: fix the missing "/" at end of 3 up path in sdkFindFilePath()
|
2020-09-17 20:45:23 +05:30 |
|
Mahesh Doijade
|
44052982a7
|
add ewp tags to VS project of conjugateGradientMultiDeviceCG
|
2020-09-17 20:01:41 +05:30 |
|
Mahesh Doijade
|
cd76533c3f
|
Add and update samples for cuda 11.1 support
|
2020-09-15 23:45:56 +05:30 |
|
Mahesh Doijade
|
e6ce58fef4
|
[p2pBandwidthLatency] increase default buffer size and add support to specific buffer size through command line
|
2020-06-30 19:05:55 +05:30 |
|
Mahesh Doijade
|
0ec4bd58e5
|
cudaCompressibleMemory: when unable to allocate compressible memory waive the execution
|
2020-06-01 20:22:22 +05:30 |
|
Mahesh Doijade
|
908dddb207
|
cudaCompressibleMemory: refactor and refine, and use only cuMemMap API for non-compressible allocation.
hence, remove use of cudaMalloc.
|
2020-05-27 19:00:33 +05:30 |
|
Mahesh Doijade
|
b60c149b17
|
Correct the runtime API section in documentation when sample has both driver as well as runtime API
|
2020-05-22 22:33:02 +05:30 |
|
Mahesh Doijade
|
56b5b4c767
|
-- add cudaCompressibleMemory sample
-- fix simpleD3D12 build issue
-- fix simpleVulkanMMAP shader file locating issue
|
2020-05-22 22:07:30 +05:30 |
|
Mahesh Doijade
|
53e9e413a8
|
Update VS project files to use cuda 11.0 props/targets
|
2020-05-19 18:16:05 +05:30 |
|
Mahesh Doijade
|
4f6e02970b
|
Add and update samples for cuda 11.0 support
|
2020-05-18 22:22:06 +05:30 |
|
Mahesh Doijade
|
6be514679b
|
Add and update samples for cuda 10.2 support
|
2019-10-23 19:00:39 +05:30 |
|
Mahesh Doijade
|
489d9f7b1f
|
conjugateGradientCudaGraphs: Add cusparseScsrmv & cublasSdot instead of
gpuSpMV & gpuDotProduct as they are graph compliant
|
2019-08-19 15:47:24 +05:30 |
|
Mahesh Doijade
|
c8483e0798
|
Add release notes for cuda 10.1 update 2 support
|
2019-08-05 18:58:54 +05:30 |
|
Mahesh Doijade
|
58d0ceff18
|
Add and update samples for cuda 10.1 update 2 support
|
2019-08-02 23:15:47 +05:30 |
|
Mahesh Doijade
|
337815dbee
|
Add and update samples with CUDA 10.1 Update 1 support
|
2019-04-10 20:12:09 +05:30 |
|
Mahesh Doijade
|
1abc294982
|
--Update README.md for cuda 10.1 release
-- fix multidevice CG sample to check for concurrentManagedAccess as it needs systemwide atomics
|
2019-01-23 16:56:20 +05:30 |
|
Mahesh Doijade
|
b458dafcd6
|
Add and update samples with CUDA 10.1 support
|
2019-01-23 01:34:43 +05:30 |
|
Mahesh Doijade
|
32f0fc6111
|
fix the description of CUDA IPC as it is supported on Windows as well since CUDA 9.2
|
2018-10-24 13:35:39 +05:30 |
|
Mahesh Doijade
|
6bbaf9ac80
|
-- use DX_SDK env variable only for non-DX12 samples VS files
as DX12 does not need legacy DX SDK
|
2018-10-09 22:42:25 +05:30 |
|
Mahesh Doijade
|
fcb23487a8
|
-- Add simpleD3D12 sample which demonstrates DX12-CUDA interop
rendering sinewave
-- Update the documentation with description about this sample
|
2018-10-04 20:05:23 +05:30 |
|
Mahesh Doijade
|
91dc60dd96
|
simpleVulkan: windows.h should be included before VersionHelpers.h
|
2018-09-10 20:41:05 +05:30 |
|
Mahesh Doijade
|
21c36d3568
|
Add and Update samples for CUDA 10.0
|
2018-08-24 22:35:15 +05:30 |
|
mdoijade
|
63e044cd0f
|
remove VS project files for sample conjugateGradientMultiDeviceCG
as windows support is not expected
|
2018-04-16 22:13:17 +05:30 |
|
mdoijade
|
0bce11f7a5
|
-- Update README.md with changes for repo link and other updates
-- remove MDCG support on windows
-- integrate fixes to nvrtc samples makefile and driver api samples makefiles
|
2018-04-11 00:09:13 +05:30 |
|