| author | deshevoy <[email protected]> | 2026-02-01 20:25:47 +0300 |
|---|---|---|
| committer | deshevoy <[email protected]> | 2026-02-01 20:51:53 +0300 |
| commit | 2726a4c04b0e9ddc111f2d1ea56390624cb67228 (patch) | |
| tree | 0034e468b28e9208d1de67f3449c20f77d06b17d /contrib/tools/python3/Python/context.c | |
| parent | 8e5c46c2b5a0e107c5ae53fc662af23965d5aa8f (diff) | |
[build] cuda: Introduce CUDA_SRCS macro utilizing parallelized device code compilation
Instead of a single graph node that launches NVCC to compile a .cu file for both the host and all device architectures,
CUDA_SRCS generates multiple nodes:
- one node per device architecture, producing PTX and CUBIN
- a node merging all PTX and CUBIN files into a single FATBIN blob
- a node producing a .cpp file with the host code
- a node compiling the host .cpp with the embedded FATBIN blob
The CUDA_ARCHITECTURES variable determines the list of architectures to compile device code for.
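The node layout described above can be sketched as a small dependency graph. This is only an illustrative model, not the build system's actual implementation: the `Node` class, the `plan_cuda_nodes` function, the node names, and the output file names are all hypothetical, and no real NVCC command lines are shown.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """One hypothetical build-graph node: a name, its inputs, and its outputs."""
    name: str
    inputs: list[str]
    outputs: list[str]


def plan_cuda_nodes(src: str, architectures: list[str]) -> list[Node]:
    """Model the nodes generated for one .cu source (file names are assumptions)."""
    stem = src.removesuffix(".cu")
    nodes: list[Node] = []
    device_outputs: list[str] = []

    # One node per device architecture, each producing PTX and CUBIN.
    for arch in architectures:
        outs = [f"{stem}.{arch}.ptx", f"{stem}.{arch}.cubin"]
        nodes.append(Node(f"device:{arch}", [src], outs))
        device_outputs.extend(outs)

    # One node merging all PTX and CUBIN files into a single FATBIN blob.
    fatbin = f"{stem}.fatbin"
    nodes.append(Node("fatbin", device_outputs, [fatbin]))

    # One node producing a .cpp file with the host code.
    host_cpp = f"{stem}.host.cpp"
    nodes.append(Node("host-cpp", [src], [host_cpp]))

    # One node compiling the host .cpp with the embedded FATBIN blob.
    nodes.append(Node("host-compile", [host_cpp, fatbin], [f"{stem}.o"]))
    return nodes


if __name__ == "__main__":
    # The architecture list stands in for the CUDA_ARCHITECTURES variable.
    for node in plan_cuda_nodes("kernel.cu", ["sm_80", "sm_90"]):
        print(node.name, node.inputs, "->", node.outputs)
```

In this model the per-architecture nodes depend only on the source file, so a scheduler is free to run them concurrently; only the FATBIN merge and the final host compile have to wait for them.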
ISSUE:
commit_hash:0a4c2797dd238ae062482af30694df6978301278
Diffstat (limited to 'contrib/tools/python3/Python/context.c')
0 files changed, 0 insertions, 0 deletions
