c.parallel: investigate making ptx-json generate valid PTX #5366

Description

@griwes

Currently, ptx-json emits invalid PTX, which means that we always have to do a two-pass C++ frontend compilation when building kernels, whether or not we would otherwise need it. #5355 serves as a proof of concept for extracting data from PTX generated from LTOIR; adapting that technique in c.parallel would allow us to eliminate one of the C++ frontend passes, speeding up the build step considerably. Investigate making this possible.

Metadata

Assignees

Labels

c: For all items related to the CCCL-C library

Projects

Status
Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests