launch bounds (#257)
Co-authored-by: jluitjens <jluitjens@nvidia.com>
luitjens and luitjens authored Aug 26, 2022
1 parent 3b71580 commit beaa582
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion include/matx/kernels/conv.cuh
@@ -28,8 +28,9 @@ typedef enum {
   MATX_C_METHOD_AUTO,
 } matxConvCorrMethod_t;

-#ifdef __CUDACC__
+#ifdef __CUDACC__
 template <typename OutType, typename InType, typename FilterType>
+__launch_bounds__(1024)
 __global__ void Conv1D(OutType d_out, InType d_in, FilterType d_filter,
                        index_t signal_len,
                        matxConvCorrMode_t mode)
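
For readers unfamiliar with the qualifier added here: `__launch_bounds__(1024)` declares that the kernel will never be launched with more than 1024 threads per block, which lets the compiler budget registers for that block size. The sketch below is illustrative only and is not MatX code; the kernel `ScaleKernel` and the helper `QueryMaxThreadsPerBlock` are hypothetical names. It uses the same qualifier placement as the diff and then asks the runtime what block size it reports for the compiled kernel.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel (not part of MatX): scales each element of `data`.
// __launch_bounds__(1024) promises the compiler that no launch will use
// more than 1024 threads per block.
template <typename T>
__launch_bounds__(1024)
__global__ void ScaleKernel(T *data, T factor, int n)
{
  const int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) {
    data[i] *= factor;
  }
}

// Hypothetical helper: returns the maximum block size the runtime reports
// for ScaleKernel<float>, or -1 if the query fails (for example, when no
// CUDA device is available).
inline int QueryMaxThreadsPerBlock()
{
  cudaFuncAttributes attrs;
  const cudaError_t err = cudaFuncGetAttributes(&attrs, ScaleKernel<float>);
  if (err != cudaSuccess) {
    return -1;
  }
  return attrs.maxThreadsPerBlock;
}
```

A launch such as `ScaleKernel<float><<<blocks, 1024>>>(d_data, 2.0f, n)` stays within the declared bound; a launch that requests more threads per block than the bound is expected to fail.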