Get a working prototype of indexed convolution written in CUDA.
Assigned to @luca.antiga and @jacquemont.