[!311] Enabling vectorization for arbitrary number of blocks
Merge branch 'feature/blockstructured-vectorization-for-arbitrary-blocksize' into 'master' ref:extensions/dune-codegen This enables vectorization for arbitrary number of blocks by manually handling the tail of the vectorized loop. The tail can also be vectorized, if a smaller vector length fits. Another approach could be to use padding, but I'm currently not sure how to do that. See merge request [extensions/dune-codegen!311] [extensions/dune-codegen!311]: gitlab.dune-project.org/extensions/dune-codegen/merge_requests/311
Showing
- python/dune/codegen/blockstructured/vectorization.py 253 additions, 102 deletionspython/dune/codegen/blockstructured/vectorization.py
- python/dune/codegen/options.py 2 additions, 0 deletionspython/dune/codegen/options.py
- test/blockstructured/poisson/CMakeLists.txt 6 additions, 1 deletiontest/blockstructured/poisson/CMakeLists.txt
- test/blockstructured/poisson/poisson_tensor.mini 7 additions, 2 deletionstest/blockstructured/poisson/poisson_tensor.mini
- test/blockstructured/poisson/poisson_vec_tail.mini 34 additions, 0 deletionstest/blockstructured/poisson/poisson_vec_tail.mini
Loading
Please register or sign in to comment