Skip to content
Snippets Groups Projects
Commit 11e29c44 authored by Dominic Kempf's avatar Dominic Kempf
Browse files

[!305] Performance transformation: Loop reordering in sumfact kernel

Merge branch 'feature/sumfact-loop-reordering' into 'master'

ref:extensions/dune-codegen Performance transformation through loop nest
reordering. There are two ways to reorder loops in a tensor contraction:

1.  Directly accumulate in output variable after setting to zero
2.  Accumulating in a large enough temporary

This merge request implements these ways of loop reordering and the
possibility to create an autotune target directly from the loopy kernel.

See merge request [extensions/dune-codegen!305]

  [extensions/dune-codegen!305]: gitlab.dune-project.org/extensions/dune-codegen/merge_requests/305
parents 0559d50f ecdeea01
No related branches found
No related tags found
No related merge requests found
Showing
with 1047 additions and 38 deletions
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment