Commits · 3709281b8872156ad35f4317b55019311570ed33 · Christian Heinigk / dune-codegen

Dec 14, 2018
- Document and cleanup · 3709281b
  René Heß authored 6 years ago
  
  3709281b
Dec 13, 2018
- [skip ci][WIP] Bad hack that hopefully fixes the last test... · f80ca52e
  René Heß authored 6 years ago
  
  f80ca52e
- [skip ci] Choose more sophisticated quadrature order · 7ca0782c
  René Heß authored 6 years ago
  
  7ca0782c
Dec 12, 2018
- [skip ci] Rename setup output · 8cd376a3
  René Heß authored 6 years ago
  
  8cd376a3
Nov 23, 2018

[skip ci] Improve sumfact kernel interface · 38072649

René Heß authored 6 years ago

Introduce different methods for input/output handling: realize_input/output,
realize_direct_input/output and setup_input/output. The setup methods cover
code generation outside the sumfact kernel function (creating the input array or
accumulating the result). realize and realize_direct handle the input/output in
the non-fastdg and fastdg code branches, respectively.

Separate interface methods make it a lot easier to find out where each of those
methods will be applied. Besides that, most interface classes need to provide
more than two of those methods anyway...
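
A minimal sketch of how such a split could look on the input side. Only the method names and the class name SumfactKernelInterfaceBase appear in this log; the signatures, the ExampleInput subclass, the kernel_input_step helper and the returned strings are assumptions made for illustration and are not taken from the project.

```python
class SumfactKernelInterfaceBase:
    """Hypothetical base class; only the method names come from the log."""

    def setup_input(self):
        # Code generated outside the kernel function (e.g. creating the input array).
        raise NotImplementedError

    def realize_input(self):
        # Input handling inside the kernel function, non-fastdg branch.
        raise NotImplementedError

    def realize_direct_input(self):
        # Input handling inside the kernel function, fastdg branch.
        raise NotImplementedError


class ExampleInput(SumfactKernelInterfaceBase):
    """Illustrative subclass that returns labels instead of generated code."""

    def setup_input(self):
        return "setup: create input array"

    def realize_input(self):
        return "realize: read from input array"

    def realize_direct_input(self):
        return "realize_direct: read input directly"


def kernel_input_step(interface, fastdg):
    """Pick the kernel-internal input method for the given code branch (assumed helper)."""
    if fastdg:
        return interface.realize_direct_input()
    return interface.realize_input()
```

With one method per situation, the call site alone shows which branch a piece of generated code belongs to.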

38072649

Nov 22, 2018

[skip ci] Rename sumfact interface methods · 43148454
René Heß authored 6 years ago

43148454
Further conditional restructuring · 5cd4058c
René Heß authored 6 years ago

5cd4058c
[skip ci] Restructure conditionals · 69b5f281
René Heß authored 6 years ago

69b5f281

Restructure where permutation happens for sumfact vectorization · f1382d61

René Heß authored 6 years ago

Non-fastdg: Permutation of the input happens before the sum factorization
kernel, when we set up the input. This is done by a method of the corresponding
interface class.

Fastdg: In this case the input will always be ordered according to x,y,... This
means the permutation needs to happen in the sumfact kernel. Since we want to
vectorize sumfact kernels with different input permutations in an upper/lower
way, we need to do this permutation in the corresponding interface class. This
is done in the realize_direct method; in the vectorized case, the
corresponding methods of the scalar sumfact kernels are called.

f1382d61

Nov 15, 2018
- [skip ci][wip] Move around things in realize_sumfact_kernel_function · 10b3bbde
  René Heß authored 6 years ago
  
  10b3bbde
- [skip ci][wip] Move input permutation to interface classes · e16b2619
  René Heß authored 6 years ago
  
  So far only for fastdg. This should also happen in the non-fastdg case.
  e16b2619
- Add permutation methods to Interface classes · 327658d5
  René Heß authored 6 years ago
  
  Note: They are not yet used, but in the long term the permutation should be handled here, since it is about input/output setup.
  327658d5
Nov 14, 2018

Add direct_is_possible and the quadrature size to the parallel key · db2e2977

René Heß authored 6 years ago

Note:

- direct_is_possible true/false could probably be handled in an upper/lower
  vectorization way.

- Vectorization of SF kernels should be based on cost permuted matrix sequence.

db2e2977

Nov 13, 2018
- [skip ci] Do not use matrix_sequence in VectorizedSumfactKernel · a7950143
  René Heß authored 6 years ago
  
  a7950143
- [Bugfix] Fix cherry-pick · 8735bd57
  René Heß authored 6 years ago
  
  8735bd57
- Move quadrature_permutation to interface SumfactKernelInterfaceBase · dd0363d9
  René Heß authored 6 years ago
  
  dd0363d9
- [Bugfix] Cherry pick · aaee849d
  René Heß authored 6 years ago
  
  aaee849d
- Use cost permuted matrix sequence from SumfactKernel in realization · 45c8f39d
  René Heß authored 6 years ago
  
  45c8f39d
- Add cost permuted matrix sequence to SumfactKernel · aa208392
  René Heß authored 6 years ago
  
  aa208392
- Rename permuted_matrix_sequence · dd0f64a9
  René Heß authored 6 years ago
  
  dd0f64a9
- If we have no SF kernels there is no cost · a9531484
  René Heß authored 6 years ago
  
  a9531484
Nov 09, 2018
- Choose vectorization strategy based on cost permuted matrix sequence · 24391f12
  René Heß authored 6 years ago
  
  24391f12
- Activate gradvec vectorization for unstructured tests · 8b1b7acd
  René Heß authored 6 years ago
  
  8b1b7acd
- Edge consistent grids without detour over gmsh file + cleanup tests · 6c98122f
  René Heß authored 6 years ago
  
  6c98122f
Oct 31, 2018
- Update README.md after moving the project · d21f57ea
  Dominic Kempf authored 6 years ago
  
  d21f57ea
Oct 30, 2018

[!283] Renaming dune-perftool -> dune-codegen · 58159f09

Dominic Kempf authored 6 years ago

Merge branch 'feature/project-renaming' into 'master'

See merge request [dominic/dune-perftool!283]

  [dominic/dune-perftool!283]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/283

58159f09

Readd submodule · f2a9ec7b
Dominic Kempf authored 6 years ago

f2a9ec7b
Fix pep8 after renaming · afae67e5
Dominic Kempf authored 6 years ago

afae67e5
Switch to cloning CI strategy · e3ed3c5a
Dominic Kempf authored 6 years ago
```
Hopefully achieving more robustness w.r.t. submodule changes
```
e3ed3c5a
Renaming part 1 · edb9e7b0
Dominic Kempf authored 6 years ago

edb9e7b0

[!282] Autotune merge · aa5ed756

Dominic Kempf authored 6 years ago

Merge branch 'autotune-merge' into 'master'

Resolving conflicts of [!270]

See merge request [dominic/dune-perftool!282]

  [!270]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/270
  [dominic/dune-perftool!282]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/282

aa5ed756

[!276] Feature/use custom geometry transformation for blockstructured · 5d5de047

Dominic Kempf authored 6 years ago

Merge branch 'feature/use-custom-geometry-transformation' into 'master'

This computes the determinant and the Jacobian inverse transposed directly
within loopy and does not call the corresponding grid functions. Using some
simple precomputations, this is faster if number_of_blocks>=2.

This also allows straightforward vectorization for unstructured grids.

I don't know how the computation of the geometry transformation is done in the
sumfactored case, but maybe there is some overlap, which could be reduced.
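
For reference, a small sketch of the two quantities mentioned above, written out for a 2x2 Jacobian in plain Python. The function name and the restriction to 2x2 are assumptions for illustration; the merge request itself does this within loopy, which is not shown here.

```python
def det_and_jit_2x2(jac):
    """Return the determinant and the inverse transposed of a 2x2 Jacobian.

    jac is a nested tuple ((a, b), (c, d)). Hypothetical helper, not project code.
    """
    (a, b), (c, d) = jac
    det = a * d - b * c
    if det == 0:
        raise ValueError("singular Jacobian")
    inv_det = 1.0 / det
    # The inverse is 1/det * ((d, -b), (-c, a)); transposing swaps the off-diagonal entries.
    jit = ((d * inv_det, -c * inv_det),
           (-b * inv_det, a * inv_det))
    return det, jit


det, jit = det_and_jit_2x2(((2.0, 1.0), (0.0, 4.0)))
```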

See merge request [dominic/dune-perftool!276]

  [dominic/dune-perftool!276]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/276

5d5de047

Merge branch 'master' into feature/autotuned-sumfact-kernel · e3e3aaa3
Dominic Kempf authored 6 years ago

e3e3aaa3

[!271] Add AVX512 single precision transposes and a transpose testing suite · df2c1d60

Dominic Kempf authored 6 years ago

Merge branch 'feature/skylake-single-precision-transposes' into 'master'

See merge request [dominic/dune-perftool!271]

  [dominic/dune-perftool!271]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/271

df2c1d60

[!263] Implement code generation for matrix inversion in C++ · f5c18a67

Dominic Kempf authored 6 years ago

Merge branch 'feature/matrix-inversion' into 'master'

Matrix inversion at code generation time only works to a very limited extent
(up to n=4). We can instead assemble the tensor in C++ and invert it there
(e.g. using Dune::FieldMatrix).

This fixes [#123].

Still TODO:

-   [ ] Vectorized Inversion

See merge request [dominic/dune-perftool!263]

  [#123]: gitlab.dune-project.org/dominic/dune-perftool/issues/123
  [dominic/dune-perftool!263]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/263


Closes #123

f5c18a67

[cmake] Skip all transpose tests on Clang - CI does not accept my partial exclusion · b13701f0
Dominic Kempf authored 6 years ago

b13701f0

Oct 26, 2018

[!277] Add a first implementation of hooks · 2ff0ad9e

Dominic Kempf authored 6 years ago

Merge branch 'feature/code-generation-hooks' into 'master'

This is a first minimal implementation of what code generation hooks from
downstream projects could look like.

There are a few more things to think about (feel invited to share ideas):

-   [x] How to document the arguments and return values expected from hooks
-   [x] How to handle multiple hooks registered to the same hook point and
    their return values (this is quite relevant once you want to do loopy
    transformations in a hook; it means that you want to "chain" the hooks)
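
The "chaining" idea from the second item can be sketched as follows. This is a generic illustration, not the interface introduced by this merge request: register_hook, run_hooks, the "transform" hook point and the list standing in for a loopy kernel are all hypothetical names.

```python
from collections import defaultdict

# Hypothetical registry: hook point name -> callables in registration order.
_hooks = defaultdict(list)


def register_hook(point, func):
    """Register func at the given hook point."""
    _hooks[point].append(func)


def run_hooks(point, value):
    """Chain all hooks at a point: each hook receives the previous hook's return value."""
    for func in _hooks[point]:
        value = func(value)
    return value


# Two downstream hooks on the same hook point, each returning the transformed object.
register_hook("transform", lambda kernel: kernel + ["split_loop"])
register_hook("transform", lambda kernel: kernel + ["vectorize"])
result = run_hooks("transform", ["kernel"])
```

Passing each return value on to the next hook means the order of registration determines the order of the transformations.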

This fixes [#129].

See merge request [dominic/dune-perftool!277]

  [#129]: gitlab.dune-project.org/dominic/dune-perftool/issues/129
  [dominic/dune-perftool!277]: gitlab.dune-project.org/dominic/dune-perftool/merge_requests/277


Closes #129

2ff0ad9e

Oct 25, 2018
- make pep8 happy · 7c99c08e
  Marcel Koch authored 6 years ago
  
  7c99c08e
- relax test requirements · 2b005ac4
  Marcel Koch authored 6 years ago
  
  2b005ac4
- use custom to_global only for volume integrals · 3da16eda
  Marcel Koch authored 6 years ago
  
  3da16eda