- Jul 31, 2019
-
-
Dominic Kempf authored
In the /sumfact_poisson_dg_3d_unstructured_deg2_symdiff_nonquadvec_gradvec test case, this reduces the time spent evaluating it from >100s to 6s.
-
- Apr 26, 2019
-
-
René Heß authored
This just reverts to the old behaviour. The new implementation of loop reordering doesn't need that anymore.
-
- Apr 16, 2019
-
-
René Heß authored
-
- Apr 12, 2019
-
-
René Heß authored
-
- Apr 01, 2019
-
-
René Heß authored
-
- Mar 29, 2019
-
-
René Heß authored
-
- Feb 18, 2019
-
-
René Heß authored
The SumfactKernel object stores a tuple of transformations that will be registered in the realization and applied in the localoperator. These transformations can change the name of the sum factorization kernel function. This way it should be possible to use autotuning for picking transformations.
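A minimal sketch of the idea in this message, not the actual implementation: a kernel object carries a tuple of transformations, and each transformation contributes to the generated function name, so differently transformed variants get distinct names. `KernelSketch`, `Transformation`, `name_suffix`, `function_name` and the base name `sfimpl` are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Transformation:
    """Hypothetical stand-in for a kernel transformation."""
    name: str

    def name_suffix(self) -> str:
        # Each transformation contributes a suffix to the kernel function name.
        return "_" + self.name


@dataclass(frozen=True)
class KernelSketch:
    """Hypothetical, simplified stand-in for a SumfactKernel-like object."""
    base_name: str
    transformations: Tuple[Transformation, ...] = ()

    def function_name(self) -> str:
        # The name depends on the transformations, so an autotuner could
        # generate and benchmark several variants side by side.
        suffixes = "".join(t.name_suffix() for t in self.transformations)
        return self.base_name + suffixes


plain = KernelSketch("sfimpl")
tuned = KernelSketch("sfimpl", (Transformation("reorder"), Transformation("unroll")))
```

Because the sketch uses frozen dataclasses and a tuple, two kernels with different transformations compare unequal and hash differently, which is one plausible way to keep the variants apart in a cache.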
-
- Feb 13, 2019
-
-
Dominic Kempf authored
-
- Feb 12, 2019
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
The permutation is already in. Using the numbers requires the correct global state whenever the key is evaluated, which we cannot guarantee.
-
- Jan 29, 2019
-
-
Dominic Kempf authored
-
- Jan 28, 2019
- Jan 24, 2019
-
-
René Heß authored
The realize_output interface method was not really necessary and just added a layer of indirection.
-
- Jan 23, 2019
-
-
Dominic Kempf authored
-
- Jan 07, 2019
-
-
René Heß authored
-
- Dec 21, 2018
-
-
René Heß authored
-
- Dec 17, 2018
-
-
René Heß authored
-
- Dec 13, 2018
-
-
René Heß authored
-
- Dec 12, 2018
-
-
René Heß authored
-
- Dec 07, 2018
-
-
Dominic Kempf authored
-
- Nov 23, 2018
-
-
René Heß authored
Introduce different methods for realize_input/output, realize_direct_input/output and setup_input/output. The setup methods cover code generation outside the sumfact kernel function (creating the input array or accumulating the result). realize and realize_direct handle the input/output in the nonfastdg and fastdg code branches, respectively. Separate interface methods make it a lot easier to find out where each of those methods will be applied. Besides that, most interface classes need to provide more than two of those methods anyway...
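The split described in this commit message can be pictured roughly as follows. This is a sketch under stated assumptions, not the real interface: `InterfaceSketch` and `input_steps` are hypothetical, the methods only return their own names, and it is an assumption here that the fastdg branch skips the setup step entirely.

```python
class InterfaceSketch:
    """Hypothetical interface class; only the input side is shown."""

    def setup_input(self):
        # Code generated outside the kernel function (e.g. creating the input array).
        return "setup_input"

    def realize_input(self):
        # Input handling inside the kernel in the nonfastdg branch.
        return "realize_input"

    def realize_direct_input(self):
        # Input handling inside the kernel in the fastdg branch.
        return "realize_direct_input"


def input_steps(interface, fastdg):
    """Return which methods would be used for the input (assumed dispatch)."""
    if fastdg:
        return [interface.realize_direct_input()]
    return [interface.setup_input(), interface.realize_input()]


nonfastdg_steps = input_steps(InterfaceSketch(), fastdg=False)
fastdg_steps = input_steps(InterfaceSketch(), fastdg=True)
```

The output side would mirror this with setup_output, realize_output and realize_direct_output.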
-
- Nov 22, 2018
-
-
René Heß authored
-
René Heß authored
Non-fastdg: Permutation of the input happens before the sum factorization kernel, when we set up the input. This is done by a method of the corresponding interface class.
Fastdg: In this case the input will always be ordered according to x,y,... This means the permutation needs to happen in the sumfact kernel. Since we want to vectorize sumfact kernels with different input permutations in an upper/lower way, we need to do this permutation in the corresponding interface class. This is done in the realize_direct method; in the vectorized case, the corresponding methods of the scalar sumfact kernels are called.
-
- Nov 15, 2018
- Nov 14, 2018
-
-
René Heß authored
Note:
- direct_is_possible true/false could probably be handled in an upper/lower vectorization way.
- Vectorization of SF kernels should be based on the cost-permuted matrix sequence.
-
- Nov 13, 2018
- Nov 09, 2018
-
-
René Heß authored
-
- Oct 30, 2018
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Oct 04, 2018
-
-
Dominic Kempf authored
-
- Sep 20, 2018
-
-
Dominic Kempf authored
Needed in dune-perftool-playground for Navier-Stokes.
-