- Nov 22, 2018
René Heß authored
Non-fastdg: The input is permuted before the sum factorization kernel, when the input is set up. This is done by a method of the corresponding interface class.
Fastdg: The input is always ordered according to x, y, ..., so the permutation has to happen inside the sumfact kernel. Since we want to vectorize sumfact kernels with different input permutations in an upper/lower way, the permutation has to be done in the corresponding interface class. It is done in the realize_direct method; in the vectorized case, the corresponding methods of the scalar sumfact kernels are called.
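The two places where the permutation can happen can be illustrated with a small 2D sketch. Everything below is a hypothetical illustration in plain Python: the function names (`sumfact_2d`, `kernel_with_internal_permutation`) and the data are assumptions and do not come from the code base. It only shows that permuting the input before the kernel and permuting it inside the kernel give the same result.

```python
# Hypothetical sketch; none of these names exist in the actual code base.

def transpose(tensor):
    """Swap the two axes of a 2D tensor stored as nested lists."""
    return [list(row) for row in zip(*tensor)]

def apply_along_first_axis(matrix, tensor):
    """Contract matrix (m x n) with the first axis of tensor (n x k)."""
    return [
        [sum(matrix[i][q] * tensor[q][j] for q in range(len(tensor)))
         for j in range(len(tensor[0]))]
        for i in range(len(matrix))
    ]

def sumfact_2d(matrices, tensor):
    """Apply one 1D matrix per direction, rotating the axes after each step."""
    for matrix in matrices:
        tensor = transpose(apply_along_first_axis(matrix, tensor))
    return tensor

def kernel_with_internal_permutation(matrices, tensor, perm):
    """Take input in x,y order and apply the permutation inside the kernel."""
    if perm == (1, 0):
        tensor = transpose(tensor)
        matrices = [matrices[1], matrices[0]]
    return sumfact_2d(matrices, tensor)

A0 = [[1, 2], [3, 4], [5, 6]]   # acts on direction x
A1 = [[1, 0, 2], [0, 1, 1]]     # acts on direction y
X = [[1, 2, 3], [4, 5, 6]]      # input ordered as x,y

# Unpermuted reference result.
reference = sumfact_2d([A0, A1], X)

# Input permuted by the caller before the kernel runs.
pre_permuted = sumfact_2d([A1, A0], transpose(X))

# Input passed in x,y order, permutation applied inside the kernel.
in_kernel = kernel_with_internal_permutation([A0, A1], X, (1, 0))
```

Both permuted variants produce the same tensor, which is the reference result with its axes swapped.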
- Nov 15, 2018
- Nov 14, 2018
René Heß authored
Note:
- direct_is_possible true/false could probably be handled in an upper/lower vectorization way.
- Vectorization of SF kernels should be based on the cost-permuted matrix sequence.
- Nov 13, 2018
- Nov 09, 2018
René Heß authored
- Oct 30, 2018
Dominic Kempf authored
Dominic Kempf authored