- Dec 06, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Nov 29, 2017
-
-
Dominic Kempf authored
-
- Nov 28, 2017
-
-
Dominic Kempf authored
-
- Nov 24, 2017
-
-
Dominic Kempf authored
Still not beautiful...
-
- Nov 23, 2017
-
-
Dominic Kempf authored
-
- Nov 22, 2017
-
-
Dominic Kempf authored
-
- Sep 25, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Sep 22, 2017
-
-
Dominic Kempf authored
It always aims for maximally horizontal vectorization where possible.
-
- Sep 08, 2017
-
-
Dominic Kempf authored
By using loopys group mechanism. Each sum factorization kernel defines a group that conflicts with all other sum factorization groups. Conflicts: python/dune/perftool/sumfact/realization.py python/dune/perftool/sumfact/vectorization.py
-
- Sep 07, 2017
-
-
René Heß authored
Save all stage 1 sum factorization kernels that are used in accumulation expression in the cache during the dry run. Discard all inactive sum factorization kernels in decide_vetorization_strategy.
-
- Sep 01, 2017
-
-
René Heß authored
-
- Aug 31, 2017
-
-
René Heß authored
Make anisotropic quadrature order possible for non DG examples.
-
- Aug 25, 2017
-
-
Dominic Kempf authored
The introduction of FunctionView turned out to be a major problem with more complicated forms. The original idea was to preserver the structure of the finite element in a way, that loops over components of a mixed element are realized by actual loops (treating them with free indices and such). However, this causes quite some nightmares and was never implemented as generically as needed (I even doubt that is possible). However, there is another option, which is to unroll any such loops on a symbolic level. While this may sound like a bad idea at first there is some really positive aspects about it: * ListTensor and ComponentTensor nodes collapse completely (and would otherwise have a big nightmare potential) * Symbolic zeroes do not generate code - important in hyperbolic problems where the system matrices are quite sparse or for axiparallel grids, where geometric quantities have many zeroes. * The compiler would unroll these small loops anyway. * TSFC (and I guess also FFC) do it the same way. Implementing this required me to redo the form splitting algorithm. I rethought it and integrated it into the main ufl->loopy visitor.
-
- Apr 28, 2017
-
-
Dominic Kempf authored
-
- Apr 19, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 13, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 12, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 07, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 06, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 05, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Apr 03, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Mar 31, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
Dominic Kempf authored
-
- Mar 30, 2017
-
-
Dominic Kempf authored
-
Dominic Kempf authored
-