Commits · 0c68888cc8de4ec226d76551efec45d74803898f · Christian Heinigk / dune-codegen

Dec 06, 2017
- Fixups · 0c68888c
  Dominic Kempf authored 7 years ago
  
  0c68888c
- Implement everything through a costmodel and adapt the ini options · 94e8e7df
  Dominic Kempf authored 7 years ago
  
  94e8e7df
- Implement a printing facility for vectorization strategies · 7c27277d
  Dominic Kempf authored 7 years ago
  
  7c27277d
- Implement a generator for *all* vectorization opportunities · 9087f6c9
  Dominic Kempf authored 7 years ago
  
  9087f6c9
Nov 29, 2017
- Allow full vertical vectorization through diagonal code path · 4772fdbc
  Dominic Kempf authored 7 years ago
  
  4772fdbc
Nov 28, 2017
- Make options ints · 847dc0ee
  Dominic Kempf authored 7 years ago
  
  847dc0ee
Nov 24, 2017
- Beautify the storage mechanism of custom quadrature order a bit · 01d103c2
  Dominic Kempf authored 7 years ago
  
  Still not beautiful...
  01d103c2
Nov 23, 2017
- Have vertical vectorization optionally set quadrature order · 850e0dc5
  Dominic Kempf authored 7 years ago
  
  850e0dc5
Nov 22, 2017
- Allow explicit control of the diagonal vectorization strategy · 3a6fc602
  Dominic Kempf authored 7 years ago
  
  3a6fc602
Sep 25, 2017
- Add a padding heuristic · a75470f1
  Dominic Kempf authored 7 years ago
  
  a75470f1
- Fix python3 integer division · 874cc208
  Dominic Kempf authored 7 years ago
  
  874cc208
Sep 22, 2017
- Implement a greedy vectorization strategy · 0e8b4ea5
  Dominic Kempf authored 7 years ago
  
  It always aims for maximally horizontal vectorization where possible.
  0e8b4ea5
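
A minimal sketch of what a greedy, horizontal-first choice could look like. Everything here is an assumption made for illustration and is not taken from the repository: the function name `greedy_horizontal`, the notion of a `signature` that marks kernels as combinable, and the default SIMD width of 4.

```python
from collections import defaultdict


def greedy_horizontal(kernels, width=4):
    """Greedily pack combinable kernels into full SIMD batches.

    kernels: list of (name, signature) pairs. Kernels that share a
    signature are assumed (hypothetically) to be combinable side by side.
    Returns a list of tuples; a tuple of length `width` is one horizontal
    batch, a tuple of length 1 is a kernel left unvectorized.
    """
    by_signature = defaultdict(list)
    for name, signature in kernels:
        by_signature[signature].append(name)

    batches = []
    for names in by_signature.values():
        # Take full batches for as long as enough kernels remain.
        while len(names) >= width:
            batches.append(tuple(names[:width]))
            names = names[width:]
        # Whatever is left over does not fill a vector register.
        batches.extend((name,) for name in names)
    return batches


example = [("k0", "A"), ("k1", "A"), ("k2", "A"), ("k3", "A"),
           ("k4", "A"), ("k5", "B")]
result = greedy_horizontal(example)
```

In this toy model the leftover kernels stay scalar. Per the later commits in this log, a real strategy can also weigh vertical or diagonal alternatives for them.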
Sep 08, 2017

Make sure that sum factorization kernels are not interleaved · 5b20e6c3

Dominic Kempf authored 7 years ago

By using loopy's group mechanism. Each sum factorization kernel
defines a group that conflicts with all other sum factorization
groups.

Conflicts:
	python/dune/perftool/sumfact/realization.py
	python/dune/perftool/sumfact/vectorization.py

5b20e6c3
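
The idea of mutually conflicting groups can be illustrated with a small, self-contained model. This is a sketch under stated assumptions and does not call loopy. `Insn`, `make_kernel_insns` and `respects_conflicts` are hypothetical names. The rule being modelled is that a group counts as active from its first to its last scheduled instruction, and an instruction may not run while a group it conflicts with is active.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insn:
    name: str
    group: str            # the sum factorization kernel this belongs to
    conflicts: frozenset  # groups that must not be active when this runs


def make_kernel_insns(kernels):
    """kernels: dict mapping a group name to its instruction names.

    Every instruction conflicts with all groups except its own.
    """
    all_groups = frozenset(kernels)
    return [Insn(name, group, all_groups - {group})
            for group, names in kernels.items()
            for name in names]


def respects_conflicts(schedule):
    """Return True if no instruction runs while a conflicting group is active."""
    remaining = {}
    for insn in schedule:
        remaining[insn.group] = remaining.get(insn.group, 0) + 1

    started = set()
    for insn in schedule:
        # A group is active once started and until its last instruction ran.
        active = {g for g in started if remaining[g] > 0}
        if active & insn.conflicts:
            return False
        started.add(insn.group)
        remaining[insn.group] -= 1
    return True


a0, a1, b0, b1 = make_kernel_insns({"sf_a": ["a0", "a1"], "sf_b": ["b0", "b1"]})
sequential_ok = respects_conflicts([a0, a1, b0, b1])
interleaved_ok = respects_conflicts([a0, b0, a1, b1])
```

Under this rule any schedule that interleaves two kernels is rejected, while running them one after another in either order is accepted.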

Sep 07, 2017

Do not generate code for stage 1 sumfact kernels that don't get used · 77c56f64

René Heß authored 7 years ago

Save all stage 1 sum factorization kernels that are used in
accumulation expressions in the cache during the dry run. Discard all
inactive sum factorization kernels in decide_vectorization_strategy.

77c56f64

Sep 01, 2017
- Make it possible to report number of sf kernels · 4ab13135
  René Heß authored 7 years ago
  
  4ab13135
Aug 31, 2017
- Anisotropic quadrature order (not for facets!) · 55e7cc0b
  René Heß authored 7 years ago
  
  Make anisotropic quadrature order possible for non-DG examples.
  55e7cc0b
Aug 25, 2017

Rewrite accumulation term splitting to not use FunctionView · 0f957482

Dominic Kempf authored 7 years ago

The introduction of FunctionView turned out to be a major problem
with more complicated forms. The original idea was to preserve the
structure of the finite element in such a way that loops over components
of a mixed element are realized by actual loops (treating them with
free indices and such). However, this causes quite some nightmares and
was never implemented as generically as needed (I even doubt that is
possible).

However, there is another option, which is to unroll any such loops
on a symbolic level. While this may sound like a bad idea at first,
there are some really positive aspects to it:
* ListTensor and ComponentTensor nodes collapse completely (and would
  otherwise have a big nightmare potential)
* Symbolic zeroes do not generate code - important in hyperbolic problems
  where the system matrices are quite sparse or for axis-parallel grids,
  where geometric quantities have many zeroes.
* The compiler would unroll these small loops anyway.
* The compiler would unroll these small loops anyway.
* TSFC (and I guess also FFC) do it the same way.

Implementing this required me to redo the form splitting algorithm.
I rethought it and integrated it into the main ufl->loopy visitor.

0f957482
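
The effect of unrolling on a symbolic level can be sketched on a toy matrix-vector product. This is an illustrative assumption and not the ufl->loopy visitor from the commit. `unroll_matvec` is a hypothetical helper, symbols are plain strings, and the integers 0 and 1 stand in for symbolic zero and one.

```python
def unroll_matvec(matrix, vector):
    """Unroll sum_j matrix[i][j] * vector[j] into one expression per row.

    Entries are either symbol names (str) or the integers 0 and 1.
    Zero entries produce no term at all, so they generate no code.
    """
    rows = []
    for row in matrix:
        terms = []
        for coeff, unknown in zip(row, vector):
            if coeff == 0:
                continue                   # symbolic zero: drop the term
            if coeff == 1:
                terms.append(unknown)      # multiplying by one is a no-op
            else:
                terms.append(f"{coeff}*{unknown}")
        rows.append(" + ".join(terms) if terms else "0")
    return rows


# A sparse 2x2 system matrix, as might appear in a hyperbolic problem.
system = [[0, "c"],
          ["c", 0]]
unrolled = unroll_matvec(system, ["u0", "u1"])
```

A loop over components would have emitted four multiply-adds here; the unrolled form keeps only the two nonzero ones.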

Apr 28, 2017
- [bugfix] fix diagonal mode · 77ed7548
  Dominic Kempf authored 7 years ago
  
  77ed7548
Apr 19, 2017
- [bugfix] avoid vertical splitting of non-horizontal sumfact kernels in diagonal strategy · 7d85651b
  Dominic Kempf authored 7 years ago
  
  7d85651b
- Remove redundant input parameter on SumfactKernel · 23ca7c4b
  Dominic Kempf authored 7 years ago
  
  23ca7c4b
Apr 13, 2017
- Fix python 3 · 751e7798
  Dominic Kempf authored 7 years ago
  
  751e7798
- Basic changes for KNL · 98d4b240
  Dominic Kempf authored 7 years ago
  
  98d4b240
Apr 12, 2017
- WIP · 8cb22cf0
  Dominic Kempf authored 7 years ago
  
  8cb22cf0
- refactor horizontal vec for diagonal vec · 15b6bf4c
  Dominic Kempf authored 7 years ago
  
  15b6bf4c
Apr 07, 2017
- Stage 3 vertical vectorization · 465f90be
  Dominic Kempf authored 7 years ago
  
  465f90be
- Some bugfixes for verticality · cf2b8c63
  Dominic Kempf authored 7 years ago
  
  cf2b8c63
- Working vertical mass matrix example · a3ce6477
  Dominic Kempf authored 7 years ago
  
  a3ce6477
Apr 06, 2017
- fixup · f6884593
  Dominic Kempf authored 7 years ago
  
  f6884593
- WIP · d27a0a68
  Dominic Kempf authored 7 years ago
  
  d27a0a68
- First implementation of vertical vectorization · 6cb3a57e
  Dominic Kempf authored 7 years ago
  
  6cb3a57e
Apr 05, 2017
- Use padding on theta large matrices and get rid of explicit 4s · a8eadc59
  Dominic Kempf authored 7 years ago
  
  a8eadc59
- Refactor treatment of vectorized sumfact kernels · ea799b09
  Dominic Kempf authored 7 years ago
  
  ea799b09
Apr 03, 2017
- First improvement of vectorization strategy · 2c9cdfc6
  Dominic Kempf authored 8 years ago
  
  2c9cdfc6
- AMatrix -> BasisTabulationMatrix · afd6ad29
  Dominic Kempf authored 8 years ago
  
  afd6ad29
Mar 31, 2017
- pep8 · 492e275d
  Dominic Kempf authored 8 years ago
  
  492e275d
- Remove find_sumfact logic · 5c6c33cf
  Dominic Kempf authored 8 years ago
  
  5c6c33cf
- fix input naming · 6e2f3814
  Dominic Kempf authored 8 years ago
  
  6e2f3814
- Checkpoint! · 0eb8cbba
  Dominic Kempf authored 8 years ago
  
  0eb8cbba
Mar 30, 2017
- Also have instruction dependencies on the sumfact node · f786d623
  Dominic Kempf authored 8 years ago
  
  f786d623
- fixup · eff10cb5
  Dominic Kempf authored 8 years ago
  
  eff10cb5
