blob: 73d27e3b2e18b770b5202e35455af10ae3f52b4e (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
|
HPTT is a high-performance C++ library for out-of-place tensor
transpositions.
Key features:
- Multi-threading support
- Explicit vectorization
- Auto-tuning (akin to FFTW)
- Loop order
- Parallelization
- Multi architecture support
- Explicitly vectorized kernels for (AVX and ARM)
- Supports float, double, complex and double complex data types
- Supports both column-major and row-major data layouts
By default it builds a scalar version, if you want to change it.
Set the environment variable to AVX=yes or ARM=yes (arm only).
If you want to install the Python API set PYTHON=yes.
|