author | Wladimir J. van der Laan <laanwj@gmail.com> | 2017-07-20 20:16:28 +0200 |
---|---|---|
committer | Wladimir J. van der Laan <laanwj@gmail.com> | 2017-07-20 20:28:35 +0200 |
commit | 16240f43a550c9b0c34d541df5fa80c2fb43d17c (patch) | |
tree | c11cfd3de39043f74eb7002c0a8a585ced03e906 /src/primitives | |
parent | 7c2400cb8ab7ebd5fe374b1f69657e0b0718ab73 (diff) | |
parent | 6b8d872e5e2dd68a5229ec55f5261dae34ff9bdb (diff) |
Merge #10821: Add SSE4 optimized SHA256
6b8d872 Protect SSE4 code behind a compile-time flag (Pieter Wuille)
fa9be90 Add selftest for SHA256 transform (Pieter Wuille)
c1ccb15 Add SSE4 based SHA256 (Pieter Wuille)
2991c91 Add SHA256 dispatcher (Pieter Wuille)
4d50f38 Support multi-block SHA256 transforms (Pieter Wuille)
Pull request description:
This adds an SSE4 assembly version of the SHA256 transform by Intel. It is used at run time if SSE4 instructions are available; otherwise a fallback C++ implementation is used. Nearly every x86_64 CPU supports SSE4. The feature is only enabled when compiled with `--enable-experimental-asm`.
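The run-time selection described above can be sketched roughly as follows. This is an illustration, not the code from this merge: the names `TransformFn`, `TransformPortable`, `TransformAccelerated`, `CpuHasSSE41` and `SelectTransform` are hypothetical, the two transform bodies are placeholders rather than SHA256, and checking CPUID leaf 1, ECX bit 19 (SSE4.1) is an assumption about which feature bit is relevant. It relies on `__get_cpuid` from GCC/Clang's `<cpuid.h>` and falls back to the portable path on other compilers or architectures.

```cpp
#include <cstddef>
#include <cstdint>
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <cpuid.h>
#define SKETCH_HAVE_CPUID 1
#endif

// Hypothetical signature for a multi-block transform:
// process `blocks` consecutive 64-byte chunks of `data` into `state`.
using TransformFn = void (*)(uint32_t* state, const unsigned char* data, size_t blocks);

// Placeholder body (NOT SHA256): stands in for the portable C++ transform.
void TransformPortable(uint32_t* state, const unsigned char* data, size_t blocks) {
    for (size_t i = 0; i < blocks * 64; ++i) state[i % 8] += data[i];
}

// Placeholder for an assembly-backed transform. It must produce the same
// result as the portable one, so this sketch simply forwards to it.
void TransformAccelerated(uint32_t* state, const unsigned char* data, size_t blocks) {
    TransformPortable(state, data, blocks);
}

// Assumption: SSE4.1 support is reported in CPUID leaf 1, ECX bit 19.
bool CpuHasSSE41() {
#ifdef SKETCH_HAVE_CPUID
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;
    return __get_cpuid(1, &eax, &ebx, &ecx, &edx) && ((ecx >> 19) & 1);
#else
    return false;  // no CPUID helper available: always use the fallback
#endif
}

// Pick the implementation once; callers then go through the pointer.
TransformFn SelectTransform() {
    return CpuHasSSE41() ? TransformAccelerated : TransformPortable;
}
```

Selecting through a function pointer keeps the hot path to a single indirect call, and the CPU check only has to run once at startup.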
In order to avoid build dependencies and other complications, the original Intel YASM code was translated to GCC extended asm syntax.
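For readers unfamiliar with GCC extended asm, here is a minimal sketch of the syntax, unrelated to the translated Intel code itself. It wraps a single x86 rotate instruction (rotations are used heavily in SHA256) and states its operands and clobbers so the compiler can allocate registers around it. The function name `RotateRight` is hypothetical, and a plain C++ path is used when not building with a GCC-compatible compiler for x86.

```cpp
#include <cstdint>

// Rotate a 32-bit value right by n bits (n is taken modulo 32).
uint32_t RotateRight(uint32_t x, uint8_t n) {
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    // Extended asm (AT&T syntax): template : outputs : inputs : clobbers.
    //   "+r"(x)  x is read and written, in any general-purpose register
    //   "c"(n)   n is placed in the C register, so the count is in %cl
    //   "cc"     the instruction may modify the condition flags
    __asm__("rorl %%cl, %0" : "+r"(x) : "c"(n) : "cc");
    return x;
#else
    // Portable equivalent; the masks avoid an undefined shift by 32.
    return (x >> (n & 31)) | (x << ((32 - n) & 31));
#endif
}
```

Because the assembly lives inside an ordinary C++ function, it is built by the same compiler as the rest of the file and needs no separate assembler in the toolchain.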
This gives around a 50% speedup on the SHA256 benchmark for me.
It is based on an earlier patch by @laanwj, though it only includes a single assembly version (for now) and removes the YASM dependency.
Tree-SHA512: d31c50695ceb45264291537b93c0d7497670be38edf021ca5402eaa7d4e1e0e1ae492326e28d4e93979d066168129e62d1825e0384b1b906d36f85d93dfcb43c
Diffstat (limited to 'src/primitives')
0 files changed, 0 insertions, 0 deletions