Denys Vlasenko
|
39369ff460
|
libbb/sha1: use SSE2 in unrolled x86-64 code. ~10% faster
function old new delta
.rodata 108241 108305 +64
sha1_process_block64 3502 3495 -7
------------------------------------------------------------------------------
(add/remove: 5/0 grow/shrink: 1/1 up/down: 64/-7) Total: 57 bytes
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-23 12:57:27 +01:00 |
|
Denys Vlasenko
|
805ececa61
|
whitespace fix
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-08 00:41:09 +01:00 |
|
Denys Vlasenko
|
c3cfcc9242
|
libbb/sha1: x86_64 version: reorder prologue/epilogue insns
Not clear exactly why, but this increases hashing speed
on Skylake from 454 MB/s to 464 MB/s.
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-04 01:45:52 +01:00 |
|
Denys Vlasenko
|
7abb2bb96e
|
libbb/sha1: x86_64 version: tidying up, no code changes
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-03 17:02:48 +01:00 |
|
Denys Vlasenko
|
4387077f8e
|
typo fix
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-03 13:14:09 +01:00 |
|
Denys Vlasenko
|
947bef0dea
|
libbb/sha1: x86_64 version: generate from a script, optimize a bit
function old new delta
sha1_process_block64 3569 3502 -67
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-03 13:10:30 +01:00 |
|
Denys Vlasenko
|
05fd13ebec
|
libbb/sha1: x86_64 version: move to a separate .S file, no code changes
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
|
2022-01-03 12:57:36 +01:00 |
|