optimize ossl_sm4_set_key speed
this optimization comes from libgcrypt, increse about 48% speed
Benchmark on my AMD Ryzen Threadripper 3990X
before:
Did
5752000 SM4 setup operations in 1000151us (
5751131.6 ops/sec)
after:
Did
8506000 SM4 setup operations in 1000023us (
8505804.4 ops/sec)
Reviewed-by: Paul Dale <pauli@openssl.org>
Reviewed-by: Hugo Landau <hlandau@openssl.org>
(Merged from https://github.com/openssl/openssl/pull/19270)
(cherry picked from commit
704e8090b4a789f52af07de9a3ebbe11db8e19f8)