FPGA Efficient Parallel Optimization of Crystal-Kyber
Polynomial multiplication operations limit the practical applications of lattice-based post-quantum cryptog-raphy.In order to improve the performance and efficiency of post-quantum cryptography Crystal_Kyber algorithm,and re-duce the running time and reduce the influence of polynomial multiplication,this paper designs a new butterfly operation unit to optimize the Kyber scheme with prime modulus q=3329.First of all,the algorithm is executed by 16-way parallel scheduling of the new butterfly operation unit,which shortens the calculation cycle.Secondly,using pipeline technology and improved K2RED algorithm,the design and implementation of a new butterfly operation unit for reducing resource con-sumption.Ultimately,the data is stored in the way of multi-RAM,and the multi-channel RAM is optimized to allow data to be stored alternately in RAM and improve the resource reuse rate.The experimental results show that the optimized NTT(number theoretic transform),INTT(Inverse NTT),PWM(point-wise multiplication)efficiency reaches 200 MHz,and the combined execution Kyber efficiency reaches 175 MHz,which is superior to other schemes and has good performance.
post quantum cryptographyCrystal-KyberK2REDbutterfly arithmeticpolynomial multiplicationhardware efficiency