Multiple Sequence Alignment Method for Biological Genes Based on Spark Cloud Computing
In the multi sequence alignment process of biological genes,early algorithms only calculate a single Spark cluster pa-rameter,resulting in poor parallel performance of the algorithms.For this purpose,a multi sequence alignment parallel algorithm for biological genes based on Spark cloud computing was designed.The obtained biological genetic sequence data was optimized,and the dynamic planning of the biological gene multi sequence alignment was carried out by calculating the matching degree between different sequences.Spark cloud computing technology was used to build Spark clusters and calculate the parameters of multiple Spark clus-ters.By utilizing the similarities and differences between multiple biological gene sequences,the optimal matching path was selected.On this basis,the parallel computing model for multiple biological gene sequences was established and solved,and the corresponding parallel algorithm for aligning multiple sequences was obtained.Experimental results show that the algorithm has better parallelism and can effectively improve the performance of multiple sequence alignment.