Create cleanup_nested_mmseq.pl for speedup by mmseqs #590
starskyzheng wants to merge 1 commit into oushujun:master from
#Update: 07/11/2025 by Zeyu Zheng, MMSeqs optimization for faster sequence search
Hello Zeyu, Thank you for your contribution! It looks like a nice integration of a much faster aligner. Could you please provide some comparisons between the old and new versions? Some benchmarks on the rice genome would also be great. Thanks!
Hello Zeyu, Sorry for the delay. We've just had a chance to test this script. In rice, it produces results highly similar to those of the original cleanup_nested script. However, the mmseqs script is much slower on our end despite similar CPU usage:
Original cleanup_nested:
New cleanup_nested_mmseqs:
The new script runs more iterations, and even if you count only the shared initial iterations (iter0-4), it took longer (42 min vs. 29 min for the original). Could you please double-check the speed-up aspect of the script? Thank you for your contribution!
… The script has not been used in the EDTA pipeline