Hello,
I'm trying to reimplementate your work with A100*8EA.
However, performance was not reached to your one.
I found that some configurations are different compared with your training log.
Could you share more details about training configurations or exact training scripts?