Hi,
I notice that SVT is trained after loading the parameters of a pretrained model (including encoder & decoder).
I am curious whether pretraining is necessary. Have you tried training SVT from scratch?
What's the difference between these two training schemes?
If you can read Chinese, I hope we can connect on WeChat to communicate more efficiently. My WeChat ID is He_2262. Many thanks!