Hello rishikksh20, thanks for your contribution!
I found a problem when training with this code.
At line 415 of fastspeech.py:
    if avg_mel is not None:
        avg_mel = avg_mel.unsqueeze(0)
        # inference
        before_outs, outs, d_outs, _ = self._forward(xs, ilens=ilens, ys=ref_mel, avg_mel=avg_mel,
                                                     is_inference=True,
                                                     phn_level_predictor=phn_level_predictor)  # (L, odim)
    else:
        before_outs, outs, d_outs, _ = self._forward(xs, ilens=ilens, ys=ref_mel, is_inference=True,
                                                     phn_level_predictor=phn_level_predictor)  # (L, odim)
    # inference
    _, outs, _, _, _ = self._forward(xs, ilens, is_inference=True)  # (L, odim)
    return outs[0]
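I think the last _forward call is not needed? It overwrites the outs computed in the branch above (without ys=ref_mel or avg_mel), and it unpacks five return values where the other two calls unpack four. Something like below: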
    if avg_mel is not None:
        avg_mel = avg_mel.unsqueeze(0)
        # inference
        before_outs, outs, d_outs, _ = self._forward(xs, ilens=ilens, ys=ref_mel, avg_mel=avg_mel,
                                                     is_inference=True,
                                                     phn_level_predictor=phn_level_predictor)  # (L, odim)
    else:
        before_outs, outs, d_outs, _ = self._forward(xs, ilens=ilens, ys=ref_mel, is_inference=True,
                                                     phn_level_predictor=phn_level_predictor)  # (L, odim)
    # inference
    # _, outs, _, _, _ = self._forward(xs, ilens, is_inference=True)  # (L, odim)
    return outs[0]
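For illustration, the effect of the extra call can be reproduced with a small toy class. This is only a sketch: ToyModel, its string outputs, and the method name inference_with_extra_call are made up for the example and are not the real code in fastspeech.py, and the real _forward signature and return values may differ.

```python
class ToyModel:
    """Hypothetical stand-in for the model; NOT the real FastSpeech class."""

    def _forward(self, xs, ilens=None, ys=None, avg_mel=None,
                 is_inference=False, phn_level_predictor=False):
        # Tag the output so we can see whether the reference mel was used.
        tag = "conditioned" if ys is not None else "unconditioned"
        outs = [f"{tag}:{xs}"]
        # Assumed 4-tuple: (before_outs, outs, d_outs, extra)
        return None, outs, None, None

    def inference(self, xs, ilens, ref_mel=None, avg_mel=None,
                  phn_level_predictor=False):
        # Build the arguments once instead of duplicating the call per branch.
        kwargs = dict(ilens=ilens, ys=ref_mel, is_inference=True,
                      phn_level_predictor=phn_level_predictor)
        if avg_mel is not None:
            kwargs["avg_mel"] = avg_mel
        _, outs, _, _ = self._forward(xs, **kwargs)
        return outs[0]

    def inference_with_extra_call(self, xs, ilens, ref_mel=None):
        _, outs, _, _ = self._forward(xs, ilens=ilens, ys=ref_mel,
                                      is_inference=True)
        # The extra call replaces `outs`, so ref_mel no longer has any effect.
        _, outs, _, _ = self._forward(xs, ilens, is_inference=True)
        return outs[0]


model = ToyModel()
print(model.inference("x", [1], ref_mel="mel"))
print(model.inference_with_extra_call("x", [1], ref_mel="mel"))
```

With the extra call, the returned output comes from a forward pass that never saw ref_mel, so the result of the first pass is thrown away.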