https://arxiv.org/pdf/1603.06042 This has been on my radar for a while now. But haven't had the time to implement it Tasks: * [ ] Add beam search instead of greedy decoding * [ ] Locally normalize decisions
https://arxiv.org/pdf/1603.06042
This has been on my radar for a while now. But haven't had the time to implement it
Tasks: