Copy-in embeddings in reduced precision and handle precision conversion during inference #73

Closed
mikepapadim wants to merge 3 commits into main from feat/fp16-emb