I finetuned another LLM on financial Q&A.

From scratch.

Implementation details:
• 355M-parameter LLM
• 6K training samples
• 0.67 training loss
• 0.90 validation loss
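To put the two loss numbers in perspective, here is a minimal sketch that converts them to perplexity. It assumes the reported losses are mean per-token cross-entropy in nats (the default for PyTorch's cross-entropy loss), which the notebook may or may not match exactly. The `perplexity` helper is a hypothetical name used only for illustration.

```python
import math


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


# Loss values as reported in the implementation details above.
train_loss = 0.67
val_loss = 0.90

print(f"train perplexity: {perplexity(train_loss):.2f}")
print(f"val perplexity:   {perplexity(val_loss):.2f}")

# The gap between validation and training loss is a rough overfitting signal.
print(f"val - train gap:  {val_loss - train_loss:.2f} nats")
```

Under that assumption, the perplexities come out to roughly 1.95 (training) and 2.46 (validation), with a gap of about 0.23 nats between the two.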

I used a single A100 and it took ~7 minutes.

Really cool to see the before and after results.

Before: LLM generates random text.

After: LLM generates an answer attempt.
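The jump from random text to an answer attempt comes from training on prompts with a fixed instruction/response layout. Below is a rough sketch of an Alpaca-style prompt template, which is a common choice for instruction finetuning. The exact template in the notebook is an assumption on my part, and the `format_input` and `build_training_text` helpers, the field names, and the sample entry are hypothetical, made up purely for illustration.

```python
def format_input(entry: dict) -> str:
    """Build an Alpaca-style prompt from an entry with 'instruction' and optional 'input'."""
    prompt = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request."
        f"\n\n### Instruction:\n{entry['instruction']}"
    )
    # Only add the input section when the entry actually carries extra context.
    if entry.get("input"):
        prompt += f"\n\n### Input:\n{entry['input']}"
    return prompt


def build_training_text(entry: dict) -> str:
    """Concatenate the prompt and the target response into one training string."""
    return format_input(entry) + f"\n\n### Response:\n{entry['output']}"


# Hypothetical entry, not taken from the actual dataset.
sample = {
    "instruction": "What is the difference between a stock and a bond?",
    "input": "",
    "output": "A stock is an ownership share in a company; a bond is a loan to an issuer.",
}

print(build_training_text(sample))
```

At inference time only `format_input(entry)` plus the `### Response:` header would be fed to the model, so that the model's continuation is the answer attempt.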

Shout out to @rasbt for the code.

Files: • Readme.md • instruction_finetuning_v0_3.ipynb