You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project is a minimal GPT implementation focused on a question-answering system. The model was trained from scratch on the WikiText-2 dataset without leveraging transfer learning or pretrained models, resulting in limited performance. It serves as an educational experiment to explore the fundamentals of training language models from scratch.