Deep Learning Discussion: The Bigram Language Model

Part 2

beginner
code
Deep Learning
Author

Daniel Kick

Published

November 12, 2024

For Next Session:

  • Explore the Bigram Model – Try to improve on the Bigram model we wrote this week. How much is performance influenced by…

    • Optimizer choice?

    • Learning rate?

    • Number and size of hidden layers?

    • Batch size?

    • Randomness of training set

  • Next session we will be discussing attention and transformers. For next session:

Materials