How to add new tokens to a transformer model vocabulary
In this post, we will see how to expand the vocabulary of a transformer model by adding your own words or tokens.
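As a rough illustration of what "adding tokens" involves, here is a minimal sketch in plain Python, not taken from any library. It assumes a toy vocabulary stored as a token-to-id dictionary with contiguous ids, and an embedding table stored as a list of rows; the helper names `add_tokens` and `resize_embeddings` are hypothetical and defined here only for the example. In the Hugging Face Transformers library, the same two steps are typically done with `tokenizer.add_tokens(...)` followed by `model.resize_token_embeddings(len(tokenizer))`.

```python
import random


def add_tokens(vocab, new_tokens):
    """Give each unseen token the next free id; return how many were added."""
    added = 0
    for token in new_tokens:
        if token not in vocab:
            # Assumes ids are contiguous, so len(vocab) is the next free id.
            vocab[token] = len(vocab)
            added += 1
    return added


def resize_embeddings(embeddings, new_size, dim, seed=0):
    """Append small random rows until the table has new_size rows."""
    rng = random.Random(seed)
    while len(embeddings) < new_size:
        embeddings.append([rng.gauss(0.0, 0.02) for _ in range(dim)])
    return embeddings


# Toy starting point (hypothetical values): 4 tokens, 3-dimensional embeddings.
vocab = {"[UNK]": 0, "the": 1, "model": 2, "learns": 3}
dim = 3
embeddings = [[0.0] * dim for _ in vocab]

# "model" is already known, so only two tokens are actually new.
num_added = add_tokens(vocab, ["covid", "model", "tokenizer"])
resize_embeddings(embeddings, len(vocab), dim)

print(num_added)                           # 2
print(vocab["covid"], vocab["tokenizer"])  # 4 5
print(len(embeddings))                     # 6
```

The existing rows are left untouched and only the new rows are initialised, which is why a model usually needs some fine-tuning before the added tokens carry useful meaning.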
Why do you need to expand the vocabulary?
Every language model trained for a specific NLP task has a vocabulary. The vocabulary is the set of unique words in the text corpus that the model was trained on. Therefore, dep…