Tokenizer trained on the CodeSearchNet Python data (as part of the [Hugging Face NLP Course](https://huggingface.co/learn/nlp-course/chapter6/2?fw=pt)).
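
For orientation, here is a minimal sketch of how a tokenizer like this can be trained, loosely following the linked course chapter. Several details are assumptions rather than facts about this repository: the `gpt2` base tokenizer, the vocabulary size of 52000, the `code_search_net` dataset id with its `whole_func_string` column, and the helper names `batch_texts` and `train_code_tokenizer`, which are made up for illustration. Recent versions of `datasets` may also need a different way of loading this dataset.

```python
from typing import Iterable, Iterator, List


def batch_texts(texts: Iterable[str], batch_size: int = 1000) -> Iterator[List[str]]:
    """Yield lists of at most `batch_size` strings from `texts`."""
    batch: List[str] = []
    for text in texts:
        batch.append(text)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly shorter, batch.
        yield batch


def train_code_tokenizer(vocab_size: int = 52000):
    """Retrain a base tokenizer on Python functions (assumed setup, see above)."""
    # Imported here so that batch_texts stays usable without these packages.
    from datasets import load_dataset
    from transformers import AutoTokenizer

    # Assumed dataset id and column name.
    raw = load_dataset("code_search_net", "python", split="train")
    # Assumed base tokenizer.
    base = AutoTokenizer.from_pretrained("gpt2")
    corpus = batch_texts(example["whole_func_string"] for example in raw)
    return base.train_new_from_iterator(corpus, vocab_size)
```

Calling `train_code_tokenizer()` downloads the dataset and the base tokenizer, so it needs network access and can take a while. The result can be saved with `save_pretrained` and loaded again with `AutoTokenizer.from_pretrained`.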