Telugu Question-Answering model trained on the TyDi QA dataset from Google
How to use
Run the script below from a Python terminal; the web inference interface has a few encoding issues with Telugu text.
from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline

model_name = "kuppuluri/telugu_bertu_tydiqa"
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
# Tokenizer options chosen so that Telugu characters are left untouched
tokenizer = AutoTokenizer.from_pretrained(model_name,
                                          clean_text=False,
                                          handle_chinese_chars=False,
                                          strip_accents=False,
                                          wordpieces_prefix='##')
nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)

# Replace the placeholders with your own Telugu question and passage
question = "..."
context = "..."
result = nlp({'question': question, 'context': context})
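The call above returns a dictionary. As a rough sketch of how one might read it, the helper below assumes the usual question-answering pipeline keys (`answer`, `score`, `start`, `end`), with `start` and `end` taken as character offsets into the context. The name `describe_answer`, the `min_score` threshold, and the sample dictionary are hypothetical and only for illustration; the sample is not real model output.

```python
def describe_answer(result, context, min_score=0.0):
    """Summarise one question-answering result (hypothetical helper).

    Assumes `result` has the keys 'answer', 'score', 'start' and 'end',
    where 'start'/'end' are character offsets into `context`.
    """
    span = context[result["start"]:result["end"]]
    return {
        "answer": result["answer"],
        # True when the offsets point at the same text as the answer string
        "span_matches": span == result["answer"],
        # True when the score reaches the caller's (arbitrary) threshold
        "confident": result["score"] >= min_score,
    }


# Made-up sample standing in for a pipeline result; the score is invented.
context = "హైదరాబాద్ తెలంగాణ రాజధాని."
answer = "హైదరాబాద్"
start = context.index(answer)
sample = {"answer": answer, "score": 0.9,
          "start": start, "end": start + len(answer)}

summary = describe_answer(sample, context, min_score=0.5)
print(summary["answer"], summary["span_matches"], summary["confident"])
```

Checking the span against the answer string is a cheap way to notice offset or encoding problems of the kind mentioned above, since a mismatch suggests the text was altered somewhere between tokenization and output.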
Training data
I used the Telugu portion of the TyDi QA dataset from Google: https://github.com/google-research-datasets/tydiqa
PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.