Model Description
This is a Khmer Language fill masked build on top of pre-trained model of FacebookAI/xlm-roberta-base. This model is fine-tunned with around 26K+ khmer sentences/clauses (80% for training set & 20% for validation set). This model is perform well with Khmer Language ONLY.
Model Usage
>>> from transformers import pipeline
>>> unmasker = pipeline('fill-mask', model='channudam/khmer-xlm-roberta-base')
>>> unmasker("α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉα<mask>α²ααααΆαα
αααΎαα")
[
{
'score': 0.9788032174110413,
'token': 41440,
'token_str': 'ααΉα',
'sequence': 'α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉαααΉα α²ααααΆαα
αααΎαα'
},
{
'score': 0.012485685758292675,
'token': 191670,
'token_str': 'ααααΆ',
'sequence': 'α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉαααααΆ α²ααααΆαα
αααΎαα'
},
{
'score': 0.0014946138253435493,
'token': 162483,
'token_str': 'ααΆα',
'sequence': 'α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉαααΆα α²ααααΆαα
αααΎαα'
},
{
'score': 0.001305083278566599,
'token': 49245,
'token_str': 'αααΈ',
'sequence': 'α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉααααΈ α²ααααΆαα
αααΎαα'
},
{
'score': 0.0007108347490429878,
'token': 51863,
'token_str': 'ααΉα',
'sequence': 'α’αΆααΆαααΆαα»αααα
ααααΆαα α
αΌαα’αααααΉα ααΉα α²ααααΆαα
αααΎαα'
}
]
- Downloads last month
- 17
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.