mdeberta-id-20k

This model is a vocabulary-pruned version of microsoft/mdeberta-v3-base, specifically optimized for the Indonesian language.

Vocabulary: 20k tokens (Indonesian)

Note: This model is part of an ongoing research project on efficient Transformer deployment. Full paper and benchmarks will be linked upon publication.

Downloads last month
2
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including muchad/mdeberta-id-20k