About
Copper City Labs is a research lab that specializes in making computers understand the Uzbek language. You may reach us at hello@coppercitylabs.com.
Publications
Models
- Uzbek news category classifier (based on UzBERT) [Hugging Face Hub]
- UzBERT (BERT for Cyrillic Uzbek) [Hugging Face Hub]
- Uzbek tokenizers [GitHub]
- Word embeddings for Uzbek (Cyrillic):
  - 100d fastText (CBOW) [figshare]
  - 100d fastText (skip-gram) [figshare]
  - 100d word2vec (CBOW, negative sampling) [figshare]
  - 100d word2vec (skip-gram, negative sampling) [figshare]
  - 300d fastText (CBOW) [figshare]
  - 300d fastText (skip-gram) [figshare]
  - 300d GloVe [figshare]
  - 300d word2vec (CBOW, hierarchical softmax) [figshare]
  - 300d word2vec (skip-gram, hierarchical softmax) [figshare]