l3cube-pune
/

hing-fast-text-embedding

Model card Files Files and versions

l3cube-pune commited on Nov 5, 2023

Commit

41d60e5

·

1 Parent(s): 61f2ea4

Create README.md

Files changed (1) hide show

README.md +35 -0

README.md ADDED Viewed

	@@ -0,0 +1,35 @@

+---
+language:
+- hi
+- en
+- multilingual
+license: cc-by-4.0
+tags:
+- hi
+- en
+- codemix
+datasets:
+- L3Cube-HingCorpus
+---
+## HingFT
+HingFT is a Hindi-English code-mixed fast text embedding model trained on Roman + Devanagari text of L3Cube-HingCorpus.
+<br>
+[dataset link] (https://github.com/l3cube-pune/code-mixed-nlp)
+More details on the dataset, models, and baseline results can be found in our [paper] (https://arxiv.org/abs/2204.08398)
+### Citing:
+```
+@inproceedings{nayak-joshi-2022-l3cube,
+    title = "{L}3{C}ube-{H}ing{C}orpus and {H}ing{BERT}: A Code Mixed {H}indi-{E}nglish Dataset and {BERT} Language Models",
+    author = "Nayak, Ravindra  and Joshi, Raviraj",
+    booktitle = "Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference",
+    month = jun,
+    year = "2022",
+    address = "Marseille, France",
+    publisher = "European Language Resources Association",
+    url = "https://aclanthology.org/2022.wildre-1.2",
+    pages = "7--12",
+}
+```