mtreviso commited on
Commit
056ea83
·
verified ·
1 Parent(s): 86eedb7

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +19 -11
README.md CHANGED
@@ -13,13 +13,15 @@ pipeline_tag: feature-extraction
13
 
14
  # NILC Portuguese Word Embeddings — GloVe 400d
15
 
16
- NILC-Embeddings is a repository for storing and sharing **word embeddings** for the Portuguese language.
17
- The goal is to provide ready-to-use vector resources for **Natural Language Processing (NLP)** and **Machine Learning** tasks.
 
 
 
 
 
18
 
19
- The embeddings were trained on a large Portuguese corpus (Brazilian + European), composed of 17 corpora (~1.39B tokens).
20
- Training was carried out with the following algorithms: **Word2Vec** [1], **FastText** [2], **Wang2Vec** [3], and **GloVe** [4].
21
 
22
- This repository contains the **GloVe 400d** model in **safetensors** format.
23
 
24
  ---
25
 
@@ -94,12 +96,18 @@ Hartmann, N. et al. (2017), STIL 2017.
94
 
95
  ### BibTeX
96
  ```bibtex
97
- @inproceedings{hartmann2017nilc,
98
- title={Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks},
99
- author={Hartmann, Nathan and Fonseca, Erick and Shulby, Christopher and Treviso, Marcos and Rodrigues, Juliano and Aluísio, Sandra},
100
- booktitle={Proceedings of the Symposium in Information and Human Language Technology (STIL)},
101
- year={2017}
102
- }
 
 
 
 
 
 
103
  ```
104
 
105
  ---
 
13
 
14
  # NILC Portuguese Word Embeddings — GloVe 400d
15
 
16
+ This repository contains the **GloVe 400d** model in **safetensors** format.
17
+
18
+ ## About
19
+
20
+ NILC-Embeddings is a repository for storing and sharing **word embeddings** for the Portuguese language. The goal is to provide ready-to-use vector resources for **Natural Language Processing (NLP)** and **Machine Learning** tasks.
21
+
22
+ The embeddings were trained on a large Portuguese corpus (Brazilian + European), composed of 17 corpora (~1.39B tokens). Training was carried out with the following algorithms: **Word2Vec**, **FastText**, **Wang2Vec**, and **GloVe**.
23
 
 
 
24
 
 
25
 
26
  ---
27
 
 
96
 
97
  ### BibTeX
98
  ```bibtex
99
+ @inproceedings{{hartmann-etal-2017-portuguese,
100
+ title = {{{{P}}ortuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks}},
101
+ author = {{Hartmann, Nathan and Fonseca, Erick and Shulby, Christopher and Treviso, Marcos and Silva, J{{'e}}ssica and Alu{{'i}}sio, Sandra}},
102
+ year = 2017,
103
+ month = oct,
104
+ booktitle = {{Proceedings of the 11th {{B}}razilian Symposium in Information and Human Language Technology}},
105
+ publisher = {{Sociedade Brasileira de Computa{{\c{{c}}}}{{\~a}}o}},
106
+ address = {{Uberl{{\^a}}ndia, Brazil}},
107
+ pages = {{122--131}},
108
+ url = {{https://aclanthology.org/W17-6615/}},
109
+ editor = {{Paetzold, Gustavo Henrique and Pinheiro, Vl{{'a}}dia}}
110
+ }}
111
  ```
112
 
113
  ---