Commit
·
9b1bf94
1
Parent(s):
e18488b
Update README.md
Browse files
README.md
CHANGED
|
@@ -2,9 +2,11 @@
|
|
| 2 |
# Visual semantic with BERT-CNN
|
| 3 |
|
| 4 |
This model can be used to assign an object-to-caption relatedness score, which is valuable for
|
| 5 |
-
(1) caption diverse re-ranking, and (2) generate soft labels for caption filtering when scraping
|
| 6 |
|
| 7 |
-
|
|
|
|
|
|
|
| 8 |
|
| 9 |
For the [dataset](https://huggingface.co/datasets/AhmedSSabir/Textual-Image-Caption-Dataset)
|
| 10 |
|
|
|
|
| 2 |
# Visual semantic with BERT-CNN
|
| 3 |
|
| 4 |
This model can be used to assign an object-to-caption relatedness score, which is valuable for
|
| 5 |
+
(1) caption diverse re-ranking, and (2) generate soft labels for caption filtering when scraping text-to-captions from the internet.
|
| 6 |
|
| 7 |
+
The model is trained with a strict filter of 0.4 similarity distance thresholds between the object and its related caption.
|
| 8 |
+
|
| 9 |
+
For a quick start please have a look at this [colab](https://colab.research.google.com/drive/1N0JVa6y8FKGLLSpiG7hd_W75UYhHRe2j?usp=sharing)
|
| 10 |
|
| 11 |
For the [dataset](https://huggingface.co/datasets/AhmedSSabir/Textual-Image-Caption-Dataset)
|
| 12 |
|