xinyiW915/ReLaX-VQA
Tags: Visual Question Answering, 5 datasets, deep-learning, vision, VQA, Transformer, CNN
arXiv: 2407.11496
License: apache-2.0
Branch: main
Repository: ReLaX-VQA (508 MB, 1 contributor)
History: 15 commits
Latest commit: Xinyi Wang, "update README" (045b2f8), 11 months ago
Name                    Size        Last commit      Updated
metadata/               -           first commit     11 months ago
model/                  -           Upload model     11 months ago
src/                    -           first commit     11 months ago
ugc_original_videos/    -           first commit     11 months ago
.gitattributes          1.6 kB      first commit     11 months ago
.gitignore              78 Bytes    Update           11 months ago
Framework.png           18.9 MB     first commit     11 months ago
README.md               9.52 kB     update README    11 months ago
reported_result.ipynb   66.8 kB     first commit     11 months ago
requirements.txt        2.57 kB     first commit     11 months ago