Team-PIXEL (Team-PIXEL)

ilkerkesen

authored 2 papers 2 months ago

Multilingual Pretraining for Pixel Language Models

Paper • 2505.21265 • Published May 27, 2025

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

Paper • 2508.16431 • Published Aug 22, 2025 • 1

lyan62

authored a paper 4 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15, 2025 • 28

jflotz

published 2 datasets 7 months ago

Team-PIXEL/rendered-bookcorpus-bigrams

Viewer • Updated Apr 13, 2023 • 7.7M • 288

Team-PIXEL/rendered-wiki_en-bigrams

Viewer • Updated Apr 14, 2023 • 13.4M • 18

lyan62

authored a paper 7 months ago

Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation

Paper • 2506.01565 • Published Jun 2, 2025 • 3

jflotz

published a model 7 months ago

Team-PIXEL/pixel-m4

Updated Dec 16, 2023 • 254 • 1

jflotz

authored 3 papers 9 months ago

plip

updated a Space 10 months ago

PIXEL

🐱

19

Generate text-masked images using PIXEL model

elliottd

authored a paper 10 months ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published Feb 19, 2025 • 6

e-bug

authored a paper about 1 year ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 133

lyan62

authored 3 papers over 1 year ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Paper • 2406.11030 • Published Jun 16, 2024

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 7

The Role of Data Curation in Image Captioning

Paper • 2305.03610 • Published May 5, 2023

e-bug

authored 2 papers over 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 17

ilkerkesen

authored a paper almost 2 years ago

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Paper • 2311.07022 • Published Nov 13, 2023 • 1

jflotz

updated a dataset almost 2 years ago

Team-PIXEL/PIXELSum_zh_wiki_for_TA

Viewer • Updated Jan 21, 2024 • 2.56M • 552

AI & ML interests

Team members 13

Team-PIXEL's activity

PIXEL