Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GEM benchmark

https://gem-benchmark.com
Activity Feed Request to join this org

AI & ML interests

We develop infrastructure for the evaluation of generated text.

Recent Activity

lewtun  submitted a paper about 1 month ago
Single-minus gluon tree amplitudes are nonzero
lewtun  submitted a paper about 1 month ago
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
gentaiscool  authored a paper about 2 months ago
PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues
View all activity

Sebastian Gehrmann's profile pictureAbhik Bhattacharjee's profile pictureLewis Tunstall's profile pictureAlex Wang's profile pictureJenny Chim's profile pictureRatish Puduppully's profile pictureYacine Jernite's profile pictureTosin Adewumi's profile pictureSK Mainul Islam's profile pictureAshish Upadhyay's profile pictureAshish Shrivastava's profile pictureMathias Creutz's profile pictureRabin's profile pictureAshwin Devaraj's profile pictureJurik Juraska's profile pictureQi Zhu's profile pictureMoussa Kamal Eddine's profile pictureNico Daheim's profile pictureTianhao Shen's profile pictureronald cardenas acosta's profile pictureSebastien Montella's profile pictureYufang Hou's profile pictureHiroaki Hayashi's profile pictureVipul Raheja's profile pictureAnna Shvets's profile pictureJenna Kanerva's profile pictureChandra B's profile pictureGenta Indra Winata's profile pictureTahmid Hasan's profile pictureBernd's profile pictureWang's profile pictureBryan Wilie's profile pictureCristina Garbacea's profile pictureLi Zhang's profile pictureMille's profile pictureVikas Raunak's profile pictureNouamane Tazi's profile pictureLeonardo Ribeiro's profile pictureJordan Clive's profile pictureFaisal Ladhak's profile pictureSamarth Agarwal's profile pictureVivian Tsai's profile pictureBingsheng Yao's profile pictureDoan Anh Tien's profile pictureOndrej Dusek's profile pictureAlbert Villanova del Moral's profile pictureDaniel Hershcovich's profile pictureGyan Prakash's profile pictureOwais Ahmad's profile pictureSimon van de Fliert's profile pictureJiaan Wang's profile pictureAbinaya Mahendiran's profile picturePawan Sasanka Ammanamanchi's profile pictureJohn Jackson's profile pictureMastane Achab's profile pictureAmeer Azam's profile pictureGrantley Cullar's profile pictureParaskevi Kivroglou's profile pictureMuhammad Imran Zaman's profile pictureDuong Trong Chi's profile pictureMuhammad Noman Gul's profile pictureRuslan Magana Vsevolodovna's profile pictureVan os's profile pictureEnder's profile pictureMrSomething's profile pictureNymbo's profile pictureToan M. DINH's profile pictureSalman Rasheed's profile pictureRam Kadiyala's profile pictureKaustubh Dholé's profile pictureVinit Tavde's profile pictureHusnain's profile pictureSanshruthR's profile pictureAbdul Samad Siddiqui's profile picturedada's profile pictureDJ Sri Vigneshwar's profile pictureFrank Soboczenski's profile pictureSitam Meur's profile pictureAmmar's profile pictureNguyễn Vũ Dương's profile pictureMinhazul Hasan Sohan's profile pictureHilda Cran May's profile pictureAayan Mishra's profile pictureRadu Butucelea's profile pictureAIMaster7's profile picturewave's profile pictureDan Clipca's profile pictureSubhansh Malviya's profile pictureVincenzo Gallo's profile pictureVORTEX's profile pictureParvesh Rawal's profile pictureAniket Kumar's profile pictureRAY's profile pictureAgblevor's profile pictureMohammad Othman's profile pictureTanmoy Shome's profile pictureNiansuh's profile pictureagsdgdae's profile picture

GEM 's Spaces 4

Running

README

🏢

Nov 2, 2024
Runtime error
9

DatasetCardForm

👁

Jun 29, 2022
Runtime error
3

Gem Submissions

💎

Jun 23, 2022
Running
3

Gem Results

📊

Display benchmark results in a resizable iframe

Feb 28, 2022
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs