The 44,000 de-duplicated benchmark questions used in "How Much Can We Forget about Data Contamination?" (ICML 2025)