Guille Pérez-Torró

guishe

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Recent Activity

liked a Space 10 days ago

OpenEvals/evaluation-guidebook

upvoted an article about 2 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted an article about 2 months ago

Merge Large Language Models with mergekit

View all activity

Organizations

None yet

liked a Space 10 days ago

Evaluation Guidebook

📝

221

Display benchmark evaluation data for LLMs

upvoted 2 articles about 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

265

Article

Merge Large Language Models with mergekit

Jan 9, 2024

•

147

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.75k

The secrets to building world-class LLMs

upvoted an article 2 months ago

Article

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

Oct 20, 2025

•

upvoted an article 4 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

267

liked 2 models 5 months ago

unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit

Text Generation • 4B • Updated Aug 6, 2025 • 70k • 11

unsloth/gpt-oss-20b-bnb-4bit

Text Generation • 21B • Updated Aug 6, 2025 • 3.86k • 12

upvoted a collection 5 months ago

gpt-oss

Collection

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 395

New activity in Qwen/Qwen3-Embedding-0.6B 6 months ago

task description for clustering

#33 opened 6 months ago by

guishe

updated a collection 7 months ago

Small LLMs

Collection

6 items • Updated Jun 10, 2025

upvoted an article 7 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14, 2024

•

100

updated a collection 7 months ago

Instruct LLMs

Collection

6 items • Updated May 26, 2025

updated 2 collections 8 months ago

Multi-Vector Embedding Models

Collection

2 items • Updated May 16, 2025

Instruct LLMs

Collection

6 items • Updated May 26, 2025

New activity in unsloth/Qwen3-32B-unsloth-bnb-4bit 8 months ago

Qwen3-32B-unsloth-bnb-4bit vs. bnb-8bit vs. gguf-Q8_0 et.al.?

👀 ➕ 2

#2 opened 8 months ago by

ideosphere

liked a model 8 months ago

lightonai/GTE-ModernColBERT-v1

upvoted a collection 8 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 8 days ago • 250

liked a model 8 months ago

intfloat/multilingual-e5-large-instruct

Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.38M • • 592

updated a collection 8 months ago

Embedding Encoder-only Models

Collection

14 items • Updated Apr 28, 2025

Guille Pérez-Torró

AI & ML interests

Recent Activity

Organizations

guishe's activity

Evaluation Guidebook

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Merge Large Language Models with mergekit

The Smol Training Playbook

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

Welcome EmbeddingGemma, Google's new efficient embedding model

task description for clustering

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Qwen3-32B-unsloth-bnb-4bit vs. bnb-8bit vs. gguf-Q8_0 et.al.?