Steerability of Instrumental-Convergence Tendencies in LLMs Paper • 2601.01584 • Published 3 days ago
Adversarial Confusion Attack: Disrupting Multimodal Large Language Models Paper • 2511.20494 • Published Nov 25, 2025
TinyClick: Single-Turn Agent for Empowering GUI Automation Paper • 2410.11871 • Published Oct 9, 2024 • 3
Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion Paper • 2406.02481 • Published Jun 4, 2024
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU? Paper • 2301.11688 • Published Jan 27, 2023
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages Paper • 2404.02588 • Published Apr 3, 2024 • 1
NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method Paper • 2403.18680 • Published Mar 27, 2024