共计 152 篇文章
2025
IBD:通过图像有偏解码减轻大型视觉-语言模型中的幻觉
Implicit Bias Injection Attacks against Text-to-Image Diffusion Models
VASparse:通过视觉感知的 token 稀疏化实现高效视觉幻觉缓解
Be My Eyes:通过多智能体协作将大型语言模型扩展到新模态
把MoE整合进LLaVA
ARC Is a Vision Problem
UpSafe℃: Upcycling for Controllable Safety in Large Language Models
Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging
非文本的上下文学习
借助主动检索增强缓解大型视觉语言模型的幻觉问题