Bear
  • 首页
  • 目录
  • 标签
  • latex识别
  • 每日arxiv
  • 关于
目录

共计 317 篇文章


2026

02-09
Hallucination Begins Where Saliency Drops
02-08
REACT:SYNERGIZING REASONING AND ACTING IN LANGUAGE MODELS
02-08
RAGLens: Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders
02-06
Why Steering Works:Toward a Unified View of Language Model Parameter Dynamics
02-06
LLM-VA: Resolving the Jailbreak-Overrefusal Trade-off via Vector Alignment
02-06
AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint
02-05
FRAUDAR: Bounding Graph Fraud in the Face of Camouflage
02-03
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
02-02
One-shot Optimized Steering Vector for Hallucination Mitigation for VLMs
01-15
RelayLLM: Efficient Reasoning via Collaborative Decoding
1234…32

搜索

LJX Hexo
博客已经运行 天