update
This commit is contained in:
58
storage/3KB89IIG/.zotero-ft-cache
Normal file
58
storage/3KB89IIG/.zotero-ft-cache
Normal file
@@ -0,0 +1,58 @@
|
||||
Skip to main content
|
||||
Computer Science > Computation and Language
|
||||
arXiv:2502.17125 (cs)
|
||||
[Submitted on 24 Feb 2025]
|
||||
LettuceDetect: A Hallucination Detection Framework for RAG Applications
|
||||
Ádám Kovács, Gábor Recski
|
||||
View PDF
|
||||
HTML (experimental)
|
||||
Retrieval Augmented Generation (RAG) systems remain vulnerable to hallucinated answers despite incorporating external knowledge sources. We present LettuceDetect a framework that addresses two critical limitations in existing hallucination detection methods: (1) the context window constraints of traditional encoder-based methods, and (2) the computational inefficiency of LLM based approaches. Building on ModernBERT's extended context capabilities (up to 8k tokens) and trained on the RAGTruth benchmark dataset, our approach outperforms all previous encoder-based models and most prompt-based models, while being approximately 30 times smaller than the best models. LettuceDetect is a token-classification model that processes context-question-answer triples, allowing for the identification of unsupported claims at the token level. Evaluations on the RAGTruth corpus demonstrate an F1 score of 79.22% for example-level detection, which is a 14.8% improvement over Luna, the previous state-of-the-art encoder-based architecture. Additionally, the system can process 30 to 60 examples per second on a single GPU, making it more practical for real-world RAG applications.
|
||||
Comments: 6 pages
|
||||
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
|
||||
Cite as: arXiv:2502.17125 [cs.CL]
|
||||
(or arXiv:2502.17125v1 [cs.CL] for this version)
|
||||
|
||||
https://doi.org/10.48550/arXiv.2502.17125
|
||||
Focus to learn more
|
||||
Submission history
|
||||
From: Ádám Kovács [view email]
|
||||
[v1] Mon, 24 Feb 2025 13:11:47 UTC (1,188 KB)
|
||||
|
||||
Access Paper:
|
||||
View PDFHTML (experimental)TeX Source
|
||||
view license
|
||||
Current browse context: cs.CL
|
||||
< prev next >
|
||||
|
||||
newrecent2025-02
|
||||
Change to browse by: cs cs.AI
|
||||
References & Citations
|
||||
NASA ADS
|
||||
Google Scholar
|
||||
Semantic Scholar
|
||||
Export BibTeX Citation
|
||||
Bookmark
|
||||
Bibliographic Tools
|
||||
Bibliographic and Citation Tools
|
||||
Bibliographic Explorer Toggle
|
||||
Bibliographic Explorer (What is the Explorer?)
|
||||
Connected Papers Toggle
|
||||
Connected Papers (What is Connected Papers?)
|
||||
Litmaps Toggle
|
||||
Litmaps (What is Litmaps?)
|
||||
scite.ai Toggle
|
||||
scite Smart Citations (What are Smart Citations?)
|
||||
Code, Data, Media
|
||||
Demos
|
||||
Related Papers
|
||||
About arXivLabs
|
||||
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
|
||||
About
|
||||
Help
|
||||
Contact
|
||||
Subscribe
|
||||
Copyright
|
||||
Privacy Policy
|
||||
Web Accessibility Assistance
|
||||
|
||||
arXiv Operational Status
|
||||
Reference in New Issue
Block a user