Recently Published
Descrierea planului PNRR
The study focuses on analyzing the correlation between the objectives (investments, reforms, targets, milestones, etc.) proposed and scheduled for funding under the approved NRRP plan, as well as on evaluating the absorption rate of European funds based on the four funding requests submitted to date. In addition, the study assesses the financial progress and the degree of fulfillment of the milestones and targets associated with the first four payment requests submitted.
Retrieval-Augmented Verification Under Ambiguity: Benchmarking Classification Reliability Across Traditional and Large Language Model Architectures
Automated misinformation verification has increasingly shifted from surface-text classification toward evidence-grounded claim verification. While large language models (LLMs) demonstrate strong language understanding capabilities, standalone configurations remain vulnerable to unsupported factual judgments and hallucination. Retrieval-augmented generation (RAG) has emerged as a potential mechanism for improving evidence grounding by supplying external information during inference, yet empirical findings remain inconsistent regarding when retrieval improves or redistributes classification reliability.
This study evaluates retrieval augmentation as an empirical verification condition rather than as an assumed solution. Using a quantitative comparative benchmark design, the study compares three verification architecture types on short fact-checked English claims: (1) a TF-IDF + Logistic Regression baseline, (2) standalone LLM configurations, and (3) bounded-corpus RAG LLM configurations. The benchmark evaluates one traditional baseline and five LLM families under controlled retrieval conditions using a binary misleading/not-misleading classification task derived from the LIAR dataset.
Performance is evaluated using Macro F1, class-level recall, precision, Matthews Correlation Coefficient (MCC), confusion matrices, statistical significance testing, error agreement analysis, textual pattern analysis, and subject-level performance analysis. The study additionally examines ambiguity-sensitive claim conditions, including numerical ambiguity, temporal ambiguity, and partial-truth structures, to identify claim categories that remain difficult across architectures.
The study positions retrieval augmentation as a retrieval-conditioned evidence-grounding mechanism whose effects depend on evidence relevance and contextual alignment rather than as an inherently superior verification approach. Findings contribute to misinformation verification research, evidence-grounded AI evaluation, and Library and Information Science discussions concerning retrieval, credibility-related classification reliability, and responsible AI-assisted verification. The study further demonstrates the importance of balanced reliability metrics and ambiguity-aware evaluation in benchmark-based misinformation research.
VIASM 2026 B2
Trực quan hóa dữ liệu gói cơ bản và nâng cao