What is the difference between faithfulness and plausible explanations?

Search results

arxiv.org › html › 2402Faithfulness vs. Plausibility: On the (Un)Reliability of ...

arxiv.org › html › 2402
- Cached
Feb 8, 2024 · We highlight that the current trend towards increasing the plausibility of explanations, primarily driven by the demand for user-friendly interfaces, may come at the cost of diminishing their faithfulness. We assert that the faithfulness of explanations is critical in LLMs employed for high-stakes decision-making.
openreview.net › pdfWALK THE TALK? MEASURING THE FAITHFULNESS OF LARGE LANGUAGE ...

openreview.net › pdf
definition of faithfulness. Since LLM explanations mimic human explanations, they often reference high-level concepts in the input question that purportedly influenced the model. We define faithfulness in terms of the difference between the set of concepts that LLM explanations imply are influential and the set thattruly are.
arxiv.org › abs › 2402Faithfulness vs. Plausibility: On the (Un)Reliability of ...

arxiv.org › abs › 2402
- Cached
Feb 7, 2024 · In this work, we discuss the dichotomy between faithfulness and plausibility in SEs generated by LLMs. We argue that while LLMs are adept at generating plausible explanations -- seemingly logical and coherent to human users -- these explanations do not necessarily align with the reasoning processes of the LLMs, raising concerns about their faithfulness.
- Cite as: arXiv:2402.04614 [cs.CL]
- Subjects: Computation and Language (cs.CL)
arxiv.org › pdf › 2209Towards Faithful Model Explanation in NLP: A Survey - arXiv.org

arxiv.org › pdf › 2209
to which an explanation accurately reflects a model’s reasoning process (Jacovi and Goldberg2020). In other words, an explanation should not “lie” about the underlying mechanism at work. Explanations that lack faithfulness can be dangerous, especially when they still appear plausible, i.e., convincing to humans. This can mislead the
www.semanticscholar.org › paper › Faithfulness-vsFaithfulness vs. Plausibility: On the (Un)Reliability of ...

www.semanticscholar.org › paper › Faithfulness-vs
Feb 7, 2024 · It is asserted that the faithfulness of explanations is critical in LLMs employed for high-stakes decision-making and called upon the community to develop novel methods to enhance the faithfulness of self explanations thereby enabling transparent deployment of LLMs in diverse high-stakes settings. Large Language Models (LLMs) are deployed as powerful tools for several natural language ...
human.libretexts.org › Bookshelves › Philosophy3.2: Inference to the Best Explanation and the Seven ...

human.libretexts.org › Bookshelves › Philosophy
- Cached
Apr 21, 2023 · The seven explanatory virtues are: Explanatoriness: Explanations must explain all the observed facts. Depth: Explanations should not raise more questions than they answer. Power: Explanations should apply in a range of similar contexts, not just the current situation in which the explanation is being offered.
People also ask
What is the difference between faithfulness and plausible explanations?
However, these plausible explanations might be misleading if they do not correspond to the LLM’s internal decision-making process. On the other hand, faithfulness represents the accuracy of explanations in illustrating the LLM’s actual reasoning, i.e., why and how the model reached a particular decision.

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

arxiv.org/html/2402.04614v2
See all results for this question
When is an explanation considered faithful?
An explanation is considered faithful if it accurately represents the reasoning of the underlying model. Evaluating the faithfulness of explanations is a non-trivial problem due to the lack of ground truth explanations.

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

arxiv.org/html/2402.04614v2
See all results for this question
Can a plausible explanation be unfaithful?
Below, we argue that plausible explanations that are not necessarily faithful can potentially result in. Misplaced Trust and Over-reliance: When LLMs provide plausible but unfaithful explanations, there’s a significant danger of making erroneous decisions in high-stakes environments like healthcare, finance, and legal systems.

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

arxiv.org/html/2402.04614v2
See all results for this question
Do we need a systematic characterization of faithfulness-plausibility requirements?
Moreover, we emphasize the need for a systematic characterization of faithfulness-plausibility requirements of different real-world applications and ensure explanations meet those needs. While there are several approaches to improving plausibility, improving faithfulness is an open challenge.

[2402.04614] Faithfulness vs. Plausibility: On the (Un)Reliability of Explan…

arxiv.org/abs/2402.04614
See all results for this question
Is the plausibility of explanations a problem in LLMs?
We highlight that the current trend towards increasing the plausibility of explanations, primarily driven by the demand for user-friendly interfaces, may come at the cost of diminishing their faithfulness. We assert that the faithfulness of explanations is critical in LLMs employed for high-stakes decision-making.

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

arxiv.org/html/2402.04614v2
See all results for this question
Is the faithfulness of explanations important in LLMs?
We assert that the faithfulness of explanations is critical in LLMs employed for high-stakes decision-making. Moreover, we emphasize the need for a systematic characterization of faithfulness-plausibility requirements of different real-world applications and ensure explanations meet those needs.

[2402.04614] Faithfulness vs. Plausibility: On the (Un)Reliability of Explan…

arxiv.org/abs/2402.04614
See all results for this question
ar5iv.labs.arxiv.org › html › 2004Towards Faithfully Interpretable NLP Systems: - ar5iv

ar5iv.labs.arxiv.org › html › 2004
- Cached
Mar 2, 2024 · However, in the context of faithfulness, we must warn against HCI-inspired evaluation, as well: increased performance in this setting is not indicative of faithfulness; rather, it is indicative of correlation between the plausibility of the explanations and the model’s performance.

Yahoo Canada Web Search

Search results

arxiv.org › html › 2402Faithfulness vs. Plausibility: On the (Un)Reliability of ...

openreview.net › pdfWALK THE TALK? MEASURING THE FAITHFULNESS OF LARGE LANGUAGE ...

arxiv.org › abs › 2402Faithfulness vs. Plausibility: On the (Un)Reliability of ...

arxiv.org › pdf › 2209Towards Faithful Model Explanation in NLP: A Survey - arXiv.org

www.semanticscholar.org › paper › Faithfulness-vsFaithfulness vs. Plausibility: On the (Un)Reliability of ...

human.libretexts.org › Bookshelves › Philosophy3.2: Inference to the Best Explanation and the Seven ...

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

[2402.04614] Faithfulness vs. Plausibility: On the (Un)Reliability of Explan…

Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from L…

[2402.04614] Faithfulness vs. Plausibility: On the (Un)Reliability of Explan…

ar5iv.labs.arxiv.org › html › 2004Towards Faithfully Interpretable NLP Systems: - ar5iv

Related searches