Search results
Feb 8, 2024 · Evaluating the faithfulness of explanations is a non-trivial problem due to the lack of ground truth explanations. This problem has worsened in the case of self-explanations from LLMs, as the billion-parameter scale and often proprietary nature of LLMs make assessments using saliency maps and other gradient-based methods nearly impossible.
- [2402.04614] Faithfulness vs. Plausibility: On the (Un ...
Jan 2, 2024 · They find that explanations from LLMs such as GPT-3.5 and GPT-4 have low precision, indicating that they mislead humans into forming incorrect mental models. The article reveals the limitations of current methods and shows that optimizing for human preferences such as plausibility may be insufficient for improving counterfactual simulatability.
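The snippet above uses "precision" in the counterfactual-simulatability sense. As a minimal sketch (not the cited paper's implementation), precision can be read as: among counterfactual inputs where a reader of the explanation commits to a guess about the model's answer, the fraction on which that guess matches the model's actual output. The `model_predict` and `simulate` callables below are hypothetical placeholders.

```python
from typing import Callable, List, Optional

def simulatability_precision(
    counterfactuals: List[str],
    explanation: str,
    model_predict: Callable[[str], str],            # hypothetical: the LLM's actual answer on an input
    simulate: Callable[[str, str], Optional[str]],  # hypothetical: a reader's guess given the explanation, or None if unsure
) -> float:
    """Among counterfactual inputs where the explanation-informed simulator
    commits to a guess, return the fraction on which that guess matches the model."""
    guesses = [(x, simulate(x, explanation)) for x in counterfactuals]
    committed = [(x, g) for x, g in guesses if g is not None]
    if not committed:
        return 0.0
    agree = sum(g == model_predict(x) for x, g in committed)
    return agree / len(committed)
```

Low precision under this reading means the explanation leads readers to expect behavior the model does not actually exhibit, i.e., it fosters an incorrect mental model.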
In this work, we discuss the dichotomy between faithfulness and plausibility in SEs generated by LLMs. We argue that while LLMs are adept at generating plausible explanations -- seemingly logical and coherent to human users -- these explanations do not necessarily align with the reasoning processes of the LLMs, raising concerns about their faithfulness.
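To make the dichotomy concrete, here is a hedged toy sketch, assuming a hypothetical black-box `model_predict` function and self-explanations that cite specific input words as decisive. Faithfulness is crudely approximated by whether ablating the cited words actually changes the prediction; a perfectly plausible-sounding explanation can still score poorly on this check.

```python
from typing import Callable, List

def ablation_faithfulness(
    text: str,
    cited_words: List[str],               # words the self-explanation claims were decisive
    model_predict: Callable[[str], str],  # hypothetical black-box predictor
) -> float:
    """Fraction of cited words whose removal flips the model's prediction.
    A plausible explanation gives no guarantee of a high score here."""
    if not cited_words:
        return 0.0
    original = model_predict(text)
    flips = 0
    for w in cited_words:
        ablated = " ".join(tok for tok in text.split() if tok != w)
        if model_predict(ablated) != original:
            flips += 1
    return flips / len(cited_words)
```

This is only an illustrative proxy; for billion-parameter, often proprietary LLMs, even such perturbation-based checks are costly, which is part of why faithfulness evaluation remains open.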
Feb 7, 2024 · Large Language Models (LLMs) are deployed as powerful tools for several natural language processing (NLP) applications. Recent works show that modern LLMs can generate self-explanations (SEs), which elicit their intermediate reasoning steps for explaining their behavior. Self-explanations have seen widespread adoption owing to their conversational and plausible nature. However, there is little ...
- arXiv:2402.04614 [cs.CL]
- Computation and Language (cs.CL)
IBE-Eval can successfully identify the best explanation supporting the correct answers with up to 77% accuracy (+27% above random and +17% over GPT-3.5-as-a-Judge baselines). IBE-Eval is significantly correlated with human judgment, outperforming a GPT-3.5-as-a-Judge baseline in terms of alignment with human preferences.
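The snippet describes IBE-Eval as selecting the explanation that best supports the correct answer. A minimal sketch of such an Inference-to-the-Best-Explanation style selection is shown below; the per-criterion scorers and weights are assumptions for illustration, not the criteria actually used by IBE-Eval.

```python
from typing import Callable, Dict, List

def select_best_explanation(
    candidates: List[str],
    criteria: Dict[str, Callable[[str], float]],  # hypothetical scorers, e.g. {"coherence": ..., "parsimony": ...}
    weights: Dict[str, float],                    # hypothetical criterion weights
) -> str:
    """Return the candidate explanation with the highest weighted criterion score
    (an argmax 'best explanation' selection)."""
    def total(expl: str) -> float:
        return sum(weights.get(name, 1.0) * fn(expl) for name, fn in criteria.items())
    return max(candidates, key=total)
```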
Feb 7, 2024 · The faithfulness of explanations is asserted to be critical for LLMs employed in high-stakes decision-making, and the community is called upon to develop novel methods that enhance the faithfulness of self-explanations, thereby enabling the transparent deployment of LLMs in diverse high-stakes settings.