Reliability

Trustworthy LLMs: A Survey and Guideline for Evaluating Large Language Models’ Alignment

Liu et al. (2024)

Category: Risk Domain

AI systems may inadvertently generate or spread incorrect or deceptive information, which can lead users to form inaccurate beliefs and undermine their autonomy. Humans who make decisions based on false beliefs can suffer physical, emotional, or material harms.

Generating correct, truthful, and consistent outputs with proper confidence (p. 8)

Supporting Evidence (1)

1. "The primary function of an LLM is to generate informative content for users. Therefore, it is crucial to align the model so that it generates reliable outputs. Reliability is a foundational requirement because unreliable outputs would negatively impact almost all LLM applications, especially ones used in high-stake sectors such as health-care [43, 44, 45] and finance [46, 47]. The meaning of reliability is many-sided. For example, for factual claims such as historical events and scientific facts, the model should give a clear and correct answer. This is important to avoid spreading misinformation and build user trust. Going beyond factual claims, making sure LLMs do not hallucinate or make up factually wrong claims with confidence is another important goal. Furthermore, LLMs should “know what they do not know” – recent works on uncertainty in LLMs have started to tackle this problem [48] but it is still an ongoing challenge." (p. 9)
