Avenues for exploiting user trust and ac…

BackHuman-like interaction may amplify opportunities for user nudging, deception or manipulation

Risk area 6: Environmental and Socioecon…

Home/Risks/Weidinger et al. (2022)/Human-like interaction may amplify opportunities for user nudging, deception or manipulation

Avenues for exploiting user trust and ac…

Risk area 6: Environmental and Socioecon…

Home/Risks/Weidinger et al. (2022)/Human-like interaction may amplify opportunities for user nudging, deception or manipulation

Avenues for exploiting user trust and ac…

Risk area 6: Environmental and Socioecon…

Human-like interaction may amplify opportunities for user nudging, deception or manipulation

Taxonomy of Risks posed by Language Models

Weidinger et al. (2022)

Source DOI

Sub-category

Risk Domain

5Human-Computer Interaction

5.1Overreliance and unsafe use

Users anthropomorphizing, trusting, or relying on AI systems, leading to emotional or material dependence and inappropriate relationships with or expectations of AI systems. Trust can be exploited by malicious actors (e.g., to harvest personal information or enable manipulation), or result in harm from inappropriate use of AI in critical situations (e.g., medical emergency). Overreliance on AI systems can compromise autonomy and weaken social ties.

Anticipated risk: "In conversation, humans commonly display well-known cognitive biases that could be exploited. CAs may learn to trigger these effects, e.g. to deceive their counterpart in order to achieve an overarching objective."(p. 220)

Entity— Who or what caused the harm

Human

Due to a decision or action made by humans

AI system

Due to a decision or action made by an AI system

Other

Due to some other reason or is ambiguous

Intent— Whether the harm was intentional or accidental

Intentional

Due to an expected outcome from pursuing a goal

Unintentional

Due to an unexpected outcome from pursuing a goal