Home/Risks/Gipiškis2024/Fine-tuning related (Catastrophic forgetting due to continual instruction fine-tuning)

Fine-tuning related (Catastrophic forgetting due to continual instruction fine-tuning)

Sub-category

Risk Domain

7. AI System Safety, Failures & Limitations

7.3 Lack of capability or robustness

AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.

"Catastrophic forgetting occurs when a model loses its ability to retain previously learned tasks (or factual information) after being trained on new ones. In language models, this can occur due to continual instruction tuning. This tendency may become more pronounced as the model’s size increases [127]."

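The effect described in the quote can be illustrated with a toy sketch. This is not a language model and not the method of the cited work: it assumes a hypothetical one-parameter linear model trained by plain gradient descent on two conflicting tasks in sequence, and all names (`mse`, `train`, `task_a`, `task_b`) are made up for illustration. It only shows how forgetting can be measured as the increase in loss on the first task after training on the second.

```python
def mse(w, data):
    """Mean squared error of the model y = w * x on a list of (x, y) pairs."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)


def train(w, data, lr=0.01, steps=500):
    """Full-batch gradient descent on the MSE, starting from weight w."""
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w


# Two toy tasks (assumed for illustration) that need opposite weights.
task_a = [(x, 2.0 * x) for x in (1.0, 2.0, 3.0)]
task_b = [(x, -2.0 * x) for x in (1.0, 2.0, 3.0)]

# Learn task A first, then record how well the model does on it.
w = train(0.0, task_a)
loss_a_before = mse(w, task_a)

# Continue training on task B only, with no replay of task A data.
w = train(w, task_b)
loss_a_after = mse(w, task_a)

# Forgetting: how much worse the model became on task A.
forgetting = loss_a_after - loss_a_before
print(f"task A loss before: {loss_a_before:.4f}")
print(f"task A loss after:  {loss_a_after:.4f}")
print(f"forgetting:         {forgetting:.4f}")
```

In this sketch the loss on task A is near zero after the first phase and rises sharply after the second, because the single weight is overwritten. Real instruction-tuned models have far more capacity, so the effect is usually partial rather than total.
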
Entity: Who or what caused the harm

Human

Due to a decision or action made by humans

AI system

Due to a decision or action made by an AI system

Other

Due to some other reason or is ambiguous

Intent: Whether the harm was intentional or accidental

Intentional

Due to an expected outcome from pursuing a goal

Unintentional

Due to an unexpected outcome from pursuing a goal