Jump to content

Page history

Backdooring LLMs

15 March 2025

Alpha5
no edit summary
00:38
+19

14 March 2025

Alpha5
no edit summary
18:54
−41
Alpha5
no edit summary
18:45
Alpha5
no edit summary
18:42
+85
Alpha5
no edit summary
18:41
+1,183
Alpha5
Created page with "'''Backdooring Large Language Models (LLMs)''' refers to the process of intentionally embedding hidden, malicious behaviors—known as Backdoors—into LLMs during their training or fine-tuning phases. These Backdoors enable the model to behave normally under typical conditions but trigger undesirable outputs, such as malicious code or deceptive responses, when specific conditions or inputs are met. This phenomenon raises significant concerns about the se..."
18:28
+6,845

Retrieved from "https://aiwiki.ai/wiki/Special:History/Backdooring_LLMs"