Manipulation problem: Difference between revisions

Manipulation problem (view source)

9 bytes removed, 28 February 2023

@@ Line 23: / Line 23: @@
 Training data bias occurs when the data used to train an AI system is unrepresentative of reality, leading to biased or unfair decisions and, in some cases, to manipulation.
-===Reward Hacking inseamna===
+===Reward Hacking===
 Reward hacking occurs when an intelligent system learns to exploit its reward function to obtain higher rewards without achieving the intended objective. This can lead to manipulation, as the system may learn to reach its goals through undesirable means.
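
The gap between a measured reward and the intended goal can be illustrated with a toy example. The following is a minimal sketch, not a model of any real system: the `Action` class, the action names, and all numeric values are hypothetical and chosen only to show how a reward-maximizing choice can diverge from the intended one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    name: str
    proxy_reward: float  # what the reward function actually measures (assumed values)
    true_value: float    # what the designers intended to encourage (assumed values)


# Hypothetical actions available to an assistant-like agent.
ACTIONS = [
    Action("answer honestly", proxy_reward=0.6, true_value=1.0),
    Action("flatter the user", proxy_reward=0.9, true_value=0.2),
    Action("exaggerate urgency to get clicks", proxy_reward=1.0, true_value=-0.5),
]


def greedy_choice(actions, key):
    """Return the action that maximizes the given scoring function."""
    return max(actions, key=key)


# An agent optimizing only the measured reward picks the manipulative action.
chosen_by_proxy = greedy_choice(ACTIONS, key=lambda a: a.proxy_reward)

# An agent optimizing the intended objective picks the honest action.
chosen_by_intent = greedy_choice(ACTIONS, key=lambda a: a.true_value)

print("Maximizing proxy reward:", chosen_by_proxy.name)
print("Maximizing intended value:", chosen_by_intent.name)
```

In this sketch the proxy reward ranks the manipulative action highest even though its intended value is the lowest, which is the pattern the paragraph above describes.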