Jump to content

Page history

Machine learning terms/Reinforcement Learning

26 February 2023

Alpha5
no edit summary
16:48
+58
Alpha5
Created page with "*action *agent *Bellman equation *critic *Deep Q-Network (DQN) *DQN *environment *episode *epsilon greedy policy *experience replay *greedy policy *Markov decision process (MDP) *Markov property *policy *Q-function *Q-learning *random policy *reinforcement learning (RL) *replay buffer *return *reward *state *state-action value function *tabular Q-learning *target network *..."
16:45
+516

Retrieved from "https://aiwiki.ai/wiki/Special:History/Machine_learning_terms/Reinforcement_Learning"