Q-function: Revision history

Diff selection: Mark the radio buttons of the revisions to compare and hit enter or the button at the bottom.
Legend: (cur) = difference with latest revision, (prev) = difference with preceding revision, m = minor edit.

18 March 2023

  • curprev 21:5421:54, 18 March 2023Walle talk contribs 3,046 bytes +3,046 Created page with "{{see also|Machine learning terms}} ==Q-function in Machine Learning== The Q-function, also known as the state-action value function or simply Q-value, is a fundamental concept in the field of Reinforcement Learning (RL). It represents the expected cumulative reward an agent will receive from a specific state by taking a certain action and then following a given policy. Mathematically, the Q-function is denoted as Q(s, a), where 's' represents the state and 'a' repre..."
Retrieved from "http:///wiki/Q-function"