- #1
ngrunenberg
- 9
- 2
I know I'm not that bright and I realize that this is a silly question to anyone in the field, but I was curious what the reward is in reinforcement learning algorithms.
I understand the concept behind reinforcement learning, though I am unsure of how you could program a reward into a program. There is no limbic system that would respond positively because it has been rewarded with an influx of dopamine, and even if we could program this into an algorithm, how would it know to respond as a biological entity would; I imagine void of having a biological "purpose" to perpetuate ones genes, there would be no real reward that would bring the agent closer to said purpose.
Again, apologies for my ignorance and thanks in advance for taking the time to reply.
I understand the concept behind reinforcement learning, though I am unsure of how you could program a reward into a program. There is no limbic system that would respond positively because it has been rewarded with an influx of dopamine, and even if we could program this into an algorithm, how would it know to respond as a biological entity would; I imagine void of having a biological "purpose" to perpetuate ones genes, there would be no real reward that would bring the agent closer to said purpose.
Again, apologies for my ignorance and thanks in advance for taking the time to reply.