potential_reward
PotentialReward
Bases: BaseRewardFunction
Potential reward Assume task has get_potential implemented; Low potential is preferred (e.g. a common potential for goal-directed task is the distance to goal)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
potential_fcn
|
method
|
function for calculating potential. Function signature should be: potential = potential_fcn(env) where @env is a Environment instance, and @potential is a float value representing the calculated potential |
required |
r_potential
|
float
|
Reward weighting to give proportional to the potential difference calculated in between env timesteps |
1.0
|
Source code in omnigibson/reward_functions/potential_reward.py
reset(task, env)
Compute the initial potential after episode reset
:param task: task instance :param env: environment instance