Merge "Make reward constants configurable." into tm-dev