Probabilistic Reinforcement Learning: Using Data to Define Desired Outcomes and Inferring How to Get There