On Non-Stationarity In Reinforced Deep Markov Models With Applications In Portfolio Optimization