Towards Learning Feasible Hierarchical Decision-Making Policies in Urban Autonomous Driving