Visual Forecasting for Interactive Embodied Agent