Generating Synthetic Video Sequences by Explicitly Modeling Object Motion