Usually a Convolutional Neural Network (CNN) that extracts spatial information.
Most researchers using these specific file IDs are implementing or testing Deep Feature Flow . This framework solves the problem of per-frame processing being too slow for real-time video. 5faafede261a08fdaba46_source.mp4
Labeling every pixel in the video (e.g., distinguishing "road" from "sidewalk"). Usually a Convolutional Neural Network (CNN) that extracts
A lighter network (like FlowNet) that estimates the motion between frames. 5faafede261a08fdaba46_source.mp4