Abstract: Constructing a high-fidelity representation of the 3D scene using a monocular camera can enable a wide range of applications on low-energy devices, such as micro-robots, smartphones, and ...
Abstract: Learning multiobject dynamics purely from visual data is challenging due to the need for robust object representations that can be learned through robot interactions. In previous work ...