Abstract: In this paper, we focus on the problem of efficiently locating a target object described with free-form text using a mobile robot equipped with vision sensors (e.g., an RGBD camera).
Abstract: Video summarization can help people retrieve videos quickly. Existing unsupervised video summarization models make insufficient use of video frame uniqueness in the self-attention mechanism ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results