Abstract: Audiovisual event localization aims to localize the event that is both visible and audible in a video. Previous works focus on segment-level audio and visual feature sequence encoding and ...