Abstract: Video event localization tasks include temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL). Existing methods tend to over-specialize on ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results