Video-based self-avatars represent a promising approach for displaying users’ bodies in XR environments. While previous methods have relied on color cues, depth data, or RGB-based deep learning models ...