Abstract: Infrared image super-resolution (IRSR) is challenging due to weak structures and textures. While Mamba-based state-space models (SSMs) efficiently model long-range dependencies, their ...
Abstract: High-quality image captions play a crucial role in improving the performance of cross-modal applications such as text-to-image generation, text-to-video generation, and text-image retrieval.