Abstract: We propose a novel description-based controllable text-to-speech (TTS) method with cross-lingual control capability. To address the lack of audio-description paired data in the target ...
PutergenAI is a lightweight, robust Python SDK for interacting with the Puter.js API, an open-source cloud operating system focused on privacy and AI capabilities. This SDK provides a clean interface ...
Abstract: In spite of the fact that Braille is an important channel of communication for the visually impaired, conventional systems require specialized training and expensive devices that are hard to ...