Generate speech from text in Japanese or English
Detect anime faces and landmarks in an image
Towards Unified Music Emotion Recognition across Dimensional