Abstract: In this paper, we propose a novel bidirectional learned animation codec that generates natural facial videos by using past and future keyframes. First, we introduce a compact auxiliary ...
AcademiCodec ├── academicodec │ ├── utils.py # common parts of various models │ ├── modules # common parts of various models │ ├── ... │ ├── quantization # common parts of various models ...
VALL-E 2 is the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time. Building upon the ...
Abstract: End-to-end image and video codecs are becoming increasingly competitive, compared to traditional compression techniques that have been developed through decades of manual engineering efforts ...