Abstract: In this paper, we propose a novel bidirectional learned animation codec that generates natural facial videos by using past and future keyframes. First, we introduce a compact auxiliary ...
AcademiCodec
├── academicodec
│   ├── utils.py # common parts of various models
│   ├── modules # common parts of various models
│   ├── ...
│   ├── quantization # common parts of various models
...
VALL-E 2 is the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time. Building upon the ...
Abstract: End-to-end image and video codecs are becoming increasingly competitive, compared to traditional compression techniques that have been developed through decades of manual engineering efforts ...