User contributions for Nicoboomer

Search for contributionsExpandCollapse
⧼contribs-top⧽
⧼contribs-date⧽

2 March 2023

  • 17:0717:07, 2 March 2023 diff hist +871 Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E)→‎Vall-E
  • 16:5616:56, 2 March 2023 diff hist +2,938 N Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E)Created page with "{{see also|Papers}} ==Introduction== In the last decade, there have been significant advances in speech synthesis via neural networks and end to end modeling. Current text-to-speech (TTS), systems require high-quality data from recording studios. They also suffer from poor generalization for unseen speaker in zero-shot situations. A new TTS framework, VALL-E, has been developed to address this issue. It uses audio codec codes for an intermediate representation as well a..."
  • 16:3516:35, 2 March 2023 diff hist +2,867 N Template:Paper infoboxCreated page with "<includeonly><div class="model"> <div class="heading">[[{{PAGENAME}}]]</div> <div style="width: 500px; display: flex; flex-direction: column;"> <div class="model-infobox-row"> {{#if:{{{name|}}}| <div class="model-infobox-cell">'''Name'''</div> <div class="model-infobox-cell">[[Has paper name::{{{name}}}]]</div> }} </div> <div class="model-infobox-row"> {{#if:{{{type|}}}| <div class="model-infobox-cell">'''Type'''</div> <div class="model-infobox-cell">{{#arraymap:{{{ty..."