Expressive_tacotron

screenshot of Expressive_tacotron

Tensorflow Implementation of Expressive Tacotron

Overview

This product is a TensorFlow implementation of the paper "Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron," focusing on verifying its concept. The project utilizes the LJ Speech Dataset for training and provides steps for installation, training, and sample synthesis for evaluating the model's performance.

Features

  • Implementation of Research Paper: Implements the concepts from the specified research paper on expressive speech synthesis.
  • Training Model: Provides guidelines for training the model using the LJ Speech Dataset.
  • Sample Synthesis: Generates speech samples for evaluation based on a specific script and reference audio clips.
  • Analysis of Results: Offers insights into the model's performance and the effectiveness of prosody transfer.

Summary

This TensorFlow implementation of the expressive Tacotron model aims to replicate the concepts outlined in a research paper on prosody transfer for speech synthesis. By utilizing the LJ Speech Dataset and following specific training and evaluation steps, users can explore the effectiveness of the model in synthesizing expressive speech. The analysis of sample synthesis results provides insights into the model's capacity for prosody embedding and transfer, prompting further research possibilities in the field of speech synthesis.