Giter Club home page Giter Club logo

crystal.ttvs's Introduction

Crystal Text-to-Visual-Speech (TTVS) Engine

Crystal TTVS engine is a real-time audio-visual Multilingual (Mandarin, Cantonese and English) speech synthesizer with a 3D expressive avatar.

The avatar model is parameterized according to the MPEG-4 facial animation standard, which offers a compact set of facial animation parameters (FAPs) and feature points (FPs) to enable realization of 20 visemes and 7 facial expressions. A set of TTVS engines (including Mandarin, Cantoense and English) converts the input phoneme sequence with time information into visemes and then further into FAP sequence. The 3D avatar animation is then rendered according to the FAP sequence by the Xface open source toolkit.

Framework

  • Real-time facial animation to generate synchronized lip movements on the avatar's face
  • Natual facial expression and head movement with human-like avatar
  • Platform independent framework for both stationary and embedded applications

Reference

Please use the following papers for reference to this project:

Support of Multilingual

The engine supports TTVS in (but not limited to) the following languages: Chinese Mandarin, Cantonese, and English. You can actually implement your TTVS engine by overriding the CSTHead::FapTTVS (/TTVS/FapTTVS.h/.cpp), just like CSTHead::FapMandarin, CSTHead::FapCantonese, or CSTHead::FapEnglish.

Support of Cross-platform

It is also possible to run the TTVS engine on different platforms, as the following figures illustrate.

Support of Expressive Head Animation and Facial Expression

Six basic expressions of the 3D avatar:

Head movement on the 3D avatar:

Compiling the Projects

Compiling

  1. Compile TinyXML

    • Open "TinyXML" solution in "TinyXML" directory
    • Set "tinyxmlSTL" project as "StartUp Project"
    • Set "Solution Platforms" to "Win32"
    • Compile "tinyxmlSTL" project in "Release" version with "Multi-threaded DLL (/MD)" option
    • Compile "tinyxmlSTL" project in "Debug" version (Optional) with "Multi-threaded debug DLL (/MDd)" option
  2. Complie Xface

    • Open "XFace.sln" solution in "XFace" directory
    • Set "XFace" project as "StartUp Project"
    • Set "Solution Platforms" to "Win32"
    • Compile "Xface" project in "Release" version
    • Compile "Xface" project in "Debug" version
  3. Compile TTVS

    • Open "TTVSTest.sln" solution in "TTVSTest" directory
    • Set "TTVSTest" project as "Startup Project"
    • Set "Solution Platforms" to "Win32"
    • Compile "TTVS" and "TTVSTest" project in "Release" version
    • Compile "TTVS" and "TTVSTest" project in "Debug" version

Runing the demo

  • Go to "binary/win32" directory
  • Go to "release" or "debug" directory
  • Run TTVSTest.exe
  • Select menu "File/Open" to open the "fdp" file in "data/talkinghead/Song/song.fdp"
  • Select menu "TTVS/Mandarin", "TTVS/Cantonese", "TTVS/English" to switch between different langauges.
  • Select menu "TTVS/Screen Animation" for demo animation.

References

  • Motion Pictures Expert Group, ISO/IEC 14496-2:1999/Amd. 1: 2000(E). International Standard, Information Technology - Coding of Audio-Visual Objects. Part 2: Visual; Amendment 1: Visual Extensions.
  • Balci, K., "Xface: MPEG-4 based Open Source Toolkit for 3D Facial Animation", [in] Advance Visual Interfaces, 399-402, 2004.
  • Xface 1.0 open source tools

crystal.ttvs's People

Contributors

johnsontsing avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

crystal.ttvs's Issues

Can we use this project test my own text?

Hello, After I compile this project, I found the apperance is not same as you show ? there no text input screen or any bottom for operate , but only can choose file open song.fdp show demo you supported.

CSTHead.dll unresolved externals

When compiling the third and last solution, the TTVSTest, I get an error for the CSTHead.dll.
IT says CSTHead.dll unresolved externals

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.