
Thread: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

  1. #121
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Hmm, well, it's already done compiling. I'm not sure how to install it for use with festival, and I don't have time to fool around with it right now.

  2. #122
    Join Date
    Apr 2006
    Beans
    22
    Distro
    Ubuntu 9.04 Jaunty Jackalope

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Quote Originally Posted by mrplow View Post
    You think 12-18 hours is bad? I'm currently compiling HTS-demo_CMU-ARCTIC-ADAPT and its INSTALL file says it should take 2-3 days! I'll let you know how it goes. I'm on 10.10, by the way.
    Please tell us how it goes after it finishes. With a 50-hour build, saving even one person from making the same mistake would help a lot.

  3. #123
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Oops, it was still running: top showed my CPU still at 90%. I rebooted my computer and restarted the build 8 hours ago. It now appears to be done, and the last few lines in the terminal are:
    Code:
    echo "Running a training/synthesis perl script (Training.pl) in background...."
    Running a training/synthesis perl script (Training.pl) in background....
    /usr/bin/perl scripts/Training.pl scripts/Config.pm > log 2>&1 &
    and my CPU is still at 90%. I'm not sure how it will notify me when it's done, since the make command has already finished and Training.pl keeps running in the background. I guess I'll just watch for my CPU to drop back to normal.
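
    Rather than watching the CPU meter, one option is to poll the background process until it exits. The following is only a minimal sketch: the helper name wait_for_pid is made up for illustration, and it assumes Training.pl runs as the same user, since kill -0 merely checks whether the PID can still be signalled.

```shell
#!/bin/sh
# Block until the process with the given PID is gone.
# Usage: wait_for_pid PID [POLL_SECONDS]
# kill -0 sends no signal; it only reports whether the PID can be signalled,
# so this assumes the process belongs to the current user.
wait_for_pid() {
    pid=$1
    interval=${2:-60}   # poll once a minute unless told otherwise
    while kill -0 "$pid" 2>/dev/null; do
        sleep "$interval"
    done
}
```

    It could be used along the lines of: wait_for_pid "$(pgrep -o -f Training.pl)"; tail -n 20 log. The process exiting does not mean the training succeeded, so the end of the log file still needs checking.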

  4. #124
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Damn it, the computer crashed (stupid Flash plugin)... Starting again. 24 hours of CPU time wasted so far.

  5. #125
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Argh, 24 hours in this time and it hard locked. My computer sure doesn't like having its CPU taxed at 100% for long periods. 48 hours of CPU time wasted so far...

  6. #126
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    starting again...

  7. #127
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Argh, 50 hours in and it crashed again... over 100 hours wasted so far, lol

  8. #128
    Join Date
    Dec 2009
    Beans
    12

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Quote Originally Posted by mrplow View Post
    Argh, 50 hours in and it crashed again... over 100 hours wasted so far, lol
    Before anyone spends any more time on this, I thought I should copy an email I got from the festival mailing list this afternoon - there's a new version! And it mentions these voices. Here it is:

    Code:
    message from Alan W Black <HIDDEN@cs.cmu.edu> to festival-talk
    = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
    
           The Festival Speech Synthesis System version 2.1
            and Edinburgh Speech Tools Library version 2.1
                           November 2010
    
    Surprisingly we have a new release.  Please give feedback for
    installation issues so they can be fixed in a 2.1 release.
    
    Festival offers a general framework for building speech synthesis
    systems as well as including examples of various modules.  As a whole
    it offers full text to speech through a number APIs: from shell level,
    though a Scheme command interpreter, as a C++ library, from Java, and
    an Emacs interface.  Festival is multi-lingual (currently English
    (British and American), and Spanish) though English is the most
    advanced.  Other groups release new languages for the system.  And
    full tools and documentation for build new voices are available
    through Carnegie Mellon's FestVox project (http://festvox.org).  This
    version also supports voices built with the latest version of Nagoya
    Institute of Technologies' HTS system (http://hts.sp.nitech.ac.jp)
    
    The system is written in C++ and uses the Edinburgh Speech Tools
    Library for low level architecture and has a Scheme (SIOD) based
    command interpreter for control.  Documentation is given in the FSF
    texinfo format which can generate, a printed manual, info files and
    HTML.
    
    Festival is free software.  Festival and the speech tools are
    distributed under an X11-type licence allowing unrestricted commercial
    and non-commercial use alike.
    
    This distribution includes:
      * Full English (British and American English) text to speech
      * Full C++ source for modules, SIOD interpreter, and Scheme library
      * Lexicon based on CMULEX and OALD (OALD is restricted to non-commercial
        use only)
      * Edinburgh Speech Tools, low level C++ library
      * rab_diphone: British English Male residual LPC, diphone
      * kal_diphone: American English Male residual LPC diphone
      * cmu_us_slt_arctic_hts: American Female, HTS
      * cmu_us_rms_cg: American Male using  clustergen
      * cmu_us_awb_cg: Scottish English Male (with US frontend) clustergen
      * Full documentation (html, postscript and GNU info format)
    
    Note there are some licence restrictions on the voices themselves.
    The US English voices have the same restrictions as Festival.
    
    The UK lexicon (OALD) is restricted to non-commercial use.
    
    Addition voices are also available.
    
    Festival version 2.1 sources, voices
    
    In Europe:
       http://www.cstr.ed.ac.uk/downloads/festival/
    In North America:
       http://festvox.org/festival
    
    Requirements
    
    To run Festival you need:
      * A Unix-like environment, e.g Linux, FreeBSD, OSX, cygwin under Windows.
      * A C++ compiler: we have used GCC  versions. 2.x upto 4.5
      * GNU Make any recent version
    
    New in 2.1
      * Support for the new versions of C++ that have been released
      * Integrated and updated support for HTS, Clustergen, Multisyn and Clunits
        voices
      * "Building Voices in Festival" document describing process of building
        new voices in the system
          http://festvox.org/
    
    Alan W Black (CMU)
    Rob Clark (Edinburgh)
    Junichi Yamagishi (Edinburgh)
    Keiichiro Oura (Nagoya)
    = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
    =    University of Edinburgh's Festival Speech Synthesis System       =
    = http://festvox.org/festival      Sent Via festival-talk@festvox.org =
    =                           To unsubscribe mail majordomo@festvox.org =
    = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =

  9. #129
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Well, that was fun. I'm not sure how far I got, but 70 hours into the build I eventually ran into this error:
    Code:
    ====================================================================================
    Start synthesizing waveforms (speaker independent) at Thu Nov 11 18:38:44 PST 2010
    ====================================================================================
    
    Processing directory /home/mrplow/Desktop/HTS/HTS-demo_CMU-ARCTIC-ADAPT/HTS-demo_CMU-ARCTIC-ADAPT/gen/qst001/ver1/SI/0:
     Synthesizing a speech waveform from cmu_us_arctic_slt_alice01.mgc and cmu_us_arctic_slt_alice01.lf0.../usr/bin/sox: invalid option -- w
    /usr/bin/sox FAIL sox: invalid option
    
    /usr/bin/sox: SoX v14.3.1
    
    Usage summary: [gopts] [[fopts] infile]... [fopts] outfile [effect [effopt]]...
    
    SPECIAL FILENAMES (infile, outfile):
    -                        Pipe/redirect input/output (stdin/stdout); may need -t
    -d, --default-device     Use the default audio device (where available)
    -n, --null               Use the `null' file handler; e.g. with synth effect
    -p, --sox-pipe           Alias for `-t sox -'
    
    SPECIAL FILENAMES (infile only):
    "|program [options] ..." Pipe input from external program (where supported)
    http://server/file       Use the given URL as input file (where supported)
    
    GLOBAL OPTIONS (gopts) (can be specified at any point before the first effect):
    --buffer BYTES           Set the size of all processing buffers (default 8192)
    --clobber                Don't prompt to overwrite output file (default)
    --combine concatenate    Concatenate all input files (default for sox, rec)
    --combine sequence       Sequence all input files (default for play)
    -D, --no-dither          Don't dither automatically
    --effects-file FILENAME  File containing effects and options
    -G, --guard              Use temporary files to guard against clipping
    -h, --help               Display version number and usage information
    --help-effect NAME       Show usage of effect NAME, or NAME=all for all
    --help-format NAME       Show info on format NAME, or NAME=all for all
    --i, --info              Behave as soxi(1)
    --input-buffer BYTES     Override the input buffer size (default: as --buffer)
    --no-clobber             Prompt to overwrite output file
    -m, --combine mix        Mix multiple input files (instead of concatenating)
    -M, --combine merge      Merge multiple input files (instead of concatenating)
    --magic                  Use `magic' file-type detection
    --multi-threaded         Enable parallel effects channels processing (where
                             available)
    --norm                   Guard (see --guard) & normalise
    --play-rate-arg ARG      Default `rate' argument for auto-resample with `play'
    --plot gnuplot|octave    Generate script to plot response of filter effect
    -q, --no-show-progress   Run in quiet mode; opposite of -S
    --replay-gain track|album|off  Default: off (sox, rec), track (play)
    -R                       Use default random numbers (same on each run of SoX)
    -S, --show-progress      Display progress while processing audio data
    --single-threaded        Disable parallel effects channels processing
    --temp DIRECTORY         Specify the directory to use for temporary files
    --version                Display version number of SoX and exit
    -V[LEVEL]                Increment or set verbosity level (default 2); levels:
                               1: failure messages
                               2: warnings
                               3: details of processing
                               4-6: increasing levels of debug messages
    FORMAT OPTIONS (fopts):
    Input file format options need only be supplied for files that are headerless.
    Output files will have the same format as the input file where possible and not
    overriden by any of various means including providing output format options.
    
    -v|--volume FACTOR       Input file volume adjustment factor (real number)
    --ignore-length          Ignore input file length given in header; read to EOF
    -t|--type FILETYPE       File type of audio
    -s/-u/-f/-U/-A/-i/-a/-g  Encoding type=signed-integer/unsigned-integer/floating
                             point/mu-law/a-law/ima-adpcm/ms-adpcm/gsm-full-rate
    -e|--encoding ENCODING   Set encoding (ENCODING in above list)
    -b|--bits BITS           Encoded sample size in bits
    -1/-2/-3/-4/-8           Encoded sample size in bytes
    -N|--reverse-nibbles     Encoded nibble-order
    -X|--reverse-bits        Encoded bit-order
    --endian little|big|swap Encoded byte-order; swap means opposite to default
    -L/-B/-x                 Short options for the above
    -c|--channels CHANNELS   Number of channels of audio data; e.g. 2 = stereo
    -r|--rate RATE           Sample rate of audio
    -C|--compression FACTOR  Compression factor for output format
    --add-comment TEXT       Append output file comment
    --comment TEXT           Specify comment text for the output file
    --comment-file FILENAME  File containing comment text for the output file
    --no-glob                Don't `glob' wildcard match the following filename
    
    AUDIO FILE FORMATS: 8svx aif aifc aiff aiffc al amb amr-nb amr-wb anb au avr awb caf cdda cdr cvs cvsd cvu dat dvms f32 f4 f64 f8 fap flac fssd gsm gsrt hcom htk ima ircam la lpc lpc10 lu mat mat4 mat5 maud nist ogg paf prc pvf raw s1 s16 s2 s24 s3 s32 s4 s8 sb sd2 sds sf sl smp snd sndfile sndr sndt sou sox sph sw txw u1 u16 u2 u24 u3 u32 u4 u8 ub ul uw vms voc vorbis vox w64 wav wavpcm wv wve xa xi
    PLAYLIST FORMATS: m3u pls
    AUDIO DEVICE DRIVERS: alsa
    
    EFFECTS: allpass band bandpass bandreject bass bend biquad chorus channels compand contrast crop+ dcshift deemph delay dither divide+ earwax echo echos equalizer fade filter* fir firfit+ flanger gain highpass input# key* ladspa loudness lowpass mcompand mixer noiseprof noisered norm oops output# overdrive pad pan* phaser pitch polyphase* rabbit* rate remix repeat resample* reverb reverse riaa silence sinc spectrogram speed splice stat stats stretch swap synth tempo treble tremolo trim vad vol
      * Deprecated effect    + Experimental effect    # LibSoX-only effect
    EFFECT OPTIONS (effopts): effect dependent; see --help-effect
    Error in /usr/local/SPTK/bin/excite -p 80 /home/mrplow/Desktop/HTS/HTS-demo_CMU-ARCTIC-ADAPT/HTS-demo_CMU-ARCTIC-ADAPT/gen/qst001/ver1/SI/0/cmu_us_arctic_slt_alice01.pit | /usr/local/SPTK/bin/mglsadf -m 24 -p 80 -a 0.42 -c 0 /home/mrplow/Desktop/HTS/HTS-demo_CMU-ARCTIC-ADAPT/HTS-demo_CMU-ARCTIC-ADAPT/gen/qst001/ver1/SI/0/cmu_us_arctic_slt_alice01.mgc | /usr/local/SPTK/bin/x2x +fs | /usr/bin/sox -c 1 -s -w -t raw -r 16000 - -c 1 -s -w -t wav -r 16000 /home/mrplow/Desktop/HTS/HTS-demo_CMU-ARCTIC-ADAPT/HTS-demo_CMU-ARCTIC-ADAPT/gen/qst001/ver1/SI/0/cmu_us_arctic_slt_alice01.wav
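    It can probably be fixed by changing line 248 of scripts/Config.pm, which currently reads
    $SOXOPTION = 'w';
    SoX v14.3.1 rejects -w, and its usage text above lists -1/-2/-3/-4/-8 for the sample size in bytes instead, so '2' (16-bit samples) looks like the likely replacement, though that is untested. I've spent enough time on this, so I'll wait until someone tries out the new festival and reports back.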
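
    For anyone who wants to try that change, here is a rough sketch of how it might be scripted. Both helper names (pick_sox_option, set_sox_option) are invented for illustration, and the choice of '2' is an assumption based only on the usage text above, not something confirmed against the HTS demo scripts.

```shell
#!/bin/sh
# Read SoX usage text on stdin and print the sample-size letter to try in
# $SOXOPTION: "2" if the text lists the -1/-2/-3/-4/-8 byte-size flags
# (as the v14.3.1 output above does), otherwise the original "w".
pick_sox_option() {
    if grep -q -- '-1/-2/'; then
        echo 2
    else
        echo w
    fi
}

# Rewrite the $SOXOPTION assignment in a Config.pm-style file, keeping the
# original as FILE.bak. The line is matched by content, not by line number.
# Usage: set_sox_option FILE LETTER
set_sox_option() {
    file=$1
    opt=$2
    cp "$file" "$file.bak" &&
    sed "s/^\([[:space:]]*[$]SOXOPTION[[:space:]]*=[[:space:]]*\)'[^']*'/\1'$opt'/" \
        "$file.bak" > "$file"
}
```

    Usage would be something like: set_sox_option scripts/Config.pm "$(sox --help 2>&1 | pick_sox_option)". Whether the demo can then resume from the synthesis step, rather than retraining from scratch, is not something this thread answers.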

  10. #130
    Join Date
    Aug 2007
    Beans
    33
    Distro
    Ubuntu 7.10 Gutsy Gibbon

    Re: HOWTO: Make festival TTS use better voices (MBROLA / CMU / HTS)

    Oh yeah, the working directory ended up at 29 GB, and who knows how much further it would have grown. Just FYI.
