When will we get voice recognition for the command line (Bash/Dash)?



honeybear
April 20th, 2011, 11:41 PM
Hi,

It would be so welcome to have a daemon that runs on a machine, listens to the USB microphone, and executes voice commands based on a command-list file...

A bit like in these videos:

Vids about new arm, eyes and helmet (with voice recognition done):
http://www.youtube.com/watch?v=rWzC8ZTuss4
http://www.youtube.com/watch?v=QNqMW8nirbw
http://www.youtube.com/watch?v=ATkCIR_62b8
http://www.youtube.com/watch?v=67biXF-dFwc

Vista and Mac have pre-releases of voice recognition. There is no single, widely used Linux voice recognition solution that runs as a daemon.

Happy Tux! May Linux be with you always, even in your car (Audi?) with voice recognition, linked to your home intelligent main Linux Server!

deconstrained
April 20th, 2011, 11:46 PM
There are a number of projects (http://en.wikipedia.org/wiki/Speech_recognition_in_Linux#Solutions) underway, but unfortunately the effort to get speech-to-text or speech-to-action isn't as strong or unified as some would prefer it to be.

Once that's taken care of, I imagine that turning spoken commands into specific actions will just be a matter of tying a word or phrase to a given script/program.
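
To make that last step concrete, here is a minimal sketch in Python of what the "tying" could look like. It assumes the recognizer (whichever engine ends up being used) already hands back a plain-text transcript. The command-list format (`spoken phrase = command`) and the function names `parse_command_list` and `dispatch` are made up for illustration and do not come from any existing project.

```python
import shlex
import subprocess


def normalize(phrase):
    """Lowercase and collapse whitespace so matching is forgiving."""
    return " ".join(phrase.lower().split())


def parse_command_list(text):
    """Parse lines of the (hypothetical) form 'spoken phrase = command'.

    Blank lines, '#' comments, and malformed lines are skipped.
    Returns a dict mapping normalized phrase -> argv list.
    """
    commands = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        phrase, sep, command = line.partition("=")
        if not sep or not phrase.strip() or not command.strip():
            continue
        commands[normalize(phrase)] = shlex.split(command)
    return commands


def dispatch(transcript, commands, runner=subprocess.run):
    """Run the command tied to the transcript, if there is one.

    Only exact (normalized) phrase matches are executed; anything
    else is ignored and None is returned. No shell is involved,
    because the command is passed to the runner as an argv list.
    """
    argv = commands.get(normalize(transcript))
    if argv is None:
        return None
    return runner(argv, check=False)


if __name__ == "__main__":
    # Example command list; in a daemon this would be read from a file.
    example = """
    # spoken phrase = command
    what time is it = date
    say hello = echo hello
    """
    table = parse_command_list(example)
    dispatch("Say  Hello", table)
```

Because only phrases listed in the file are ever executed, an unrecognized or unexpected utterance simply does nothing. A real daemon would wrap `dispatch` in a loop fed by the speech engine.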

Dustin2128
April 20th, 2011, 11:54 PM
The problem with speech recognition: Fired employees running out the door while screaming "FORMAT C, RM / !!!"

honeybear
April 21st, 2011, 03:46 AM
Here are the solutions for Linux; however, I cannot find any of them in the repositories, for some unknown reason :(

Second, none of them is daemon- or command-line-oriented... X11 is always needed.


Solutions

The following is a list of current projects dedicated to implementing speech recognition in Linux, as well as major (though mostly incomplete) native solutions that are available as of March 2008:

* CVoiceControl is a KDE and X Window independent version of its predecessor KVoiceControl
* DynaSpeak from SRI International (speaker-independent speech recognition software development kit that scales from small- to large-scale systems, for use in commercial, consumer, and military applications)
* CMU Sphinx is a general term to describe a group of speech recognition systems developed at Carnegie Mellon University.
* GnomeVoiceControl is a dialogue system to control the GNOME Desktop that was developed in the Google Summer of Code in 2007.
* Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers.
* LumenVox Speech Engine is a commercial library for Linux and Windows for inclusion in other software. It has been integrated into the Asterisk private branch exchange system [2]
* Open Mind Speech is a part of the Open Mind Initiative and aims to develop free (GPL) speech recognition tools and applications, as well as collect speech data.
* PerlBox is a Perl-based control and speech output application.
* Simon (This project aims at being extremely flexible to compensate dialects or even speech impairments; requires HTK and Julius)
* VoxForge is a free speech corpus and acoustic model repository for open source speech recognition engines.
* Xvoice (requires ViaVoice to function)
* Verbio ASR is a commercial speech recognition server for Linux and Windows platforms.
* Platypus is an open-source shim that allows Dragon NaturallySpeaking running under Wine to work with any Linux X11 application.
* Vedics is a speech assistant for the GNOME environment.