Voice Recognition Algorithm

  1. Investigation with cosine transform, and anti transform algorithm, with some voice recognition code.
  2. Translator: Croatian, English.
  3. 2D to 3D picture algorithm.

Perlbox Voice

Perlbox Voice is an voice enabled application to bring your desktop under your command. With a single word, you can start your web browser, your favorite editor or whatever you want. This is the Linux and Unix voice recognition solution.

Voice Conference Manager

Voice Conference Manager uses VoiceXML and CCXML to control speech recognition, text to speech, and voice biometrics for a telephone conference service. Say the names or numbers of people and VCM places them into the call. Can be hosted on public servers.

M68331 Voice Recognition System

This project will show how to implement the Hidden Markov Model approximations of Voice Recognition into embedded and low power systems.

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

Voxapl

Voxapl is a suite of voice-enabled applications supporting voice recognition and text to speech functions for home automation services. It is written in Java and communicates via xAP and/or xPL messaging.

Voice XML Enabling Software

Voice XML Enabling Software (VXES) is an application that connects a VoiceXML Interpreter, a telephony platform, and MRCP servers that provide services for Automatic Speech Recognition and Text to Speech Synthesis. C++, Windows & Linux OS supported.

The Adam Speech Recognition Server

The adam server is a voice activated framework in which to control your desktop and perform general systems administration. It utilizes the Sphinx-4 speech recognition engine and the FreeTTS speech synthesis engine.

TTSReader

TTSReader is a full-featured text-to-speech software package that allows reading text aloud as well as to wav or MP3 files. TTSReader is a complete solution for users that work with text-to-speech technology, supporting a multitude of voices and providing features for both realtime text-to-speech conversion as well as saving speech to audio files for later usage. Features include: Intuitive user interface design; Automatic highlighting of currently read text; Reading to wav, reading to MP3 with adjustable settings; Control tags support; Pronunciation corrections; Support for both SAPI4 and SAPI5 voices; Skipping of sentences or paragraphs while reading; Auto-reading the clipboard, global hotkeys; and Documentation provided for all features.

Text to Wav

Text To Wav is a text to speech software only for SAPI4.0. It includes the following features: Text or html convert to WAV files; having a lame_enc.dll, can convert to MP3 too; Speak aloud and highlight a text; Moving caret on the text or typing a key, the text speak loud; Setting a voice1 Japanese TTS Engine and a voice2 English, The voice is automatically switched; Can customize font size, color, and background color. Version 1.35 added Speak Prev/Next Sentence and Pause/Resume buttons.

SmartRead

SmartRead can translate text to speech (TTS technology ) to read out clearly. It is very easy to use, needn't have any speech knowledge, check the article for you, read long text, have very obvious advantages. Repair function, can repair when engine has some problem, especially to XP users. Supports Windows Vista. Version 0.80 adds MP3, and SWF(flash) file conversion.