Simple project list | Software map

132 projects in result
Last update: 2019-04-21 23:51

Julius

Julius is an open-source, high-performance large vocabulary continuous speech recognition (LVCSR) engine for speech-related research and development. With an HMM acoustic model and a language model, you can construct your own speech recognition system.

Moved to GitHub: https://github.com/julius-speech/julius

Development status: 4 - Beta, 5 - Production/Stable
Intended audience: Developers, End Users/Desktop
Natural language: English, Japanese
Operating system: Linux, Windows, OS Independent
Programming language: C
User interface: Console (Text Based)
Activity percentile: 2
Activity ranking: 155
Registration date: 2002-09-09 14:38
Last update: 2011-12-26 14:04

linphone

Linphone is an audio and video Internet phone with GTK+ and console interfaces. It uses the SIP protocol and is compatible with most SIP clients and gateways. It can use various audio and video codecs such as Speex, GSM, G.711, G.722, iLBC, AMR, Theora, H.263-1998, MPEG-4, H.264, VP8, and Snow.

Last update: 2009-03-25 07:41

FAAC

The FAAC project includes the AAC encoder FAAC and decoder FAAD2. It supports several MPEG-4 object types (LC, Main, LTP, HE AAC, PS) and file formats (ADTS AAC, raw AAC, MP4), multichannel and gapless en/decoding as well as MP4 metadata tags. The codecs are compatible with standard-compliant audio applications using one or more of these profiles.

Last update: 2008-07-24 11:29

Speex

Speex is a patent-free compression format designed especially for speech. It is specialized for voice communications at low bit-rates in the 2-45 kbps range. Possible applications include Voice over IP (VoIP), Internet audio streaming, audio books, and archiving of speech data (e.g. voice mail).

Last update: 2018-12-25 23:13

MMDAgent

MMDAgent is a toolkit for building voice interaction systems. Users can design their own dialogue scenarios, 3D agents, and voices. The software is released under the Modified BSD License.

Last update: 2005-11-14 13:35

PHP Voice

PHP Voice (formerly known as PHP VXML) contains four classes that assist in developing voice applications using PHP. It supports Speech Synthesis Markup Language 1.0, Speech Recognition Grammar Specification 1.0, Voice Browser Call Control: CCXML 1.0, and Voice Extensible Markup Language (VoiceXML) 2.0.

Last update: 2013-03-03 19:13

MisterHouse

MisterHouse is a Unix/Windows home automation program written in Perl. It can respond to voice commands, Web browsers, time of day, serial port and X10 data, external files, etc., and can speak via Text to Speech engines.

Last update: 2013-11-14 02:07

CMU Sphinx

CMU Sphinx, a Speech Recognition System, is transitioning to Open Source. The distribution contains a library (libsphinx2) and some small examples that link against it.

Last update: 2007-10-10 13:37

FlowDesigner

FlowDesigner is a data flow-oriented development environment. It can be used to build complex applications by combining small, reusable building blocks. In some ways, it is similar to both Simulink and LabView, but is hardly a clone of either.

Last update: 2008-12-23 17:37

eSpeak

eSpeak is a compact text-to-speech engine for good-quality English and other languages. Its clear articulation and good intonation make it suitable for listening to long text articles. It can speak text files from the command line, and it also operates as a "talker" within the KDE TTS system and with a Gnome Speech driver, as an alternative to Festival or other similar programs. Windows SAPI5 and command-line versions are also available.

Last update: 2004-10-25 23:57

Snack sound toolkit

The Snack sound extension adds commands for sound play/record and sound visualization, e.g. waveforms and spectrograms. It supports in-memory sound objects, file-based audio, streaming audio, the WAV, AU, AIFF, and MP3 file formats, and synchronous and asynchronous playback. The visualization canvas item types update in real time and can output PostScript. New commands and file formats can be added using the Snack C-API.

Last update: 2008-07-14 17:53

SingIt Lyric Displayer

The SingIt Lyric Displayer is a program to display formatted lyrics, including tagged text, CD+G, and id3v2xx lyrics. It consists of several displayers, an integrated editor, query, and karaoke tools. It supports various players, such as XMMS, Noatun, and Rhythmbox.

Last update: 2005-02-07 15:14

FreeTTS

FreeTTS is a speech synthesis system written entirely in Java. It is based upon Flite, a small runtime speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University.

Last update: 2007-01-03 20:13

Julius Speech Recognition Engine

Julius is a high-performance large vocabulary continuous speech recognition (LVCSR) engine for speech-related research and development. You can construct your own speech recognition system, but you need a separate English acoustic model and language model or grammar file.
