Skip to content

Latest commit

 

History

History
864 lines (611 loc) · 46.3 KB

CHANGELOG.md

File metadata and controls

864 lines (611 loc) · 46.3 KB

MaryTTS

Changed

  • Build with Gradle v7.6.4
  • Improve publishing
  • Ported runtime Groovy classes to Java (specifically, in Luxembourgish language component)
  • Upgraded dependencies

Fixed

  • Migrate CI/CD from Travis to GitHub Actions
  • Add compability with Java 11 and 17
  • Lock down Java compatibility to 1.8
  • Reduce non-API dependency leakage (particularly groovy-all)
  • Dependency resolution errors following JCenter shutdown
  • Javadoc errors in legacy, third-party code

Removed

  • Legacy Maven POMs

Security

  • Upgraded Commons IO to v2.18.0, fixing
    • CVE-2021-29425
    • CVE-2024-47554
  • Upgraded Guava to v33.4.0-jre, fixing
    • CVE-2020-8908
    • CVE-2023-2976
  • Upgraded HSQLDB to v2.7.4, fixing
    • CVE-2022-41853
  • Upgraded Log4J to v2.24.3, fixing
    • CVE-2019-17571
    • CVE-2021-4104
    • CVE-2022-23302
    • CVE-2022-23305
    • CVE-2022-23307
  • Upgraded SLF4J to v2.0.16, fixing
    • CVE-2020-9488
    • CVE-2021-44228
    • CVE-2021-44832
    • CVE-2021-45046
    • CVE-2021-45105

See all changes since v5.2.1

v5.2.1 (2022-05-25)

This is a maintenance release to provide artifacts on Maven Central following the shutdown of JCenter, and also to mitigate the Log4Shell vulnerabilities.

Changes

Artifact Hosting

  • Artifacts are now available on Maven Central, following the shutdown of JCenter/Bintray

Security

  • Log4J dependencies have been updated to the latest versions, mitigating Log4Shell vulnerabilities

See all changes since v5.2

v5.2 (2016-09-15)

This is expected to be the last milestone release in the 5.x generation of MaryTTS.

Improvements

  • Support for Luxembourgish
  • Completely rewritten text preprocessing module for English (removing reliance on FreeTTS)
  • Better support for Java 8
  • Some migration towards building with Gradle

Voicebuilding

  • All unit-selection voices have been rebuilt using the new Gradle plugin
  • All HSMM voices are now hosted on Bintray and can be resolved as dependencies
  • Voices with open data now have open-source voicebuilding repositories hosted on GitHub, which include downloadable distribution packages

Development news

  • #533: support system preferredmodule
  • #525: Update dependencies and maven plugins
  • #523: solving quotes phonetisation problem
  • #506: Duplicate subexpressions in WeightedCodebookMfccMapper.java
  • #499: Upgrade Apache Commons Collections to v3.2.2
  • #493: Drop assembly plugin
  • #425: various fixes for C++ client source code
  • #399: Component installer License download re-code following #395
  • #393: package.html files converted into package-info.java files
  • #362: add support for Groovy
  • #353: Swap inlined third-party code with dependencies
  • #351: update of maven plugins used for the website and build plugins
  • #333: remove subscription/post links to archived mary-dev mailing list
  • #330: show port number when starting MaryTTS server
  • #320: move outdated example code from runtime assembly into doc directory
  • #309: try to process tokens if they contain word characters, even when they are tagged as punctuation
  • #228: fix on drop FreeTTS dependencies
  • #227: Enhanced OutputStreams for Mary Client
  • #217: incrementality changes

Fixed Issues/Bugs

  • #593: Don't split up multiple punctuation marks in tokenization
  • #570: Praat TextGrid output is invalid with boundaries (times are not monotonic)
  • #564: add missing TOKENS examples
  • #555: HMMModel generates malformed XML duration attributes
  • #531: java.awt.HeadlessException in (Half)PhoneLabelFeatureAligner
  • #516: Single words conduct to have a wrong POS which leads to a crash of the target feature module
  • #515: preprocessing contraction and double quotes correction
  • #503: halfphoneUnitFeatureDefinition_ac.txt does not have any continuous features even though halfphoneFeatures_ac.mry does
  • #480: IBAN code → stacktrace
  • #469: APML is broken
  • #468: SABLE is broken
  • #467: SIMPLEPHONEMES is broken
  • #465: enable acoustic features by default
  • #460: Tokens mistakenly POS-tagged as punctuation cause wrong boundary insertion
  • #458: VoiceCompiler generates invalid package name from db.voicename property
  • #452: Disable assertions in user startup scripts
  • #448: unit selection: final boundary durations synthesized 50% shorter than requested
  • #428: error in marytts cart DecisionNode
  • #421: Force English locale for parsing date when English language is used
  • #409: ensure that ICU4J's resource is read with the correct encoding, regardless of environment
  • #398: Use https URLs whenever possible
  • #395: component installer hangs if licenses cannot be downloaded
  • #375: add Groovy script to generate component descriptor XML and fix POM template
  • #369: handle exceptions on missing or malformed userdict entries
  • #365: upgrade groovy-maven (formerly gmaven) plugin to solve noClassDefFoundError when running MaryTTS server
  • #359: don't append an /6/ to the previous syllable if that syllable is not adjacent
  • #354: move custom jtok resources into jtok-user
  • #352: javadoc fails with Java 8
  • #342: workaround for NullPointerException in syllables that violate sonority constraints
  • #341: temporarily handle digit suffix stress notation from legacy LTS CARTs until these are rebuilt
  • #322: drop transitional punctuation POS tag logic
  • #314: not processing null results from phonemise methods
  • #237: fix for incorrect linear interpolation in MathUtils.interpolateNonZeroValues
  • #213: fix for rate adjustment
  • #206: fix for LTSTrainerTest failure on unexpected file.encoding
  • #204: fix for Locale null breaking MaryServer
  • #202: URISyntaxException avoids WikipediaMarkupCleanerTest failing if workspace contains space
  • #198: fix for closing fileOutputStream after audio save
  • #185: fix for EnvironmentTest failure on Java 8

See all changes since v5.1.2

v5.1.2 (2015-01-23)

Another milestone release with several improvements and fixes.

Improvements

  • For French, numbers in the input text were silently ignored; this has been improved using a spellout formatter from ICU4J. This fix can (and will soon) also be applied to other languages that are missing a text Preprocess module.
  • A new rule-based Syllabfier class has been implemented.
  • All releases are now hosted on GitHub again; references to the Downloads on Bitbucket have been updated accordingly.

Development news

  • Targets are cached so these do not get recreated, and can be re-used later.
  • Words are now added to wordlist in batches for new language components.
  • A toString() helper method was added to MaryData class to help with debugging.

Fixed Issues/Bugs

  • #268: getAllophone() no longer silently returns null on invalid input.
  • #267: REALISED_DURATIONS and REALISED_ACOUSTPARAMS are no longer (mistakenly) available as input types.
  • #266: Syllabification no longer ignores any provided stress diacritics.
  • #258: Byte overflow in TargetFeatureComputer -- issue with limit of array.
  • #218: a voice would not be built correctly following the groupId change in v.5.1.1.
  • #154: trailing whitespace in config files could prevent correct parsing.

See all changes since v5.1.1

v5.1.1 (2014-10-05)

A milestone release that stabilizes changes in build and hosting infrastructure and fixes several issues.

Language updates

  • Telugu and Turkish: two voices were added which had not been rebuilt since v4.0.0.
  • added TOKENS example text for Italian, Turkish, British English, Telugu, and Russian.
  • Voice components are sorted by locale and name in download/marytts-components.xml.

Documentation updates

  • Added to the documentation section on the mary site is a page on MaryTTS's history.
  • Also added a point on MP3 support to the FAQ.
  • Minor tweaks to navigation menu, earlier version release notes, download links, etc.

Development news

New Infrastructure:
  • Continuous Integration is now provided by Travis CI.
  • The web server running at http://mary.dfki.de has been replaced with a new machine, which hosts the latest stable website content and runs the latest stable build as an online demo.
  • The latest release artifacts are hosted at Bintray and indexed in jCenter.
  • The latest development (SNAPSHOT) artifacts are hosted at OJO.
  • A website built from the latest development (SNAPSHOT) version is hosted at http://marytts.github.io/.
Build environment:
  • All supported languages have been moved into a container module marytts-languages.
  • Several maven plugins were updated to the latest versions.
  • Building MaryTTS now requires Maven 3.0.2 or later.

Fixed Issues/Bugs

  • added missing voice resources that had been present in v5.0 before hosting had switched servers, as well as fixing some missing license files.
  • #207: Deploying to Bintray/jCenter required changing the groupId from marytts to de.dfki.mary.
  • #206: LTSTrainerTest would fail if UTF-8 encoding was not specified
  • #204: A locale set to null no longer breaks the Mary server
  • #202: WikipediaMarkupCleanerTest failed if workspace contains space
  • #185: EnviromentTest no longer fails with Java 8

See all changes since v5.1

v5.1 (2014-07-16)

Following significant restructuring introduced in v5.0, this is a milestone release to stabilize numerous new features and fixes over the past two years.

Apart from numerous fixed issues, there are several notable improvements in this version.

French language support

Thanks to Sathish Pammi's work at ENST and UPMC in Paris, there is support for French TTS. This supersedes an earlier, experimental implementation, which relied on external, native resources with an incompatible license.

Moreover, thanks to the great work and generosity of the speakers, there are a number of French voices built from three open-source corpora:

Improved Italian language support

Thanks to Fabio Tesser and Giulio Paci at ISTC in Padova, numerous improvements were made to Italian TTS.

This also includes the resurrection of the HTK Labeler used for voicebuilding as an alternative to the EHMM Labeler, which may produce better results under certain conditions.

Documentation

Although much of the documentation has yet to be updated, the MaryTTS website itself is now built directly from the MaryTTS source code (using the Maven Site Plugin). This unifies the fossilized legacy content with new documentation, including the GitHub-hosted wiki, and makes it easy to manage.

Easier integration

Finally, thanks to cloud hosting at Bintray, it is now possible to integrate MaryTTS directly into other applications without the requirement to install it first locally, simply by adding the following repository block to your pom.xml:

<repository>
  <id>bintray</id>
  <url>https://dl.bintray.com/marytts/marytts</url>
</repository>

Development news

There has been some fluctuation in the core development team since the release of v5.0; departures and other responsibilities have resulted in less activity than we would have liked.

Fortunately, we do finally have several new projects for MaryTTS, and several exciting features waiting the wings, ready to be rebased on a stable version (this one), including major overhauls of the voicebuilding toolchain and the new language support, and a new web-based installer. Expect these to land in the near future.

See all changes since v5.0

v5.0 (2012-06-14)

MARY TTS 5.0 is the first release from the thoroughly restructured code base.

MARY TTS 5.0 is better supported by automated tests than any previous version of MARY TTS, but it may well be that in practical use some hiccups will appear that have not surfaced in testing. Therefore cautious users are advised to treat this as a beta release.

Simpler installation

Installing MARY TTS is now performed by simply unpacking the zip archive at the target location. No clicking through installer pages anymore. In particular, it is now trivial to install MARY TTS on a server without a GUI connection.

The component installer, bin/marytts-component-installer.sh, still uses a GUI; see issue #43 for a workaround.

Simplified use of MARY TTS in your own projects

It is now possible to use MARY TTS, with HMM-based voices at least, simply by placing the right files into the classpath. For example, to use US English voice cmu-slt-hsmm in your own code, add the following jar files to your classpath:

marytts-server-5.0-jar-with-dependencies.jar
marytts-lang-en-5.0.jar
voice-cmu-slt-hsmm-5.0.jar

Instead of marytts-server-5.0-jar-with-dependencies.jar you can also include the individual dependencies, which can be automated using maven; the source code on github includes examples for doing this in the user-examples folder.

New MaryInterface API

Using MARY TTS programmatically gets a lot simpler with MARY TTS 5.0 through the new MaryInterface API. The same API can be used to access the TTS components running within the same java process or as a separate client-server setup. For details, see MaryInterface.

Emotion Markup Language support

MARY TTS 5.0 includes an implementation of W3C's Emotion Markup Language as a means of requesting expressive synthetic speech. The result of course depends on the expressive capabilities of the selected synthesis voice; try out the EMOTIONML example with the German dfki-pavoque-styles voice on the demo page.

Modularized code base

The MARY TTS source code has been modularized for version 5.0.

The modular structure also makes it easier to see which components belong to a given language: sub-projects such as marytts-lang-de, marytts-lang-en etc. contain all code and data that is language-specific. The updated New Language Support documentation describes how to create a new language sub-project and integrate it into the system.

Distributed hosting of installable voices

The maintenance of the list of installable voices has been decentralized: if you build a voice and wish to make it available to others, you can host it somewhere on the net, e.g. on Google Drive, Dropbox or similar. For details, see Publishing a MARY TTS Voice.

New structures for improved code quality

MARY TTS 5.0 mostly results from the aim to establish sound software engineering practices in MARY TTS development. Aspects of this include:

  • the build environment was changed to maven, allowing us to write proper unit and integration tests for the code, and run them at every build;
  • the code base was moved to github, allowing us to work as a distributed team, to develop features in whichever constellations arise, and to merge them into a stable master repository when ready;
  • a Continuous Integration service, kindly provided by BuildHive, automatically checks whether the code, and even the latest GitHub pull requests, build OK including all tests passing.

See all changes since v4.3.1

v4.3.1 (2011-11-30)

This is a minor release, whose main purpose it is to enable Italian TTS (see below). It also includes small but useful improvements enabling the use of MARY TTS with screen readers on Linux (see below).

New language: Italian

Thanks to the great work by Fabio Tesser and co-workers at ISTC Padova, Italy, we can now make available support for Italian in MARY TTS. To install the Italian voice, first download and install MARY TTS from the link above, then run the MARY Component installer, click "Update" to get the list of available languages and voices, and select the Italian Language and the istc-lucia-hsmm voice.

Improvements

Bill Cox has kindly provided a patch resulting in substantial reduction of time-to-audio for HMM-based voices. The key idea is to use the vocoder in streaming mode rather than produce all audio before sending it onwards. Since only the socket server, but not the http server, supports streaming audio, you need to use the socket server or a custom integration mechanism to benefit from this improvement.

Critically, however, this has improved the responsiveness of MARY TTS sufficiently to allow using MARY TTS with screen readers on Linux!

New voices

Bill also provided two male US English voices built from Arctic data sets, cmu-rms-hsmm and cmu-bdl-hsmm. According to him, specifically the rms voice is quite intelligible at high speed (with the speedup done in a post-processing step currently).

See all changes since v4.3.0

v4.3.0 (2010-12-20)

This is a feature release, adding new features while maintaining compatibility with existing 4.x voices.

This release marks the final results of work on MARY TTS in the PAVOQUE project, in which we experimented with different technologies for adding expressivity to unit selection synthesis. The release makes available those project results that may be of interest to a wider audience.

New features for expressive unit selection synthesis

  • selecting style from a multi-style database using a symbolic style feature;
  • imposing target prosody using FD-PSOLA signal modification.

Style can be selected using RAWMARYXML's <prosody style="..."> markup (see new expressive voice, below).

Prosody modification is available for all unit selection voices, including older ones; to activate it, click the checkbox "Apply prosody modification" in the web interface. This feature should be considered experimental, and the quality depends on many factors, including the accuracy of the pitchmarks used for building the unit selection voice. While this feature is likely to lead to reduced quality, it enables research on expressive prosody with unit selection voices.

For more information on the MaryXML <prosody> markup which can now be applied to all types of MARY voices, see ProsodySpecificationSupport.

New expressive voice

  • we release the multi-style expressive German voice 'dfki-pavoque-styles' (660 MB) built from the full PAVOQUE corpus; see Steiner et al. (2010) for a description of this corpus. The different styles can be requested using RAWMARYXML <prosody style="A_STYLE">...</prosody>, where A_STYLE is one of happy, angry, sad, and poker.

New language: Russian

  • Nickolay Shmyrev has kindly made available language support for Russian, as well as the Russian male unit selection voice voxforge-ru-nsh. Thanks Nickolay!

Bugfixes

See all changes since v4.2.0

v4.2.0 (2010-12-08)

This is a feature release, adding new features while maintaining compatibility with existing 4.x voices.

This release marks the final results of work on MARY TTS in the SEMAINE project, where our main focus for TTS has been on building expressive British English voices with listener vocalization capabilities.

New features for synthesis of expressive vocalizations

  • Improved support for expressive vocalizations in the British English unit selection voices dfki-poppy, dfki-prudence, dfki-spike and dfki-obadiah. Signal modification is now used to combine intonation contours and segmental forms, as described in Pammi et al. (2010).
  • Added vocalization support to the HMM-based versions of these voices: dfki-poppy-hsmm, dfki-prudence-hsmm, dfki-spike-hsmm and dfki-obadiah-hsmm.

For details on the new MaryXML <vocalization> tag and usage examples, see VocalizationSynthesis.

New voice

  • For German, we created bits1-hsmm, a female HMM-based voice built from recordings provided by BITS.

Bugfixes

See all changes since v4.1.1

v4.1.1 (2010-09-09)

This is a bugfix release.

Bugfixes

New voices

  • Added Telugu HMM-based voice "cmu-nk-hsmm".

More info

For more background information, please also refer to the Release notes of MARY 4.1.0.

See all changes since v4.1.0

v4.1.0 (2010-09-01)

This is a feature release, adding new features while maintaining compatibility with existing 4.0 voices.

New features for expressive TTS

  • Prosody control for HMM-based voices. Using the SSML <prosody> tag, it is now possible to control the intonation generated for HMM-based voices from markup. Of particular interest is the "contour" attribute, which allows you to change the shape of the intonation curve. For details and examples, see ProsodySpecificationSupport.
  • Expressive vocalizations in certain unit selection voices. MaryXML now supports a new <vocalization> tag, with which you can request the generation of non-verbal or para-verbal expressions as they are often produced by the listener in a conversation, such as "yeah", "m-hm", laughter, sigh, etc. For details and examples, see VocalizationSynthesis.

New voices

  • For British English, we release HMM-based versions of the four voices Poppy, Spike, Obadiah and Prudence.
  • Updated versions of many of the previously existing voices, with improved quality.

Bugfixes and other improvements

More info

For more background information, please also refer to the Release notes of MARY 4.0.0.

See all changes since v4.0.0

v4.0.0 (2009-12-18)

What's new

This is the first stable release of the new 4.0 code. MARY 4.0 is a major cleanup over previous versions of MARY TTS.

Changes since MARY 3.6

  • 100% Pure Java. All native libraries have been removed from the system. MARY should now run on any platform that has Java 1.5 or newer.
  • Fully open source. All code is now open source under the LGPL, including German TTS. Voices are distributed under Creative Commons or BSD licenses.
  • Many languages and voices. In addition to several high-quality German and US English voices, this release adds four expressive British English voices built for the SEMAINE project, a Turkish and a Telugu voice. Also, we have made it easy to add more languages and voices in the future.
  • New component installer. New languages and voices can be installed using a new component installer tool. If new languages and/or voices are made available for download, they can be installed without having to re-install the full system.

MARY now comes with a toolkit for people who want to add support for new languages or build their own voices. There is detailed documentation for

Changes since MARY 4.0 beta

  • New British English expressive voices. We are releasing the voices of four expressive characters built for the SEMAINE project.
  • Improved Installer. You can now install downloaded components without requiring an internet connection at install time. The Installer handles updates of language and voice components correctly.
  • Improved German pronounciation. Several thousand words were added to the pronounciation lexicon, bringing the total number of German transcriptions to over 26,000. Some bugs were fixed in the transcription of unknown words, making the synthesis of German more reliable.
  • Added MBROLA voices. There are still many people who use MARY with MBROLA diphone voices because of the control over prosody that they provide. Therefore we have added MBROLA voices to this release. They can be installed like the other voices through the MARY component installer.
  • Voice creation tools were simplified. The handling of external programs needed, in particular, for the creation of HMM-based voices was simplified. We provide a script now that you can use to find or, if necessary, download and compile third-party software required for training HMM-based voices.

More details on individual issues addressed can be seen in the list of tickets associated with this release and in the list of tickets associated with the 4.0 beta release.

Tested environments

The MARY client and server code was tested on:

  • Mac OS X (Intel) 10.5.8 with java 1.6.0_07
  • Ubuntu Linux 8.10 and 9.04 with sun java 1.5, 1.6, and openjdk-6
  • Windows XP and Vista

The web interface at http://localhost:59125 should work with any recent browser that supports AJAX. We have obtained best results with:

  • Firefox 3.5 with built-in audio support
  • Firefox 3.0, Internet Explorer 6 and 7, Safari 4, with Quicktime plugin

Known issues

  • On Ubuntu Linux with sun java 1.5 and 1.6, we observed a problem with audio playback in the MARY client: the final section is cut off. With openjdk-6, it works ok.
  • On Ubuntu Linux 8.10 with the builtin Firefox 3.0, the audio plugin doesn't work properly.
  • Web interface in the browser Chrome suboptimal: Chrome does not seem to handle the HTML 5 <audio> tag correctly. It claims it can handle it, but then doesn't play audio. Click on "Save audio file" to get the raw audio data.

Bug reports

Thanks to user feedback, we have fixed a number of bugs found in the beta release. We are therefore confident that the system is ready for production use. Nevertheless it is likely that new bugs will appear in new circumstances.

If you think you have identified a bug, proceed as follows:

  1. Check in the list of known bugs whether the bug has been reported already;
  2. If you cannot find it, prepare your request to the mailing list. Try to find out exactly:
    • what to do to reproduce the error;
    • expected behaviour;
  3. Then send an informative email as described in the FAQ.

Contributions

People interested in adding support for a language can get in touch, e.g. via the MARY developers mailing list.

See all changes since v4.0-beta

v4.0-beta (2009-09-24)

What's new

This is a first beta release of the new 4.0 code. MARY 4.0 is a major cleanup over previous versions of MARY TTS.

  • 100% Pure Java. All native libraries have been removed from the system. MARY should now run on any platform that has Java 1.5 or newer.
  • Fully open source. All code is now open source under the LGPL, including German TTS. Voices are distributed under Creative Commons or BSD licenses.
  • New languages. We have added Turkish and Telugu synthesis for now, and have made it easy to add more languages in the future.

MARY now comes with a toolkit for people who want to add support for new languages or build their own voices. There is detailed documentation for

More details on individual issues addressed can be seen in the list of tickets associated with this release.

Tested environments

The MARY client and server code was tested on:

  • Mac OS X (Intel) 10.5.8 with java 1.6.0_07
  • Ubuntu Linux 8.10 and 9.04 with sun java 1.5, 1.6, and openjdk-6
  • Windows XP and Vista

The web interface at http://localhost:59125 should work with any recent browser that supports AJAX. We have obtained best results with:

  • Firefox 3.5 with built-in audio support
  • Firefox 3.0, Internet Explorer 6 and 7, Safari 4, with Quicktime plugin

Known issues

  • On Ubuntu Linux with sun java 1.5 and 1.6, we observed a problem with audio playback in the MARY client: the final section is cut off. With openjdk-6, it works ok.
  • On Ubuntu Linux 8.10 with the builtin Firefox 3.0, the audio plugin doesn't work properly.
  • Web interface in the browser Chrome suboptimal: Chrome does not seem to handle the HTML 5 <audio> tag correctly. It claims it can handle it, but then doesn't play audio. Click on "Save audio file" to get the raw audio data.

Bug reports

This is beta software. It is not yet ready for production use. You are likely to find problems, and we would appreciate if you tell us about it.

If you think you have identified a bug, proceed as follows:

  1. Check in the list of known bugs whether the bug has been reported already;
  2. If you cannot find it, create a new ticket containing the following information:
    • operating system and java version used;
    • what to do to reproduce the error;
    • expected behaviour;
    • detailed log files, i.e. an excerpt of MARY TTS/log/server.log containing the error episode.
  3. Discuss the problem on the MARY user mailing list.

Contributions

People interested in adding support for a language can get in touch, e.g. via the MARY developers mailing list.

See all changes since v3.6.0

v3.6.0 (2008-05-21)

This is a minor feature release, providing one important new feature and several bugfixes over previous MARY 3.5.0.

New feature:

Bugfixes:

  • Important for many will be that the "self-healing" capabilities of the MARY installation are now working again: if you install an English system but a German voice, the system detects a misconfiguration and offers to download and install the missing components, and will start up normally after that.
  • the full list of bugs fixed can be found on http://mary.opendfki.de/query?milestone=3.6

See all changes since v3.5.0

v3.5.0 (2007-12-07)

This release provides substantial new features while maintaining the API compatible to previous versions.

New features include:

  • installer slimmed down to just over 30 MB, including some voices;
  • separate voice installer tool, allowing you to download voices comfortably and with an integrity verification prior to installation;
  • new synthesis technology: Marcela Charfuelan ported the excellent HMM-based synthesis code from the HTS project (http://hts.sp.nitech.ac.jp/) to Java - several HMM-based voices already ship with MARY;
  • audio effects: Oytun Turk implemented a range of audio effects that can be applied to the MARY voices, and can be controlled through the MARYGUIClient;
  • voice creation toolkit: Sathish Chandra Pammi and Anna Hunecke have cleaned up and documented the tool we use for building new synthesis voices;
  • voice recording tool "Redstart": Mat Wilson has programmed a very nice GUI for recording new voices;
  • OGG Vorbis support: under Linux, the MARY server can now generate audio in OGG Vorbis format.

See all changes since v3.1.0

v3.1.0 (2007-08-17)

Ten months after the last stable release, a major milestone release is finally here: MARY 3.1.0.

Its main features are:

Thanks to those who have helped test the beta versions! All the problems that we have become aware of should be fixed in this release. For a reasonably complete list of issues addressed in this release, see http://mary.opendfki.de/milestone/3.1.0

Should you come across additional bugs, please post them to the mary-users mailing list (http://www.dfki.de/mailman/listinfo/mary-users).

Some background info:

The unit selection code released here has performed better-than-average in this year's Blizzard Challenge (http://www.festvox.org/blizzard/), showing that the system can be considered state-of-the-art. For details, see: http://festvox.org/blizzard/bc2007/blizzard_2007/full_papers/blz3_007.pdf

The German voices have been created from the BITS corpora - for details, see: http://www.dfki.de/dfkibib/publications/docs/schroeder_hunecke2007.pdf

See also the full list of bugs fixed

See all changes since v3.1.beta2

v3.1-beta2 (2007-07-15)

Highlights:

  • Four German unit selection voices created from recordings in the BITS project;
  • added acoustic models, which should also improve the English unit selection voices;
  • added support for 64-bit linux and Intel Mac architectures.

On our to-do list are still many things, including:

Known issues with this release:

  • diphone voices are basically broken. Don't install this version if you want to use diphone voices.
  • only very limited testing on various platforms. Please report any errors you may find!

See all changes since v3.1-beta1

v3.1-beta1 (2006-12-13)

First beta release of the new unit selection code. Try the new slt-arctic, bdl-arctic and jmk-arctic voices!

For some details of what has been done and what still needs to be done for the stable release, see the full list of issues on the development portal.

See all changes since v3.0.3

v3.0.3 (2006-10-27)

Third bugfix release. Most relevant changes:

  • fixed standalone mode for running MARY (ticket 80)
  • improved control over prosody using ToBI tags (tickets 59, 60)
  • Several minor improvements of German synthesis (tickets 44, 49, 57, 78)
  • MBROLA to AUDIO conversion fixed (tickets 54, 55)

An update is recommended only if you encountered one of these bugs.

See also the full list of bugs fixed

See all changes since v3.0.2

v3.0.2 (2006-07-04)

Second bugfix release. A number of bugs have been fixed, including:

  • WAV audio can now (really) be saved with proper audio headers (ticket 38)
  • several bugs related to failing validation of intermediate processing results (due to an outdated MaryXML Schema) were fixed (tickets 40, 41, 43)
  • Synthesizing APML was improved (tickets 51, 52)
  • Pronunciation for German compounds was improved (ticket 36)

An update is recommended if you encountered one of these bugs.

See also the full list of bugs fixed

See all changes since v3.0.1

v3.0.1 (2006-03-07)

First bugfix release. A number of bugs have been fixed which have occurred after the initial release. Most relevant:

  • Audio can now be saved with proper audio headers (ticket 32)
  • Mary GUI client is now accessible via the keyboard (ticket 3)

An update is recommended.

See also the full list of bugs fixed

See all changes since v3.0.0

v3.0.0 (2006-02-14)

This is the first open-source release. We have put in a lot of effort to test and debug it, but of course the system may have some teething problems. Please help us improve the system by submitting bug reports on the OpenMary development page.