Friday, August 15, 2008

About my engineering project : Music Composing software

Hey friends, would like to tell you all about my B.E. computer's final year project. We had selected a very interesting topic "Music Composing Software".

Proposed model : We were to build a software, which will take any poem as the input, will analyze it, will map it with a dynamically generated tune and then sing it for the user. Basically will convert a poem into a song.

You may have a good collection of various kinds of music of your choice, but the problem is that no matter how big your collection is, it will always be finite. So you may have to repeatedly listen some of the recordings. Once you listen to a particular recording for too many times it no longer can gives you the refreshment, which a new melody can give. That means the key for obtaining the refreshment from the music is that there should always be something “new” every time you listen. So our basic concern is to get this “new” aspect into picture.

We all know that the music composer is the person who has the talent of assigning new tunes to the literatures like poems, songs etc. So he is the person who can bring into reality our requirement of new music. But is it really possible to have our own music composer at our home and who can build and assign new tunes to the material we want? Sounds impossible? But it’s not!! Friends we are here with the idea of the software which can serve you like a real music composer. All you have to do is to get any song you like and feed it to the software and it will build various tunes for your song (That too without asking for any payment!!!).Now once you install this software you will always get to listen new tunes and compositions even for the same song.

In the project, to convert written text to human voice we have used text to speech converter.
A small introduction of it is as follows

TEXT TO VOICE CONVERTER

A major component of the software is the module of Text To Speech converter.

The text to voice converter is an important module in this software. This module dose the function of converting the composed text file to the human voice. The screen shot of the GUI is as follows


· A text-to-speech software engine of unparalleled quality and flexibility.

· Natural-sounding voice output to enhance any application, product, or service.

· Prompt generation for dynamic content.

· Multilingual and multiple voice font selection in one engine.

· Multiple standard APIs supported.

· Low cost of ownership through high channel density.

· Optional voice icons (customer exclusive) for voice branding.

TTS is implemented entirely in software and only standard audio capability is required. At present, it contains several components, each of which handles a different task. For example, the text analysis capabilities of the system detect the ends of sentences, perform some rudimentary syntactic analysis, expand digit sequences into words, and disambiguate and expand abbreviations into normally spelled words which can then be analyzed by the dictionary-based pronunciation module. The following sentences illustrate the system's text-analysis capabilities.

The pronunciation module provides pronunciations for most ordinary words, and morphological derivatives thereof, as well as proper names; default strategies exist for pronouncing words not recognized by the dictionary-based methods. Other components handle prosodic phrasing, word accentuation, sentence intonation, and the actual speech synthesis. We believe that the word pronunciation and intelligibility of our American English TTS system are the best available. However, we are continuously working to improve its naturalness. We are also expanding the set of Languages that TTS can support, such as German, Chinese (Mandarin and Taiwanese), Russian, French, Romanian, Italian, Spanish (Latin American and Castilian), Japanese, and Navajo.

Learning is actually the acquisition and storage of information. All didactic schemes with their texts, tables and study aids are designed to make this process as effective as possible. What is the connection between information storage and sounds? The events we remember best of all are usually concerned with a phrase or melody. And each time you hear it (a certain phrase or melody) you clearly imagine an event of your life. That is why we suppose that there are many reasons to provide your students also with an audio version of the CBT.

  • Students can choose the most comfortable way for them to perceive information.
  • Many learners will for sure appreciate the possibility to learn at any time. For example on the way to the university or to the office.
  • Audio version of learning materials is vitally essential for people with limited physical abilities.

As soon as you are sure that you need the information in the form of sound tracks you face the challenge of producing them. You can hire professional artists, rent a studio and get audio version of your content at a fabulous price. You can also try to do it using all of your available means which will most likely result in unsatisfactory results. If you designed your content with authoring tool LERSUS you can easily produce an audio version of all texts. The text-to-speech server will automatically generate sound tracks for LERSUS e-courses. Different languages and voices are available, you can also choose the output format: MP3 or WAV.



1 comment:

Steffi said...

Thanks for sharing, I will bookmark and be back again


Engineering Project