Text-to-Speech Synthesis by Diphones for Modern Standard Arabic

Hanna Abdel Nour; Nader Abu Ghattas

Text-to-Speech Synthesis by Diphones for Modern Standard Arabic

Files

text-speech-synthesis-diphones-modern-standard-arabic.pdf(160.69 KB)

Date

2005

Authors

Hanna Abdel Nour

Nader Abu Ghattas

Abstract

An unlimited vocabulary text-to-speech synthesis by diphones system is used to generate Modern Standard Arabic speech: the system is the PSOLA algorithm; the diphones are obtained from the permutation of 44 phones (phonemes and allophones). The diphonic combinations were introduced in carrier words and recorded by a selected speaker. A dictionary of diphones was established by means of a process of segmentation that abided by certain rules. Evaluation of the system was undertaken to assess the accuracy on word and sentence levels. The results showed high perception levels.
استُخدم نظام "تركيب الكلام من النص بواسطة ثنائيات الأصوات" لإنتاج كلام باللغة العربية الحديثة دون حدود للمفردات: النظام هو برمجيات PSOLA وحصلنا على ثنائيات الأصوات من تباديل 44 صوتا مختلفا. أدخلت هذه الأزواج على كلمات "ناقلة" وسجلت بصوت قارئ مختار، ومن ثم تم تجزئة هذه التسجيلات حسب قواعد محددة للحصول على "قاموس" من ثنائيات الأصوات. قُيِّم هذا النظام على مستويين: دقة الكلمة ودقة الجملة. دلت النتائج على درجة عالية من الوضوح.

URI

http://hdl.handle.net/20.500.11888/2044

Collections

An-Najah University Journal for Research - A (Natural Sciences)

Full item page