«Vocal research»: الفرق بين المراجعتين

من ويكي عربآيز
اذهب إلى: تصفح، ابحث
 
 
(11 مراجعة متوسطة بواسطة 6 مستخدمين غير معروضة)
سطر 1: سطر 1:
I'll try to explain what we want in simple language:: in spite of my weakness in English.
 
   
  +
First, I would like to express my thanks to [http://www.icsi.berkeley.edu/~gelbart/ doctoral student David Gelbart]for the good advice he gave to me about the
The first concept is that we are designing encoding format for the whole Arabic word,
 
  +
phonological studies. Currently, I am still searching for free details of [http://www.chomsky.info/ Naom Chomsky Research]
which we want it to reflect the deviation structure in Arabic.
 
now we are not discussing how would we convert existent text data into this semantic format.
 
   
  +
==The concept==
we need to represent the letters in this format.
 
  +
*The main concept is an encoding format for the whole Arab word, which should reflect the morphological structure in Arabic. We are focusing on this before we discuss implementing a system to convert existing text data into this semantic format.
the simple way is to use 5 bit to represent every letter "the space is 32 letters"
 
in such method we can use the alphabetical order or another order.
 
but the whole space for three letters is 2^^15=32768 it's to huge comparative with 5->6000 triple roots in Arabic!!
 
   
  +
*We need to implement a new format for letter representation. The simple way is to use a 5-bit representation every letter, so the letters can be ordered alphabetically or in another convenient fashion. Unfortunately, the space needed for three letters is: 2^15=32768. This is too large considering that 5->6000 triple roots in Arabic (which is the space calculated by adding al-harakat short vowels to the 28 consonants).
the best solution is to use the Arabic vocal preventions:
 
  +
such as in Arabic kkaf ق and jeem ج can't be together...etc
 
  +
*The best solution is to use the Arabic phonemes, meaning that certain patterns and grouping of letters are possible in Arabic, while others cannot. The letters jeem and qaaf, for example, cannot be grouped together. Moreover, such considerations mean that we must deal with letter substitutions as well.
and also we need to standarise the substitution of the letter, Addition...etc
 
   
those rules have been studied in Arabic grammar but as an existed situation not why, where and how.
+
*Arabic grammar focuses on the existence of certain letters together, but now the reasons this happens (i.e.: why, where, how, etc.).
  +
The branch of linguistics that deals with rules and examines these reasons is called [http://en.wikipedia.org/wiki/Phonology phonology].
   
now I think if we represent the letter initiation from its pronunciation characteristics we can discover and make this rules standards.
+
*Using the principles of phonology, I think we can apply these rules and make standards.
the idea is to represent the letter as a set of vocal characteristics describe the human phoneme.
+
The idea is to use phonological analysis to represent the letter as a set of vocal characteristics, which leads to all the possible phonemes in a language.
  +
([http://en.wikipedia.org/wiki/Tajweed Tajweed] touches on this idea: a letter either has such characteristics or it does not.)
as I know in TAJWEED those are bit characteristics: means the letter has the characteristic or has not.
 
then we can make digital algorithms and design the format including those rules,
+
Once we understand these phonological rules, we can make algorithms and design the format to include them. Hopefully, we can
also discover unknown, unstandarised or complex rules.
+
also discover unknown, nonstandard rules.
   
to reach the final format which reflects the deviation and vocal Arabic structure into digital relational architect.
+
*If we incorporate all of this, we should reach the final format which reflects the morphological and phonological structure of Arabic in digital relational architecture.
   
  +
==Background Information==
What we need is a guide to this studies without needs to sink into IPA and the voice synthesis researches.
 
  +
*In 1968, Noam Chomsky and Morris Halle published The Sound Pattern of English (SPE), the basis for Generative Phonology. In this view,
  +
phonological representations (surface forms) are structures whose
  +
phonetic part is a sequence of phonemes which are made up of
  +
distinctive features. ... The features describe aspects of
  +
articulation and perception, are from a universally fixed set, and
  +
have the binary values + or -. Ordered phonological rules govern how
  +
this phonological representation (also called underlying
  +
representation) is transformed into the actual pronunciation (also
  +
called surface form.)
  +
  +
*So Chomsky and Halle described phonemes as being made up of binary
  +
characteristics, which if I understand correctly is what you want to
  +
do also. Right?
  +
  +
*Trying looking for phonological studies of Arabic?
  +
  +
*It is not that easy to find phonology articles online. (^)
  +
I hope you will find a little. Maybe you'll need to use a university
  +
library, in the end. The sci.lang Usenet group
  +
(http://groups.google.com/group/sci.lang/ ) might be of interest to
  +
you also.
  +
  +
==Current Needs==
 
*What we need is a guide to these studies without deep knowledge of the International Phonetic Alphabet (IPA) and the voice synthesis research.
  +
*now I'm about to rebuild the Tarmeez bit-structure based on [http://www.mghamdi.com/ Mansour M. Alghamdi] which called [http://www.mghamdi.com/TabCon.htm Analysis, Synthesis and Perception of Voicing in Arabic]
  +
  +
*it's great to have any possible guidness or even another point of view

المراجعة الحالية بتاريخ 02:23، 26 يناير 2017

First, I would like to express my thanks to doctoral student David Gelbartfor the good advice he gave to me about the
phonological studies. Currently, I am still searching for free details of Naom Chomsky Research

The concept

  • The main concept is an encoding format for the whole Arab word, which should reflect the morphological structure in Arabic. We are focusing on this before we discuss implementing a system to convert existing text data into this semantic format.
  • We need to implement a new format for letter representation. The simple way is to use a 5-bit representation every letter, so the letters can be ordered alphabetically or in another convenient fashion. Unfortunately, the space needed for three letters is: 2^15=32768. This is too large considering that 5->6000 triple roots in Arabic (which is the space calculated by adding al-harakat short vowels to the 28 consonants).
  • The best solution is to use the Arabic phonemes, meaning that certain patterns and grouping of letters are possible in Arabic, while others cannot. The letters jeem and qaaf, for example, cannot be grouped together. Moreover, such considerations mean that we must deal with letter substitutions as well.
  • Arabic grammar focuses on the existence of certain letters together, but now the reasons this happens (i.e.: why, where, how, etc.).

The branch of linguistics that deals with rules and examines these reasons is called phonology.

  • Using the principles of phonology, I think we can apply these rules and make standards.

The idea is to use phonological analysis to represent the letter as a set of vocal characteristics, which leads to all the possible phonemes in a language. (Tajweed touches on this idea: a letter either has such characteristics or it does not.) Once we understand these phonological rules, we can make algorithms and design the format to include them. Hopefully, we can also discover unknown, nonstandard rules.

  • If we incorporate all of this, we should reach the final format which reflects the morphological and phonological structure of Arabic in digital relational architecture.

Background Information

  • In 1968, Noam Chomsky and Morris Halle published The Sound Pattern of English (SPE), the basis for Generative Phonology. In this view,

phonological representations (surface forms) are structures whose phonetic part is a sequence of phonemes which are made up of distinctive features. ... The features describe aspects of articulation and perception, are from a universally fixed set, and have the binary values + or -. Ordered phonological rules govern how this phonological representation (also called underlying representation) is transformed into the actual pronunciation (also called surface form.)

  • So Chomsky and Halle described phonemes as being made up of binary

characteristics, which if I understand correctly is what you want to do also. Right?

  • Trying looking for phonological studies of Arabic?
  • It is not that easy to find phonology articles online. (^)

I hope you will find a little. Maybe you'll need to use a university library, in the end. The sci.lang Usenet group (http://groups.google.com/group/sci.lang/ ) might be of interest to you also.

Current Needs

  • it's great to have any possible guidness or even another point of view