«Semantic Arabic Encoding and Format»: الفرق بين المراجعتين

من ويكي عربآيز
اذهب إلى: تصفح، ابحث
سطر 9: سطر 9:
 
Computers still deal with the [[Arabic]] language in comparison to other languages, especially English, in a limited way. It will deal with it on the textual level, with regards to form, storage, [[encoding]] problems,or their treatment or with respect to meaning which is included in by the text. Moreover, the Arabic language, in contrast to European languages, is a heavily [[morphological language]] (i.e. there is a rigid structure that links the structure of words in relation to their semantic and logical meaning).
 
Computers still deal with the [[Arabic]] language in comparison to other languages, especially English, in a limited way. It will deal with it on the textual level, with regards to form, storage, [[encoding]] problems,or their treatment or with respect to meaning which is included in by the text. Moreover, the Arabic language, in contrast to European languages, is a heavily [[morphological language]] (i.e. there is a rigid structure that links the structure of words in relation to their semantic and logical meaning).
   
This should justify striving for the understanding of different mechanisms of the structure of the Arabic language from which it benefits and ending the pressing need for the treatment of form reduction and meaning for Arabic.
+
This should justify striving for the understanding of different mechanisms of the structure of the Arabic language from which it benefits and ending the pressing need for treating unstructured text and dealing with meaning in Arabic.
 
 
 
==Project Objectives==
 
==Project Objectives==
سطر 17: سطر 17:
 
* The development of a system to include the meaning and the [[sentence structure]].
 
* The development of a system to include the meaning and the [[sentence structure]].
 
* The generation of an non-itemized Arabic [[morphological lexicon]] which will be the basis for a complete Arabic lexicon.
 
* The generation of an non-itemized Arabic [[morphological lexicon]] which will be the basis for a complete Arabic lexicon.
* Easing [[searching and categorizing]] whether on the morphological or semantic level.
+
* Easing [[searching and categorization]] whether on the morphological or semantic level.
 
* The possibility for linking the encoding with components to form a [[spell checker]] and then a [[grammar checker]] (which is the first system we really wanted to develop, an interactive spell checker for simplifying writing in Modern Standard Arabic (Fusha), an effective dictionary for words and technical terms, etc.).
 
* The possibility for linking the encoding with components to form a [[spell checker]] and then a [[grammar checker]] (which is the first system we really wanted to develop, an interactive spell checker for simplifying writing in Modern Standard Arabic (Fusha), an effective dictionary for words and technical terms, etc.).
  +
* The ability for the system to simplify dealing with Modern Standard Arabic (Fusha) and its writing accurately (to increase its use) and store texts to structure [[unstructured texts]], then find mechanisms for [[moving Arabic texts to a structured and correlated format]]. (Currently, less than 1% of Arabic texts are structured.)
 
  +
* A way to distinguish non-Arabic words (i.e. not derived from Arabic or are not subject to Arabic structure) and putting it through a framework that must exist.
 
==The Current Idea for the Project==
 
==The Current Idea for the Project==
   

نسخة 17:58، 16 مارس 2007

Arabic.png

هذه الصفحة موجودة بالعربية تحت عنوان الترميز المعنوي للغة العربية

Semantic Arabic Encoding and Format

This information is intended to be a permanent exchange during the progression of the project. Any suggestions or contributions are welcome.

First glance

Computers still deal with the Arabic language in comparison to other languages, especially English, in a limited way. It will deal with it on the textual level, with regards to form, storage, encoding problems,or their treatment or with respect to meaning which is included in by the text. Moreover, the Arabic language, in contrast to European languages, is a heavily morphological language (i.e. there is a rigid structure that links the structure of words in relation to their semantic and logical meaning).

This should justify striving for the understanding of different mechanisms of the structure of the Arabic language from which it benefits and ending the pressing need for treating unstructured text and dealing with meaning in Arabic.

Project Objectives

The Arabic Semantic Encoding project aims to writing mechanisms for storing Arabic texts (words) including its structure and derivational relationships in a way that reflects the most of semantic correlation possible. These objectives include:

  • The development of a system to include the meaning and the sentence structure.
  • The generation of an non-itemized Arabic morphological lexicon which will be the basis for a complete Arabic lexicon.
  • Easing searching and categorization whether on the morphological or semantic level.
  • The possibility for linking the encoding with components to form a spell checker and then a grammar checker (which is the first system we really wanted to develop, an interactive spell checker for simplifying writing in Modern Standard Arabic (Fusha), an effective dictionary for words and technical terms, etc.).
  • The ability for the system to simplify dealing with Modern Standard Arabic (Fusha) and its writing accurately (to increase its use) and store texts to structure unstructured texts, then find mechanisms for moving Arabic texts to a structured and correlated format. (Currently, less than 1% of Arabic texts are structured.)
  • A way to distinguish non-Arabic words (i.e. not derived from Arabic or are not subject to Arabic structure) and putting it through a framework that must exist.

The Current Idea for the Project

Project Progress

What We Need