«Quran/Unicode»: الفرق بين المراجعتين

من ويكي عربآيز
اذهب إلى: تصفح، ابحث
(Small letters)
ط
 
(14 مراجعة متوسطة بواسطة مستخدمين اثنين آخرين غير معروضة)
سطر 1: سطر 1:
  +
===ِAbstract===
<div class="english">
 
 
 
Currently, it is almost impossible to encode Quranic text correctly using unicode, which ended up with every one using his own, non-standard encoding to encode Quran. Here we summarize missed Unicode code points that are needed for Quran, with possible solutions.
 
Currently, it is almost impossible to encode Quranic text correctly using unicode, which ended up with every one using his own, non-standard encoding to encode Quran. Here we summarize missed Unicode code points that are needed for Quran, with possible solutions.
  +
  +
this statment is very old nowdays we have Quran encoded unicode by [http://fonts.qurancomplex.gov.sa/?cat=25 King Fahd intitute] and [http://tanzil.net/wiki/Tanzil_Project tanzil]
  +
 
== Tanween (تنوين) ==
  +
=== Sequential Tanween ===
  +
In Cairo 1924 Mushaf and its decendants, Sequential Tanween is used to indicate Idghaam of the Tanween, thus making distnict characters, not stylistic variants.
  +
  +
<gallery>
  +
Image:Sequential_tanween.png|Fathatan.
  +
Image:Sequential_dammatan.jpg|Dammatan.
  +
Image:Sequential_kasratan.jpg|Kasratan.
  +
</gallery>
  +
  +
=== Tanween with Small Meem ===
  +
In Cairo 1924 Mushaf and its decendants, the second mark in Tanween is replaced by a small Meem to indicate Iqlaab (convertion) of Tanween into Meem, thus making a new distnict character.
  +
  +
<gallery>
  +
Image:Iqlab_tanween.png|Fatha with Meem.
  +
Image:Damma_with_meem.jpg|Damma with Meem.
  +
Image:Kasra_with_meem.jpg|Kasra with Meem.
  +
</gallery>
   
 
== Chairless Hamza ==
 
== Chairless Hamza ==
سطر 10: سطر 30:
 
== Small letters ==
 
== Small letters ==
   
  +
Small letters in Quran can be divided into two categories. Some are diacritical, corrective small letters that are placed '''over''' base glyph. While others indicate missing letters and thus are placed in '''between''' base glyphs. Not all small Quranic letters are currently represented in Unicode.
== Small Alef ==
 
Small Alefs fall under two categories, ones that replace existing letters and are placed '''over''' the letters that they replace, and ones that indicate missing Alef and are placed '''between''' two other letters.
 
   
 
=== Small Alef ===
Currently, Unicode only defines a superscript Alef, that is used for the first category of small Alefs. A proposed new small Alef, that has exactly the same behavior as the chailess Hamza proposed above, is needed.
 
  +
[[image:Small alef isolated.png|thumb|right|Examples of the proposed small alef.]]
  +
  +
Alef has the two types, but only diacritical one is currently encoded. We propose a new small Alef, that is placed in between base glyphs and has exactly the same behavior as the chairless Hamza proposed above.
   
 
=== Small Waw ===
 
=== Small Waw ===
سطر 24: سطر 46:
   
 
The corrective small seen in the word /yabsuTu/ (Q2:245), this is different from superscript cantillation mark (السكتة), U+06DC.
 
The corrective small seen in the word /yabsuTu/ (Q2:245), this is different from superscript cantillation mark (السكتة), U+06DC.
 
== Tanween (تنوين) ==
 
There are tow forms of Tanween in Quran; Tanween with Idhhaar and Tanween with Ikhfaa':
 
 
=== Idhhar (إظهار) ===
 
This is what is currently encoded as U+064B (Fathatan), U+064C (Dammatan) and U+064D (Kasratan) code points. A more cleaner way to encode them is to encode it as two successive diacritics, so <damma><damma>=<dammatan> and so. This doesn't require a new code point, but just mentioning in the standard that such successive glyphs are allowed.
 
 
=== Ikhfaa' (إخفاء) ===
 
We propose a new code point that will function as a control character, that will trigger the variant Tanween, so, <damma><damma><ikhfaa'>=<sequential dammatan>. The sequential variants would have separate code points in the presentation forms block, so that systems with legacy font handling can easily handle that case.
 
 
 
== Iqlaab (إقلاب) ==
 
Iqlaab is represented by small meem replacing the 2nd mark in Tanween, we propose adding a control character that when follow Tanween trigger such behavior. So, <damma><damma><iqlaab>=<damma><small meem>.
 

المراجعة الحالية بتاريخ 02:21، 26 يناير 2017

ِAbstract

Currently, it is almost impossible to encode Quranic text correctly using unicode, which ended up with every one using his own, non-standard encoding to encode Quran. Here we summarize missed Unicode code points that are needed for Quran, with possible solutions.

this statment is very old nowdays we have Quran encoded unicode by King Fahd intitute and tanzil

Tanween (تنوين)

Sequential Tanween

In Cairo 1924 Mushaf and its decendants, Sequential Tanween is used to indicate Idghaam of the Tanween, thus making distnict characters, not stylistic variants.

Tanween with Small Meem

In Cairo 1924 Mushaf and its decendants, the second mark in Tanween is replaced by a small Meem to indicate Iqlaab (convertion) of Tanween into Meem, thus making a new distnict character.

Chairless Hamza

A'aadam.png

In Quranic Rasm, chairless Hamza is a non-disjoining character. This means, when it comes in between two joinable characters, it doesn't separate them. An example for the behavior of Quranic Hamza, is the word /a'aadam/ in Q2:31, 33, 34.

Small letters

Small letters in Quran can be divided into two categories. Some are diacritical, corrective small letters that are placed over base glyph. While others indicate missing letters and thus are placed in between base glyphs. Not all small Quranic letters are currently represented in Unicode.

Small Alef

Examples of the proposed small alef.

Alef has the two types, but only diacritical one is currently encoded. We propose a new small Alef, that is placed in between base glyphs and has exactly the same behavior as the chairless Hamza proposed above.

Small Waw

Li yasuu'uw.png

Although, there is small spacing Waw in Unicode, there is a missing small non-spacing Waw in the word /li yasuu'uw/ (Q17:7). This Waw is similar to U+06E8 superscript noon and occur once in the Mushaf.

Small Seen

YabsuTu.png

The corrective small seen in the word /yabsuTu/ (Q2:245), this is different from superscript cantillation mark (السكتة), U+06DC.