«Quran/Unicode»: الفرق بين المراجعتين

من ويكي عربآيز
اذهب إلى: تصفح، ابحث
(Iqlaab (إقلاب))
(Ikhfaa' (إخفاء))
سطر 41: سطر 41:
   
 
=== Ikhfaa' (إخفاء) ===
 
=== Ikhfaa' (إخفاء) ===
  +
Ikhfaa', as regard of Tanween, is represented by sequential Tanween. We propose a new control character, that, when follow Tanween, will trigger the requested variant of Tanween.
We propose a new code point that will function as a control character, that will trigger the variant Tanween, so, <damma><damma><ikhfaa'>=<sequential dammatan>. The sequential variants would have separate code points in the presentation forms block, so that systems with legacy font handling can easily handle that case.
 
   
  +
So:
  +
<pre>
  +
<damma><damma><ikhfaa'>=<sequential dammatan>
  +
</pre>
   
 
=== Iqlaab (إقلاب) ===
 
=== Iqlaab (إقلاب) ===

نسخة 19:46، 18 ديسمبر 2007

Currently, it is almost impossible to encode Quranic text correctly using unicode, which ended up with every one using his own, non-standard encoding to encode Quran. Here we summarize missed Unicode code points that are needed for Quran, with possible solutions.

Chairless Hamza

A'aadam.png

In Quranic Rasm, chairless Hamza is a non-disjoining character. This means, when it comes in between two joinable characters, it doesn't separate them. An example for the behavior of Quranic Hamza, is the word /a'aadam/ in Q2:31, 33, 34.

Small letters

Small Alef

Small Alefs fall under two categories, ones that replace existing letters and are placed over the letters that they replace, and ones that indicate missing Alef and are placed between two other letters.

Currently, Unicode only defines a superscript Alef, that is used for the first category of small Alefs. A proposed new small Alef, that has exactly the same behavior as the chailess Hamza proposed above, is needed.

Small Waw

Li yasuu'uw.png

Although, there is small spacing Waw in Unicode, there is a missing small non-spacing Waw in the word /li yasuu'uw/ (Q17:7). This Waw is similar to U+06E8 superscript noon and occur once in the Mushaf.

Small Seen

YabsuTu.png

The corrective small seen in the word /yabsuTu/ (Q2:245), this is different from superscript cantillation mark (السكتة), U+06DC.

Tanween (تنوين)

There are tow forms of Tanween in Quran; Tanween with Idhhaar and Tanween with Ikhfaa':

Idhhar (إظهار)

We don't propose a new code point for Idhar, as it is the regular for of Tanween. However, we propose declaring the sequence <mark> canonical equivalent to <double mark>, where <mark> is Fatha, Damma and Kasra and <double mark> is Fathatan, Dammatan and Kasratan respectively.

so:

<fatha><fatha> = <fathatan>
<damma><damma> = <dammatan>
<kasra><kasra> = <kasratan>

This is necessary for consistent representation of all variants of Tanween in Quran, see bellow.

Ikhfaa' (إخفاء)

Ikhfaa', as regard of Tanween, is represented by sequential Tanween. We propose a new control character, that, when follow Tanween, will trigger the requested variant of Tanween.

So:

<damma><damma><ikhfaa'>=<sequential dammatan>

Iqlaab (إقلاب)

Iqlaab is represented by small meem either above Noon, or replacing the second mark in Tanween We propose adding a control character that is represented as a small diacritical Meem when alone and when follow Tanween, replaces the lat mark with a small meem.

So:

<damma><damma><iqlaab>=<damma><small meem>