Quran/Unicode

من ويكي عربآيز
نسخة 07:35، 15 ديسمبر 2007 للمستخدم Hosny (ناقش | مساهمات) (Small letters)
اذهب إلى: تصفح، ابحث

Currently, it is almost impossible to encode Quranic text correctly using unicode, which ended up with every one using his own, non-standard encoding to encode Quran. Here we summarize missed Unicode code points that are needed for Quran, with possible solutions.

Chairless Hamza

A'aadam.png

In Quranic Rasm, chairless Hamza is a non-disjoining character. This means, when it comes in between two joinable characters, it doesn't separate them. An example for the behavior of Quranic Hamza, is the word /a'aadam/ in Q2:31, 33, 34.

Small letters

Small Waw

Li yasuu'uw.png

Although, there is small spacing Waw in Unicode, there is a missing small non-spacing Waw in the word /li yasuu'uw/ (Q17:7). This Waw is similar to U+06E8 superscript noon and occur once in the Mushaf.

Small Seen

YabsuTu.png

The corrective small seen in the word /yabsuTu/ (Q2:245), this is different from superscript cantillation mark.

Tanween (تنوين)

There are tow forms of Tanween in Quran; Tanween with Idhhaar and Tanween with Ikhfaa':

Idhhar (إظهار)

This is what is currently encoded as U+064B (Fathatan), U+064C (Dammatan) and U+064D (Kasratan) code points. A more cleaner way to encode them is to encode it as two successive diacritics, so <damma><damma>=<dammatan> and so. This doesn't require a new code point, but just mentioning in the standard that such successive glyphs are allowed.

Ikhfaa' (إخفاء)

We propose a new code point that will function as a control character, that will trigger the variant Tanween, so, <damma><damma><ikhfaa'>=<sequential dammatan>. The sequential variants would have separate code points in the presentation forms block, so that systems with legacy font handling can easily handle that case.


Iqlaab (إقلاب)

Iqlaab is represented by small meem replacing the 2nd mark in Tanween, we propose adding a control character that when follow Tanween trigger such behavior. So, <damma><damma><iqlaab>=<damma>.