«Arabic OCR»: Difference between revisions

From the Arabeyes Wiki
(Software)
(Software)
Line 26:
 '''FOSS''' "no Arabic support"
-*[[Tesseract (software)|Tesseract]] is an open source OCR, initially developed by [[HP]], and released under the [[Apache License]], Version 2.0. It can be compiled using MSVC 6.0 or GCC.
+*[http://sourceforge.net/projects/tesseract-ocr Tesseract] is an open source OCR, initially developed by HP, and released under the Apache License.
-*[http://oocr.sourceforge.net OOCR] OOCR is an OCR program still in development, under the [[GPL]].
+*[http://oocr.sourceforge.net OOCR] OOCR is an OCR program still in development, under the GPL.
-*[[GOCR]] - included in [[Debian]] and other distributions.
+*[http://jocr.sourceforge.net/ GOCR] - included in Debian and other distributions.
 *[http://www.gnu.org/software/ocrad/ocrad.html GNU Ocrad] "is an OCR [...] program based on a feature extraction method".

Revision as of 16:00, 20 January 2007

Optical Character Recognition

OCR is the process of taking a scanned document (or a PDF file) and running a recognition program on it to generate, based on optical recognition and approximation, an editable text file. For an introduction to OCR, see http://www.students.cs.uu.nl/people/mjkammer/Work/intro_2_OCR.html
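
As a rough illustration of the scan-to-text workflow described above, the following Python sketch shells out to the tesseract command-line tool (one of the FOSS programs listed further down). It is a minimal sketch under assumptions, not a recommended setup: it assumes tesseract is installed and on the PATH, the helper names build_ocr_command and ocr_to_text and the file names are made up for this example, and it uses whatever language tesseract defaults to, so it says nothing about Arabic support.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(image_path, output_base, lang=None):
    """Build the argument list for the tesseract command-line tool.

    The basic invocation is `tesseract IMAGE OUTPUTBASE [-l LANG]`;
    tesseract then writes the recognized text to OUTPUTBASE.txt.
    (Hypothetical helper, named for this example only.)
    """
    cmd = ["tesseract", str(image_path), str(output_base)]
    if lang is not None:
        cmd += ["-l", lang]
    return cmd


def ocr_to_text(image_path, output_base="ocr_output", lang=None):
    """Run OCR on a scanned image and return the recognized text.

    Assumes the tesseract executable is installed; raises otherwise.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_ocr_command(image_path, output_base, lang), check=True)
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

With a scanned page saved as scan.png (a placeholder name), calling ocr_to_text("scan.png") would return the recognized text as an editable string. The quality of that text depends entirely on the engine and on the language data it has available.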

Current Status of Arabic OCR software

I (MuhammadAlkarouri) know of no actually working Arabic OCR software that is open source. Any additions are certainly welcome.

Resources

Arabic OCR Links

Papers

Software

  • Readiris - Supports Arabic and Persian
  • NovoDynamics VERUS - Focuses on high-performance OCR and image enhancement for Arabic-based scripts, including Arabic, Persian, Pashto, and Urdu.

FOSS "no Arabic support"

  • Tesseract is an open source OCR, initially developed by HP, and released under the Apache License.
  • OOCR is an OCR program still in development, released under the GPL.
  • GOCR - included in Debian and other distributions.
  • GNU Ocrad "is an OCR [...] program based on a feature extraction method".

Other Links