arabic, anism1a.gif (1984 bytes)arabic software solutions, anism1b1.gif (3409 bytes)arabic, anism1a2.gif (1972 bytes)

Arabic Software  Desktop Publishing  Translation   OCR  ASR  TTS  MultimediA


ASR | Categorization | Correction | Diacritization | DMS/ArabDox | Ibsar Reading Machine | IDRISI Search
Johaina | Keyword Extraction | KMS | MMMP | MT | NLP | OCR |
Services  | SET | Speech | Summarization | TTS

NLP.gif (555 bytes)


Natural Language Processing
Sakhr realized at a very early stage that the current rush towards globalization means - from an information perspective -  breaking the language barriers, i.e. the necessity to work multilingually with a high degree of linguistic transparency. 

 


In 1985 Sakhr Software Company launched a multi-phase, large-scale research project for automatic processing of written Arabic. The Company mobilized massive human, material and financial resources to achieve this ambitious goal, which took more than ten years to be accomplished.

Arabic Core Linguistic Engines
The Core Linguistic Engines address the following four language levels:
  1. Character Level:
    Arabic Optical Character Recognition (A-OCR):
    Since 1993, Sakhr has been developing the technology of automatic recognition of printed Arabic text known as OCR. Although Sakhr has been targeting the Arab market, Sakhr realized from the very beginning the importance of developing a bilingual OCR for Arabic/English text.
    The program is able to learn characters shapes, which can be used to improve the recognition accuracy to its maximum. No other Arabic OCR system has this unique feature of combining both recognition technologies. After establishing its leadership in the field of Arabic/English printed text recognition, Sakhr internationalized its OCR system by supporting more languages. Sakhr has released a Persian version of its product for an Iranian distributor. Special versions for Arabic script languages, such as Urdu and Jawi can be developed based on the same technology. In addition to the support of Arabic-like languages, Sakhr now supports all the 16 European languages.
  1. Word Level:
    Multi-Mode Morphological Processor (MMMP):
    Sakhr's MMMP is a morphological analyzer-synthesizer of Arabic. The analyzer identifies
    all possible stem forms of a word, i.e. extracting its basic form stripped from affixes.
    Unlike the English Stemmer, the MMMP analyzer does not stop at the stem level but proceeds to extract the root and the Morphological Pattern (MP) of the word.
    Decomposing Arabic words into their morphological primitives is a basic requirement for
    full text indexing, search, dictionary organization and look up, as well as for spelling and grammatical checking. Even more important, the MMMP enables deeper processing of Arabic at the syntax and semantic levels. The MMMP synthesizer works in a reverse mode to generate linguistically-correct final word forms. The synthesizer is a key tool for generating the required output in machine translation systems and other text generation applications, such as summarizers and style checkers.
  1. Sentence Level:
    a) Multi-Mode Syntactical Processor (MMSP):
    The Sakhr MMSP parses the Arabic sentence into its syntactical constituents: (verb, subject, object, adverbs, predicate, etc...). MMSP is driven by a formal grammar of Arabic with extensive linguistic and lexical coverage, it integrates a set of advanced deterministic and preferential parsing techniques. Its major power lies in its ability to resolve the inter-mixed ambiguities involved within non-vowelized Arabic text. This ability is a major factor for successful in-depth processing of text for translation, summarization, automatic understanding, and content analysis.

    b) Arabic Automatic Diacritizer (AAD)
    The AAD is a technology breakthrough, which solved the basic problem of handling the unvowelized Arabic text automatically. It is an intelligent processor based on the MMMS. It simulates the mental process exercised by Arabic native speakers in interpreting undiacritized text and substituting missing vowels. The Automatic Diacritizer provides different options for diacritization: full, mandatory, or case ending diacritics. The AAD is the entry point for rendering written Arabic text suitable for serious computation.

Continuous Text Level:
a) Arabic Text Fragmenter (ATF)

Sakhr's ATF automatically divides the continuous text into separate sentences. It is a basic front-end processor, which prepares narratives for sentence-based processors such as parsers and for machine translation.

b) Arabic Automatic Indexer (AAI)
Sakhr's AAI automatically examines the content of a document to identify key words and phrases. For the first time, the Arabic automatic indexer enables the creation of book indices with an ease never done before seen for Arabic books. AAI has different levels of indexing and has an HTML version for the Internet.
 

Please contact George N. Hallak
for more info about Sakhr Enterprise Solutions pricing models.

ASR | Categorization | Correction | Diacritization | DMS/ArabDox | Ibsar Reading Machine | IDRISI Search
Johaina | Keyword Extraction | KMS | MMMP | MT | NLP | OCR |
Services  | SET | Speech | Summarization | TTS



Home Page || AramediA Contact Info || Adobe Middle East (ME) ||
 
Arabic Fonts || Arabic Language Tutors || All Languages Tutors ||
Arabic Calligraphy || Children Software || Arabic NewsStand ||
Arabic Resources || American Sign Language (ASL) ||
Educational PC & Mac ||
Desktop Publishing DTP - PC & Mac ||
Dictionaries ||
Directions ||
Enterprise Solutions || Islamic Software ||
  Islamics' Software || Microsoft Arabic Software || Comic Books-DVD ||
Keyboard Stickers || Multilingual Keyboards || New Products ||
OCR || Machine Translation || Search Engines || Shopping Cart || Software Solutions || Universal Word || World Resources || Word Processors || World Languages || The AramediA Sales Policy ||
Software Search || aramediaStore.com ||
Amazon.com ||

AramediA

Arabic Machine Translation
Enterprise and Standalone

Join Our Newsletter

37 Adams Street, Braintree, MA 02184-1906
United States of America (USA)
Tel 1-781-849-0021 Fax 1-781-849-2922

 

animail2.gif (5769 bytes)

We Ship All Around the Globe

Copyright 1995 - 2014 - GnhBos Incorporated dba AramediA. All rights reserved.

 



 

 

 

 

 

 

 

 

 

 

 

 

 

 

Academic Success Diacritization Keyboard Stickers/Labels QuarkXPress XTensions
Adobe Middle Eastern (ME) Desktop Publishing (DTP) Keyword Extraction Software Services
Amazon.com Shopping Document Management System Knowledge Management System Software Utilities
American Sign Language (ASL) Enterprise Solutions Localization Services Speech, Integrated Technology
Arabic Fonts - Calligraphy Electronic Dictionaries Machine Translation (MT) Speech Recognition (ASR)
Arabic Language Tutors Ibsar Reading Machine MLS Easy Immersion Lab Summarization
Arabic Microsoft Vista IDRISI Search Morphological Analyzer (MMMP)

Text-to-Speech (TTS)

Automatic Speech Recognition International Keyboards Multilingual Dictionaries

Translation Services

Categorization Islamic Software (Harf) Multilingual Language Tutors

Transparent Language

Children's Programs Islamics' Software and More Multilingual OCR Utilities Mac / Windows
Correction Spell Check Johaina, Arabic News in English Natural Language Processing

Word Processors



english-arabic,arabic terminology,synonym,arabic terminologies,word-hoard,
english-arabic translator, english-arabic translator,translation services,acronyms,phrase, spell,Ajeeb,arabic translators,translation,word,french-arabic,phraseology,synonym, islam,spell,translation services,thesaurus, language,arabic terminology,dictionary, meaning,arabic dictionary,lexicon,arabic translator,domain, language,arabic translator,synonym,arabic dictionary,Arabic lexicography,vocabulary,lexicon,

Dictionary software, multimedia, Bidirectional, English Arabic English, English Dictionaries, Arabic Dictionary, covers many languages dictionary and applications, please call us for more information at 1-781-849-0021:

Word processing for the following languages is also available
 www.aramedia.com/uniword.htm: European, Arabic, Hebrew, Cyrillic, Asian and Indian Languages, Albanian, Arabic (includes spell checker), Aramaic, Armenian, Azeri-Arabic, Azeri-Cyrillic, Azeri-Turkish, Bengali, Bohemian(Czech), Bulgarian, Burmese, Byelorussian, Croatian, Danish, Dutch, English, Esperanto, Farsi, French, Finnish, Georgian, German, Greek/Modern, Greek/Classical, Gujarati, Hebrew, Hindi, Hungarian, Icelandic, International Phonetic Alphabet, Inuktitut, Italian, Kannada, Khmer, Ladino, Lao, Latin, Latvian, Lihyanite, Lithuanian, Macedonian, Malayalam, Malay-Jawi, Marathi, Moabite, Mongolian, Nabataean, Nepali, Norwegian, Oriya, Oromo, Pashto, Polish, Portuguese, Punjabi, Rumanian, Russian, Safitic,Sami, Sanskrit, Serbian, Sinhalese, Slovak, Slovenian, South Arabian, Spanish, Swedish, Swiss, Syriac-Eastern, Syriac-Estrangelo, Tagalog, Tamil, Telugu, Thai, Talmudic, Tibetan, Tigrinya, Tigre, Transliteration, Turkish, UK-English, Ugaritic, Ukrainian, Urdu, Vietnamese, Welsh, Wendish Lusatian, sorbian , Yiddish.

 

 

Learn, read, and write these languages, www.aramedia.com/uniform2000.htm:

 

 

 

 

 

 

 

 

 

Sakhr Islamic Software, Sakhr Arabic software, Learn Arabic, Arabic for beginners
Arabic language, software localization, software localization, translation, Arabic
translation, multimedia, educational programs, Arabic, Islam, Moslem, Islamic,
Hebrew, Farsi, Persian, Persia, Iran, Iranian