arabic, anism1a2.gif (1972 bytes) arabic software solutions, anism1b1.gif (3409 bytes) arabic, anism1a2.gif (1972 bytes)

Arabic & Multilingual International Software  Desktop Publishing  Machine Translation  Document Management  NLP   OCR  ASR  TTS  MultimediA


Apptek MediaSphere,capture, video, metadata, translate, translation, translator,chat Arabic, Chinese, French, German, Italian, Japanese, Korean, Persian, Portuguese, Spanish, Turkish, Pashto, Russian

MediaSphere™

MediaSphere is a software solution providing multilingual transcripts of various TV and Radio stations for many domestic and international news bureaus as well as transcripts for conversational telephony.

MediaSphere offers a multilingual solution for video, audio and telephony for text processing with advanced linguistic capabilities such as Machine Translation of transcribed text, Information retrieval with query translation and automated Name Finding and Detection.

MediaSphere is a state of the art solution to facilitate the process of generating transcription from TV transmissions and telephony, translating transcribed text into and from English and deliver rich media content online.

The multilingual MediaSphere solution utilizes video and audio logging technologies, telephony platforms as well as Automatic Speech Recognition integrated with Machine Translation, Information Retrieval and Name Finding.


The technical solution automatically captures and indexes television, video and audio content in near real-time making it fully searchable and accessible. Once audio is captured and indexed, Knowledge Management tools will facilitate to the user intelligent content search capabilities.

Audio-Media Mining

MediaSphere Software Components
1. Automatic Speech Recognition (ASR) and Audio Mining.

AramediA will provide ASR for Modern Standard Arabic (MSA) that is speaker independent The system supports MSA, nevertheless, AppTek ASR is designed in order to collect, transcribe, and train using selected dialects. The system can support multiple local dialects and hence improve the accuracy performance.

Spoken Language: the Natural Dialect

MSA is mostly broadcast; is contrived and rarely used Spontaneous speech is almost never MSA PlainSpeech has dialect adapted modules

2. Knowledge Management System.

In the area of Knowledge Management, AppTek offers several software modules under the collective name PlainKnowledge, which have already been integrated into third-party products or have also been implemented as SDK for end customers. The individual modules are PlainCluster (grouping), PlainClassify (classification), PlainSummarize (summarization), PlainExtract (content-specific extraction), PlainLingua (language recognition of a text) and PlainRetrieve (associative search).

3. Machine Translation (MT) Technology.

AppTek provides Machine Translation (MT) application for more than twenty (20) language pairs. The system runs on a standalone or as a web-based dynamic translation of online content with full integration with Automatic Speech Recognition and Text-to-Speech software, enabling future development of speech-to-speech Machine Translation. The suite of linguistic tools includes proper-noun recognition, morphological analysis, and large annotated lexicons.

4. Digital Assets Management (DAM)

The digital asset management system is comprised of multiple agents that provide access to various resources. These resources are collected, stored and retrieved via the Digital Asset Management Network. Network refers to the interconnection and collaboration of the various agents and not a computer-based network. The entire network can be contained within a single computer, but this may not be practical given the resources required to perform real-time ingest and analysis. In some cases, agents may exist on the same computer to provide less-latent access to published data. AppTek’s human language technology and DAM has been integrated in the UIMA environment, an open architecture for the management of unstructured information, such as natural language text (e.g. news articles), audio, and video. This management includes analysis, search, storage, information retrieval and extraction, data mining, etc. A very simplified description of the responsibility of UIMA is that UIMA takes unstructured information, analyzes it and produces structured information such as databases or indexes.

6. NameFinder.

NameFinder™ is an advanced technology engine that is used to scan text for proper nouns (such as human names, organizations, geographic locations, currencies, parts etc.) in various languages--even in writing systems that do not use capitalization. NameFinder™ can also accurately identify the ethno-linguistic origins of a person's name. The system recognizes and transliterates between the Roman (Latin) alphabet and other writing systems, and can identify transliterated names despite discrepancies, ambiguities, or simple misspellings.

MediaSphere Architecture

The following diagram illustrates the hardware architecture:

Functionality

Television signal reception and distribution and Telephony Audio Monitoring capability

The solution provide the capability for 24x7 monitoring a foreign language or English Channel or telephony platform The monitoring capability will include:

a. 24x7 capture of all channel(s).
b. Near-real-time transcription with adjustable latency.
c. Near-real-time machine translation from supported languages and to English of transcribed text for the channels.
d. Speaker Identification: recognizes voices that match a set of trained speaker models, labeling speaker turns with names. Unspecified speakers are classified according to gender, and clustered together if there are multiple appearances.
e. An option Use of a standard web browser: MediaSphere News Monitoring users interact with the application using a standard browser-based interface. The interface is a complete, easy-to-use system.
f. Automatic highlighting of text to mark entities including people, places, organizations, and dates and times. The entity detection and highlighting takes place in two stages:

i. Near-real-time (within limited latency): The optional Name extraction module identifies names in the speech recognition output, and categorizes them into people, locations and organizations.

ii. Advanced: Post-processing of transcribed text.

Review streaming video with synchronized transcriptions and translations

a. Review Streaming Video: The system automatically builds a rich, structured index in real time that can be used to search video content and immediately locate a specific video segment for playback. The video index is time synchronized to every encoded copy, providing immediate, pinpoint retrieval of content. Words and phrases are automatically highlighted within a transcript, allowing you to monitor the Arabic (source) speech-to-text recognition and English translation synchronized with the video.

b. Review Video Segments: The Video Digitizing process transforms standard analog or digital video into web-friendly and broadcast-quality content. Video Digitizing manages multiple Digitizing processes while simultaneously indexing video. VideoCapture automatically detects visual scene changes, spoken words, and recognized voices. The resulting rich video index provides both editors and viewers with fine-grained control over content. The video index allows users to select video segments based on channel, start time, video frame or between live (10 minutes latency) or archived =videos.

c. Review Still Frames: VideoCapture uses signal analysis algorithms to generate key frames, which provide a visual overview of the content.

Search for specific spoken content in the video cache by Arabic- or English-language text query Full-text search in Arabic or English: Using Advanced Search, you can narrow your searches to include a specific broadcaster, program, language, asset type, date or range of dates, Boolean operators or keywords. You can save search queries and use them again and again. The search capability supports all languages managed by the system. The Linguistic-based search integrated with the system (TextFinder™) provides the following advanced capability:

I. A particular feature of this system is its multilingual Morphology Analyzer. The morphology analyzer is capable of analyzing and generating all inflectional paradigms for all classes of verbs, nouns and adjectives or any other part of speech. The module is also capable of reducing character strings to their dictionary look-up form (root or stem, as required). The analyzer processes stems as well as meaningful root derivational morphology.

II. Query Translation: TextFinder™ has bilingual dictionaries that allow users to search in multilingual text simultaneously. The user can type a search word or phrase in one language and retrieve the matching words with successive part of speech in other languages using the morphology analyzer as well as the large-size dynamic bilingual lexicons.

III. Proper Noun Search: A Proper Noun Extraction System is used within TextFinder™. The system can identify all proper nouns in multiple languages such as human names and geographic countries, cities, places, etc., utilizing the large names database developed by AppTek as well as surface parsing and statistics. The search can also be done using exact match and fuzzy match techniques. It allows for Name/Event searches Reference lists and real-time alerts Alerts from keyword-based Filters: MediaSphere monitors news broadcasts and alerts users of new content that is of interest to them. All incoming information is matched against users' saved queries, and alerts are triggered when the specified criteria in the saved queries matches the incoming information above a specified threshold. The use of contextual profiling and personalization ensures that subtle differences in context are taken into account before alerting a user of new information. It is crucial that any system that alerts a user of new content only does so when the content is relevant to the user and the system must allow for an arbitrary level of customization to have taken place. Embedded proper noun identification is open to user-specific keyword list creation, modification or addition.

Provide graphical temporal display of the occurrence of topics or keywords within one or all channels

The Video Digitizing technology includes the following features:
• Manages, monitors, and schedules video acquisition.
• Automatically builds a rich, structured index in real time that can be used to search video content and immediately locate a specific video segment for playback. The video index produced by VideoCapture is time synchronized to every encoded copy, providing immediate, pinpoint retrieval of content.
• Automatic alignment of text (e.g. script) with content
• Video clips can be played based on a time code associated with the selected key frame or text fragment.

Other Functionality

• Administrator Capabilities: The system will offer the following system capabilities:

o Web-based interface
o Index management
o Account management
o View management
o User and group management
o Security management
o Folder management
• Automatic Hyper-linking

MEDIASPHERE News & Telephony Monitoring automatically generates hyperlinks to relevant material in real time. The hyperlinks connect to contextually similar content and can be used to recommend related articles, documents, affinity products or services, as well as media content that relates to textual content. Because hyperlinks are automatically inserted when a document is returned, the hyperlinks can include references to archived material as well as the latest news or material on a particular subject.


Please, contact
George N. Hallak for Multi-License pricing and customized solutions

 

All AppTek Products are on GSA Advantage!®
| ChatSphere
| LocalSphere | MediaSphere | MemorySphere | NameFinder |
|
TranSphere | TranSphere Plug-in for Microsoft OfficeTextFinder | WebTrans |
|
 PlainBabel | PlainKnowledge | PlainKnowledge for Windream | PlainSpeech | PlainTranslate |
 


AramediA

Join Our Newsletter

61 Adams Street, Braintree, MA 02184-1906
United States of America (USA)
Tel 1-781-849-0021 Fax 1-781-849-2922

 

animail2.gif (5769 bytes)

 

We Ship All Around the Globe

Copyright © 1995 - 2008 - GnhBos, Incorporated. dba AramediA. All rights reserved.

 



 

 

 

 

 

 

 

 

 

 

 

 

 

 

Our Dictionaries, machine translation, translation memory,
and lexicons multilingual dictionary software cover a wide spectrum.
Call for more information: 1-781-849-0021

English Arabic, Arabic terminology, synonym, Arabic terminologies, word-hoard,english-arabic translator,
English Arabic translator, translation services, acronyms, phrase, spell, Ajeeb, Arabic translators, translation, word, French Arabic, phraseology, synonym, Islam, spell, translation services, thesaurus, language, Arabic terminology, dictionary, meaning, Arabic dictionary, lexicon, Arabic translator, domain, language, Arabic translator, synonym, Arabic dictionary, Arabic lexicography, vocabulary, lexicon,

Dictionary software, multimedia, Bidirectional, English Arabic English, English Dictionaries, Arabic Dictionary,
covers many languages dictionary and applications, please call us for more information at
1-781-849-0021:
Word processing for the following languages is also available: European, Arabic, Hebrew, Cyrillic, Asian and Indian Languages, Albanian, Arabic (includes spell checker), Aramaic, Armenian, Azeri-Arabic, Azeri-Cyrillic, Azeri-Turkish, Bengali, Bohemian(Czech), Bulgarian, Burmese, Byelorussian, Croatian, Danish, Dutch, English, Esperanto, Farsi, French, Finnish, Georgian, German, Greek/Modern, Greek/Classical, Gujarati, Hebrew, Hindi, Hungarian, Icelandic, International Phonetic Alphabet, Inuktitut, Italian, Kannada, Khmer, Ladino, Lao, Latin, Latvian, Lihyanite, Lithuanian, Macedonian, Malayalam, Malay-Jawi, Marathi, Moabite, Mongolian, Nabataean, Nepali, Norwegian, Oriya, Oromo, Pashto, Polish, Portuguese, Punjabi, Rumanian, Russian, Safitic,Sami,
Sanskrit, Serbian, Sinhalese, Slovak, Slovenian, South Arabian, Spanish, Swedish, Swiss, Syriac-Eastern, Syriac-Estrangelo, Tagalog, Tamil, Telugu, Thai, Talmudic, Tibetan, Tigrinya, Tigre, Transliteration, Turkish, UK-English, Ugaritic, Ukrainian, Urdu, Vietnamese, Welsh, Wendish
Lusatian, sorbian , Yiddish,

Arabic language, software localization, software localisation, translation, Arabic Sakhr Arabic software, Learn Arabic, Arabic for beginners, translation, multimedia, educational programs, arabic terminology,arabic terminologies, english-arabic,french-arabic, spanish-arabic,arabic meanings, acronyms,arabic translation, homonyms,jargon,vocabulary,lexicon,define, Arabic typing tutor, dictionary, scanning,translation software, arabic dictionary,Arabic, Islam, Moslem, Islamic, Hebrew, Farsi, Persian, Persia, Iran, Iranian,Arabic lexicography,lexicon,spell,word, vocabulary, language, domain,arabic dictionaries, arabic glossaries,,arabic translators,arabic translator,english-arabic translator, qamoos, Ajeeb, Arabic language tutor,arabic keyboards,spellchecker,keyboards,persian,farsi, urdu,hebrew, font, transliteration, index,synonym,antonym,thesaurus,meaning,word-hoard, ideom,phrase,phraseology,expression,Arabic,software localization,translation, global, dictionary, technical,machine translation,education,educational multimedia,word, multilingual, word processor, languages, arabic text email, Spanish, German, french, asian, turkish
 

Our Dictionaries, machine translation, translation memory, and lexicons multilingual dictionary software cover a wide spectrum. Call for more information: 1-781-849-0021

english-arabic,arabic terminology,synonym,arabic terminologies,word-hoard,
english-arabic translator, english-arabic translator,translation services,acronyms,phrase, spell,Ajeeb,arabic translators,translation,word,french-arabic,phraseology,synonym, islam,spell,translation services,thesaurus, language,arabic terminology,dictionary, meaning,arabic dictionary,lexicon,arabic translator,domain, language,arabic translator,synonym,arabic dictionary,Arabic lexicography,vocabulary,lexicon,
Boston Limo

Dictionary software, multimedia, Bidirectional, English Arabic English, English Dictionaries, Arabic Dictionary, covers many languages dictionary and applications, please call us for more information at 1-781-849-0021:

Word processing for the following languages is also available: European, Arabic, Hebrew, Cyrillic, Asian and Indian Languages, Albanian, Arabic (includes spell checker), Aramaic, Armenian, Azeri-Arabic, Azeri-Cyrillic, Azeri-Turkish, Bengali, Bohemian(Czech), Bulgarian, Burmese, Byelorussian, Croatian, Danish, Dutch, English, Esperanto, Farsi, French, Finnish, Georgian, German, Greek/Modern, Greek/Classical, Gujarati, Hebrew, Hindi, Hungarian, Icelandic, International Phonetic Alphabet, Inuktitut, Italian, Kannada, Khmer, Ladino, Lao, Latin, Latvian, Lihyanite, Lithuanian, Macedonian, Malayalam, Malay-Jawi, Marathi, Moabite, Mongolian, Nabataean, Nepali, Norwegian, Oriya, Oromo, Pashto, Polish, Portuguese, Punjabi, Rumanian, Russian, Safitic,Sami, Sanskrit, Serbian, Sinhalese, Slovak, Slovenian, South Arabian, Spanish, Swedish, Swiss, Syriac-Eastern, Syriac-Estrangelo, Tagalog, Tamil, Telugu, Thai, Talmudic, Tibetan, Tigrinya, Tigre, Transliteration, Turkish, UK-English, Ugaritic, Ukrainian, Urdu, Vietnamese, Welsh, Wendish Lusatian, sorbian , Yiddish.

Learn, read, and write these languages:

Afrikaans, Albanian, Algeria, Bahrain, Egypt, Iraq, Jordan, Kuwait, Lebanon, Libya, Morocco, Oman, Qatar, Saudi Arabia, Syria, Tunisia, U.A.E., Yemen, Argentina, Armenia, Armenian, Austria, Austrian, Australia, Australian, Azeri, Bahrain, Basque, Barusian, Belgium, Belgian, Bieze, Bengali, Bolivia, Bosnia, Brazil, Bulgarian, Canadian, Caribbean, Catalan, Chile, Chinese, China, Colombia, Columbian, Costa Rica, Costa Rican, Cree, Croatian, Croatia, Czech, Danish, Dari, Dominican Republic, Dominican Republic, Dutch, Ecuador, Egypt, Egyptian, Salvador, El Salvador, English, Bieze, Canadian, Canada, Jamaica, Jamaican, New Zealand, Philippines, Philipino, Pilipino, Filipino, South Africa, South African, Trinidad, United Kingdom, International, Estonian, Estonia, Faeroese, Faeroese, Farsi, Finnish, Finland, Swedish, Sweden, French, Macedonian, Galician, Gaelic, Georgian, Georgia, Germany, German, Greek, Greece, Guatemalan, Guatemala, Gujarati, Gujaratian, Hebrew, Herzegovina, Hindi, Honduran, Honduras, Hungary, Hungarian, Icelandic, Kannada, Konkani, Marathi, Sanskrit, Tamil, Tugu, Indonesian, Indonesia, Iraq, Iraqi, Irish, Ireland, Italian, Italy, Japanese, Japan, Hiragana, Kannada, Konkani, Konkanian, Sanskrit, Korean, Korea, kurdish, Kurd, Kuwait, Lao, Laos, Latin America, Latin American, Latvian, Latvia, Lebanon, Lebanese, Libya, Libyan, Liechtenstein, Lithuanian, Lithuania, Luxembourg, Malay, Brunei, Darussalam, Malaysia, Malayalam, Malayalam, Maltese, Maori, Maorian, Marathi, Marathian, Mexico, Mexican, Monaco, Moroccan, Netherlands, Nicaragua, Nicaraguan, Northern Sotho, Norwegian, Norway, Norwegian, bokma, Nynorsk, Oman, Panama, Panamanian, Paraguay, Pashto, Peru, Peruvian, Polish, Poland, Puerto Rico, Punjabi, Quechua, Bolivia, Romanian, Romania, Russian, Prussian, Russia, Barusian, Sanskrit, Cyrillic, Herzegovina, Slovak, Slovakian, Slovenian, South African, Spanish, Spain, Argentina, Bolivian, Chile, Venezuelan, Swahili, Swahilian, Tugu, Togolese, Thai, Trinidad, Tswana, Tunisia, Turkish, Ukrainian, Ukrainian, United Kingdom, united States, Urdu
Urdu, Uzbek, Uzbekian, Vietnamese, Welsh, Xhosa, Yemen, Zimbabwe. Zulu,