Name Romanizer

Synopsis

MAPSOno is the family name of our anthroponomy processing package; a set of specialized modules tuned for applications such as information retrieval, document clustering, named entity extraction (NER), and translation. Output samples are available for download as PDF for your convenience, you are also invited to send us your sample input for processing free of charge, we accept limited size samples in any form or encoding.



Download  Android Romanizer App Android App available

Download  Apple Romanizer App iOS App coming soon

Access Romanizer web application Web application available

MAPSOno SoftwareDatabasesMAPSTopo Software
Personal Names Transcription Arabic Given Names Romanization
Personal Names Romanization Arabic Surnames Transcription
Personal Names Arabicization Arab Full Names Arabicization
Personal Names Retrieval Names of Arabic Origins Retrieval
Arabic Names Classification Non-Arab Names  
Roman Names Arabicizer Unique and Indigenous Names  
  Famous Names and Celebrities  
  Counterintuitively Pronounced Names  

Romanization is a way of reproducing the sound of the words according to the orthography rules of the target language "mainly Latin script based language"; uniform results in the Romanization of non-Latin based script languages are difficult to obtain, since vowel points and diacritical marks -if any- are generally omitted from both manual and machine writing. It follows that for correct identification of the words which appear in any particular name, knowledge of its standard Arabic-script spelling including proper pointing, and recognition of dialectal and idiosyncratic deviations are essential.

In multilingual applications, the problem of corss-lingual name matching is complicated by the fact that when a name is represented in a script different from its native script there may be several alternative representations for each phoneme, leading to large number of potential variants, this is true for Romanized forms of names in many languages such as Arabic. Romanizer does an orthographic transcription by default, a process that can distinguish between multicultural names and apply a sophisticated algorithm to provide the most likely variations. It is neither a simple name matching algorithm nor a name combination generator engine; it rather goes all the way back creating a new legitimate version of the input name token to decide which legitimate variants to return, thus avoiding the elaborate listing of trivial and never-exist variants mostly output by simple combination processes. It also tracks cultural variations through look-up database of refined set of etymologic database to follow exactly the way used by specific culture to Romanize the name.

For Semetic languages such as Arabic which is normally unvocalized, the unvocalized Arabic name does not give sufficient information for accurate romanization, Arabic personal names come without any diacritical marks so a fully accurate vocalized version of the name is necessary for romanization purposes, the system serves as a major pre-processor and provide three optional vocalization modes to chose from as adding vowels to Arabic personal names is critical for any Romanization or transcription process, this module accomplishes the task of disambiguating Arabic personal names; it can be viewed as a front-end processor for the transcription system, this module is name specific, unpredictable output will result if used to vocalize ordinary text

Arabic personal name has no more than one way to spell and this is true for most of the names exist in the language vocabulary; variants occur only when romanizing the Arabic name to other languages that have no one-to-one phonetic relationship with Arabic e.g. all romance languages. As a result of this each Arabic name has many romanized variants in other languages too.

A major problem challenging Arabic text processing systems is the absence of diacritics and vowels (Fat'ħa, Damma, Kasra, Shadda, Sukuun) in almost all the Arabic writings. The information contained in unvocalized orthography of the Arabic names is not sufficient to most applications supposed to accomplish automatic transliteration of Arabic proper nouns to English For instance the names "أحْمَد" /ʔħ'mad/ and "مُحَمَّد" /muħam'mad/ " have the following different transcription pairs respectively :

"Ahmet" and "Muhemet" in Turkey,
"Ahmed" and "Mohamed" in North Africa e.g. Morocco, Algeria, Tunisia, Mauritania, and Egypt
"Ahmad" and "Muhamad" in Gulf region e.g. KSA, Iraq, Oman, etc.

The pairs shown above do not actually tell the whole story since the list of variants of such very common names will be really long without enforcing some sort of rules to limit the output within acceptable phonemic forms. The system is designed to deal with such complexities and fully return all romanized versions of any personal name with such minimal nuances been distinguished and clearly manifested


MAPSOno Romanizer functional diagram

MAPS Name Romanizer
A screenshot of MAPSOno Romanizer interface, you can view the technical specifications. You may also DOWNLOAD Evaluation copy.

  The sample given below contains multilingual text. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the correct intended characters, you may either get the suitable fonts from our lingual support page or download the TXT version for which no extra arrangement is required.

Vocalized: vocalized version of the nameRoman: Romanized entry
Standard: Romanization standardLanguage: Entry language
Variant: Variant romanizationCommon: Popular Romanization

ID Entry Vocalized Roman Standard Language Variant Common
1 أبوبكر أَبُوبَكْر Abū Bakr ALA-LC Arabic Abubakar Abu Bakr
2 حزام حَزَام Ḥazām ALA-LC Arabic Hazam
3 حسام حُسَام Ḥusām ALA-LC Arabic Hosam Husam
4 حسان حَسَّان Ḥassān ALA-LC Arabic Hasan Hassan
5 خولي خُولِي Khūlī ALA-LC Arabic Khouly Khuli
6 خنساء خَنْسَاء Khansā ALA-LC Arabic Khansa
7 خضاب خِضَاب Khiḍāb ALA-LC Arabic Khidab
8 جداول جَدَاوِل Jadāwil ALA-LC Arabic Jadawil
9 جيداء جِيدَاء Jīdā' ALA-LC Arabic Jidaa
10 جدوى جَدْوَى Jadwā ALA-LC Arabic Jedoi Jadwa
11 건우 Geonu Revised Romanization Korean
12 경수 Gyeongsu Revised Romanization Korean
13 경숙 Gyeongsuk Revised Romanization Korean
14 경옥 Gyeongok Revised Romanization Korean
15 경자 Gyeongja Revised Romanization Korean
16 경희 Gyeonghui Revised Romanization Korean
17 광수 Gwangsu Revised Romanization Korean
18 다은 Daeun Revised Romanization Korean
19 도윤 Doyun Revised Romanization Korean
20 도현 Dohyeon Revised Romanization Korean
21 张 伟 Chang Wei Wade-Giles Chinese
22 王 伟 Wang Wei Wade-Giles Chinese
23 王 芳 Wang Fang Wade-Giles Chinese
24 李 伟 Li Wei Wade-Giles Chinese
25 王 秀 英 Wang Hsiu Ying Wade-Giles Chinese
26 李 秀 英 Li Hsiu Ying Wade-Giles Chinese
27 李 娜 Li Na Wade-Giles Chinese
28 张 秀 英 Chang Hsiu Ying Wade-Giles Chinese
29 刘 伟 Liu Wei Wade-Giles Chinese
30 张 敏 Chang Min Wade-Giles Chinese
31 ሀብተ ሕይወት Habte Hiwet Non-Standard Amharic
32 ሀመልማል Hamelmal Non-Standard Amharic
33 ኃይለ ሃይማኖት Hayle Haymanot Non-Standard Amharic
34 ሄኖክ Henok Non-Standard Amharic
35 ኃይለ ሥላሴ Hayle Silasse Non-Standard Amharic
36 ላካቸው Lakachew Non-Standard Amharic
37 ልሳነወርቅ Lissanewerk Non-Standard Amharic
38 መስዋዕት Meswait Non-Standard Amharic
39 ተስፋው Tesfaw Non-Standard Amharic
40 ዐማኑኤል Amanuel Non-Standard Amharic
41 Сабина Sabina Non-Standard Russian
42 Серафим Serafim Non-Standard Russian
43 Ольга Olga Non-Standard Russian
44 Михаил Mikhail Non-Standard Russian
45 Мирослав Miroslav Non-Standard Russian
46 Наташа Natasha Non-Standard Russian
47 Феодора Feodora Non-Standard Russian
48 Борис Boris Non-Standard Russian
49 Иван Ivan Non-Standard Russian
50 Катя Katia Non-Standard Russian
51 कुशिक Kushik Non-Standard Hindi
52 गिरीश Gireesh Non-Standard Hindi
53 शिखा Shikha Non-Standard Hindi
54 गीता Geeta Non-Standard Hindi
55 अखिल Akhil Non-Standard Hindi
56 शिवानी Shivanee Non-Standard Hindi
57 रोहिनी Rohinee Non-Standard Hindi
58 नीलिमा Neelima Non-Standard Hindi
59 रीना Reena Non-Standard Hindi
60 रत्ना Ratna Non-Standard Hindi

Home » MAPS  » MAPS Onomastics » Name Romanizer

Category Software | Reference MAPSONOROM | Family MAPSONO | Last updated 26/4/2020