Name Romanizer
Romanization is a way of reproducing the sound of the words according to the orthography rules of the target language "mainly Latin script based language"; uniform results in the Romanization of non-Latin based script languages are difficult to obtain, since vowel points and diacritical marks -if any- are generally omitted from both manual and machine writing. It follows that for correct identification of the words which appear in any particular name, knowledge of its standard Arabic-script spelling including proper pointing, and recognition of dialectal and idiosyncratic deviations are essential. In multilingual applications, the problem of corss-lingual name matching is complicated by the fact that when a name is represented in a script different from its native script there may be several alternative representations for each phoneme, leading to large number of potential variants, this is true for Romanized forms of names in many languages such as Arabic. Romanizer does an orthographic transcription by default, a process that can distinguish between multicultural names and apply a sophisticated algorithm to provide the most likely variations. It is neither a simple name matching algorithm nor a name combination generator engine; it rather goes all the way back creating a new legitimate version of the input name token to decide which legitimate variants to return, thus avoiding the elaborate listing of trivial and never-exist variants mostly output by simple combination processes. It also tracks cultural variations through look-up database of refined set of etymologic database to follow exactly the way used by specific culture to Romanize the name. For Semetic languages such as Arabic which is normally unvocalized, the unvocalized Arabic name does not give sufficient information for accurate romanization, Arabic personal names come without any diacritical marks so a fully accurate vocalized version of the name is necessary for romanization purposes, the system serves as a major pre-processor and provide three optional vocalization modes to choose from as adding vowels to Arabic personal names is critical for any Romanization or transcription process, this module accomplishes the task of disambiguating Arabic personal names; it can be viewed as a front-end processor for the transcription system, this module is name specific, unpredictable output will result if used to vocalize ordinary text Arabic personal name has no more than one way to spell and this is true for most of the names exist in the language vocabulary; variants occur only when romanizing the Arabic name to other languages that have no one-to-one phonetic relationship with Arabic e.g. all romance languages. As a result of this each Arabic name has many romanized variants in other languages too. A major problem challenging Arabic text processing systems is the absence of diacritics and vowels (Fat'ħa, Damma, Kasra, Shadda, Sukuun) in almost all the Arabic writings. The information contained in unvocalized orthography of the Arabic names is not sufficient to most applications supposed to accomplish automatic transliteration of Arabic proper nouns to English For instance the names "أحْمَد" /ʔħ'mad/ and "مُحَمَّد" /muħam'mad/ " have the following different transcription pairs respectively :
"Ahmet" and "Muhemet" in Turkey, The pairs shown above do not actually tell the whole story since the list of variants of such very common names will be really long without enforcing some sort of rules to limit the output within acceptable phonemic forms. The system is designed to deal with such complexities and fully return all romanized versions of any personal name with such minimal nuances been distinguished and clearly manifested |
![]()
A screenshot of MAPSOno Romanizer interface, you can view the technical specifications. You may also DOWNLOAD Evaluation copy. |
![]() |
Vocalized: vocalized version of the name | Roman: Romanized entry |
Standard: Romanization standard | Language: Entry language |
Variant: Variant romanization | Common: Popular Romanization |
ID | Entry | Vocalized | Roman | Standard | Language | Variant | Common |
---|---|---|---|---|---|---|---|
1 | أبوبكر | أَبُوبَكْر | Abū Bakr | ALA-LC | Arabic | Abubakar | Abu Bakr |
2 | حزام | حَزَام | Ḥazām | ALA-LC | Arabic | Hazam | |
3 | حسام | حُسَام | Ḥusām | ALA-LC | Arabic | Hosam | Husam |
4 | حسان | حَسَّان | Ḥassān | ALA-LC | Arabic | Hasan | Hassan |
5 | خولي | خُولِي | Khūlī | ALA-LC | Arabic | Khouly | Khuli |
6 | خنساء | خَنْسَاء | Khansā | ALA-LC | Arabic | Khansa | |
7 | خضاب | خِضَاب | Khiḍāb | ALA-LC | Arabic | Khidab | |
8 | جداول | جَدَاوِل | Jadāwil | ALA-LC | Arabic | Jadawil | |
9 | جيداء | جِيدَاء | Jīdā' | ALA-LC | Arabic | Jidaa | |
10 | جدوى | جَدْوَى | Jadwā | ALA-LC | Arabic | Jedoi | Jadwa |
11 | 건우 | Geonu | Revised Romanization | Korean | |||
12 | 경수 | Gyeongsu | Revised Romanization | Korean | |||
13 | 경숙 | Gyeongsuk | Revised Romanization | Korean | |||
14 | 경옥 | Gyeongok | Revised Romanization | Korean | |||
15 | 경자 | Gyeongja | Revised Romanization | Korean | |||
16 | 경희 | Gyeonghui | Revised Romanization | Korean | |||
17 | 광수 | Gwangsu | Revised Romanization | Korean | |||
18 | 다은 | Daeun | Revised Romanization | Korean | |||
19 | 도윤 | Doyun | Revised Romanization | Korean | |||
20 | 도현 | Dohyeon | Revised Romanization | Korean | |||
21 | 张 伟 | Chang Wei | Wade-Giles | Chinese | |||
22 | 王 伟 | Wang Wei | Wade-Giles | Chinese | |||
23 | 王 芳 | Wang Fang | Wade-Giles | Chinese | |||
24 | 李 伟 | Li Wei | Wade-Giles | Chinese | |||
25 | 王 秀 英 | Wang Hsiu Ying | Wade-Giles | Chinese | |||
26 | 李 秀 英 | Li Hsiu Ying | Wade-Giles | Chinese | |||
27 | 李 娜 | Li Na | Wade-Giles | Chinese | |||
28 | 张 秀 英 | Chang Hsiu Ying | Wade-Giles | Chinese | |||
29 | 刘 伟 | Liu Wei | Wade-Giles | Chinese | |||
30 | 张 敏 | Chang Min | Wade-Giles | Chinese | |||
31 | ሀብተ ሕይወት | Habte Hiwet | Non-Standard | Amharic | |||
32 | ሀመልማል | Hamelmal | Non-Standard | Amharic | |||
33 | ኃይለ ሃይማኖት | Hayle Haymanot | Non-Standard | Amharic | |||
34 | ሄኖክ | Henok | Non-Standard | Amharic | |||
35 | ኃይለ ሥላሴ | Hayle Silasse | Non-Standard | Amharic | |||
36 | ላካቸው | Lakachew | Non-Standard | Amharic | |||
37 | ልሳነወርቅ | Lissanewerk | Non-Standard | Amharic | |||
38 | መስዋዕት | Meswait | Non-Standard | Amharic | |||
39 | ተስፋው | Tesfaw | Non-Standard | Amharic | |||
40 | ዐማኑኤል | Amanuel | Non-Standard | Amharic | |||
41 | Сабина | Sabina | Non-Standard | Russian | |||
42 | Серафим | Serafim | Non-Standard | Russian | |||
43 | Ольга | Olga | Non-Standard | Russian | |||
44 | Михаил | Mikhail | Non-Standard | Russian | |||
45 | Мирослав | Miroslav | Non-Standard | Russian | |||
46 | Наташа | Natasha | Non-Standard | Russian | |||
47 | Феодора | Feodora | Non-Standard | Russian | |||
48 | Борис | Boris | Non-Standard | Russian | |||
49 | Иван | Ivan | Non-Standard | Russian | |||
50 | Катя | Katia | Non-Standard | Russian | |||
51 | कुशिक | Kushik | Non-Standard | Hindi | |||
52 | गिरीश | Gireesh | Non-Standard | Hindi | |||
53 | शिखा | Shikha | Non-Standard | Hindi | |||
54 | गीता | Geeta | Non-Standard | Hindi | |||
55 | अखिल | Akhil | Non-Standard | Hindi | |||
56 | शिवानी | Shivanee | Non-Standard | Hindi | |||
57 | रोहिनी | Rohinee | Non-Standard | Hindi | |||
58 | नीलिमा | Neelima | Non-Standard | Hindi | |||
59 | रीना | Reena | Non-Standard | Hindi | |||
60 | रत्ना | Ratna | Non-Standard | Hindi |
Home » MAPS » MAPS Onomastics » Name Romanizer