Geographic Names Arabicizer
Synopsis
The Geographic Names Arabicizer is one of the MAPS suite for toponymy, a productivity tool for Arabicizing place names from many different languages in their native script, the system transcripts names into fully diacritized Arabic adding optional Arabic variants with extra useful information like the potential name type.
Preview
A direct Arabicization of geographic name is straightforward with Kalmasoft Arabicizer if the input is written in native script, the name's inherent phonemic characteristics are kept through various stages of Arabicization process. This system is based on thorough listing of language phonetic rules that are used to Arabicize place names from 50 different languages, place names can be input in their original script or as romanized version where improvement to Arabicization process is done through pattern matching and other accuracy measurments used for Arabicization.
Three kinds of Arabicization are available through this system:
- Arabicizing geographic names directly from their native writing systems, e.g. Тришкова (تريشكوفا)
- Arabicizing geographic names that are transcripted using a third language writing system, e.g. Cyrillized Greek, as in [Германиа] for Γερμανία Arabicized to (جيرمانيا)
- Retrieving Arabic geographic names that are transcripted using a dedicated Romanization system like UNGEGN e.g. Dihūk (دِهُوك).
If the geographic names are originally non-Arabic, a dedicated retrieval system Geographic Names Retriever. will be the most suitable for such a case.
The system supports more than 50 languages in input: Oromo, Afar, Afrikaans, Amharic, Armenian, Assamese, Bengali, Bulgarian, Burmese, Catalan, Chinese, Czech, Danish, Dutch, Esperanto, Finnish, French, Gaelic, German, Greek, Gujarati, Hebrew, Hindi, Indonesian, Interlingua, Irish, Italian, Japanese, Korean, Kurdish, Malayalam, Marathi, Pashto, Persian, Portuguese, Romanian, Russian, Spanish, Somali, Swahili, Swedish, Telugu, Thai, Tigrinya, Turkish, Ukrainian, Urdu, Yiddish, Yoruba, Tigre, Filipino, Tamazight, Syriac.
The system also retrieves Arabic place names romanized using the following standards ADEGN, ALA_LC, BGN, Buckwalter, DIN, IGN, ISO, KATS, RJGC, SAS, SATTS, UNGEGN, and IPA.
The system uses a database for geographic names that come in different orthographies and have multiple exonyms e.g. Egypt, Cairo.
Resources
- Geographic Names Romanizer
- Geographic Names Transcriber
- Geographic Names Arabicizer
- Geographic Names Retriever
The sample given below contains multilingual text. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the correct intended characters, you may either get the suitable fonts from our lingual support page or download the TXT version for which no extra arrangement is required, you may also view a larger sample here Arabicized place name database.
Sample output
Entry: Geographic name input
KATS: Kalmasoft Arabic Transliteration System
Arabic: Arabicized output
Standard: Arabicization standard
Language: Entry language writing system
Variants: vocalized Arabic variant spelling
ID | Entry | KATS | Arabic | Standard | Language | Variants |
---|---|---|---|---|---|---|
1 | Στάγειρα | stAgyyrA | ستاغييرا | non-Standard | Greek | إستاغيرا |
2 | Στρατονίκεια | strAtwnykyyA | ستراتونيكييا | non-Standard | Greek | إستراتونيكيا |
3 | Στρογγυλή | strwggwly | ستروغغولي | non-Standard | Greek | |
4 | Στρυμόνας | strwmwnAs | ستروموناس | non-Standard | Greek | |
5 | Στυξ | stwK | ستوخ | non-Standard | Greek | |
6 | Σύβαρη | swbAry | سوباري | non-Standard | Greek | |
7 | Σύβοτα | swbwtA | سوبوتا | non-Standard | Greek | |
8 | Σύμη | swmy | سومي | non-Standard | Greek | |
9 | Τανάιδα | tAnAydA | تانايدا | non-Standard | Greek | |
10 | Τάραντας | tArAntAs | تارانتاس | non-Standard | Greek | |
11 | Ταρκύνιοι | tArkwnywy | تاركونيوي | non-Standard | Greek | |
12 | Ταρσός | tArsws | تارسوس | non-Standard | Greek | |
13 | Τάρταρος | tArtArws | تارتاروس | non-Standard | Greek | |
14 | Τάρτησσος | tArtysws | تارتيسوس | non-Standard | Greek | |
15 | עכו | okA | عكا | non-Standard | Hebrew | |
16 | עפולה | obwlh | عبوله | non-Standard | Hebrew | |
17 | ערד | ord | عرد | non-Standard | Hebrew | |
18 | אריאל | AryAl | اريال | non-Standard | Hebrew | |
20 | אשדוד | A$dwd | اشدود | non-Standard | Hebrew | |
21 | אשקלון | A$qlwn | اشقلون | non-Standard | Hebrew | عسقلان |
23 | בת ים | bt ym | بت يم | non-Standard | Hebrew | |
24 | באר שבע | byr $bo | بير شبع | non-Standard | Hebrew | بير شبعا |
25 | בית שאן | byt $An | بيت شان | non-Standard | Hebrew | |
26 | בית שמש | byt $m$ | بيت شمش | non-Standard | Hebrew | |
27 | ביתר עילית | bytr oylyt | بيتر عيليت | non-Standard | Hebrew | |
29 | בני ברק | bny brq | بني برق | non-Standard | Hebrew | |
30 | דימונה | dymwnh | ديمونه | non-Standard | Hebrew | |
31 | אילת | EylAt | إيلات | non-Standard | Hebrew | |
32 | אלעד | Alod | العد | non-Standard | Hebrew | |
33 | גבעת שמואל | jbot $mwAl | جبعت شموال | non-Standard | Hebrew | |
34 | גבעתיים | jbotyym | جبعتييم | non-Standard | Hebrew | |
35 | חדרה | Hdrh | حدره | non-Standard | Hebrew | |
36 | חיפה | Hybh | حيبه | non-Standard | Hebrew | |
37 | הרצליה | hrzlyh | هرزليه | non-Standard | Hebrew | هرتزليا |
38 | הוד השרון | hwd h$rwn | هود هشرون | non-Standard | Hebrew | |
39 | חולון | Hwlwn | حولون | non-Standard | Hebrew | |
40 | ירושלים | !wrw$lym | أوروشليم | non-Standard | Hebrew | أورشليم |
41 | כפר קאסם | kfr qAsm | كفر قاسم | non-Standard | Hebrew | |
42 | כרמיאל | krmyAl | كرميال | non-Standard | Hebrew | |
43 | כפר סבא | kfr sbA | كفر سبا | non-Standard | Hebrew | |
44 | כפר יונה | kfr ywnh | كفر يونه | non-Standard | Hebrew | |
45 | קריית אתא | qryAt AtA | قريات اتا | non-Standard | Hebrew | |
46 | קריית ביאליק | qryAt byAlyq | قريات بياليق | non-Standard | Hebrew | |
47 | קריית גת | qryAt jt | قريات جت | non-Standard | Hebrew | |
48 | קריית מלאכי | qryAt mlAky | قريات ملاكي | non-Standard | Hebrew | |
49 | קריית מוצקין | qryAt mwSqyn | قريات موصقين | non-Standard | Hebrew | |
50 | קריית אונו | qryAt Awnw | قريات اونو | non-Standard | Hebrew | |
51 | קריית שמונה | qryAt $mwnh | قريات شمونه | non-Standard | Hebrew | كريات شمونا |
52 | קריית ים | qryAt ym | قريات يم | non-Standard | Hebrew | |
53 | לוד | lwd | لود | non-Standard | Hebrew | |
54 | מעלה אדומים | molh Adwmym | معله ادوميم | non-Standard | Hebrew | معاليه أدوميم |
56 | מעלות-תרשיחא | molwt tr$yHA | معلوت ترشيحا | non-Standard | Hebrew | |
57 | מגדל העמק | mjdl homq | مجدل هعمق | non-Standard | Hebrew | |
58 | מודיעין עילית | mwdyoyn oylyt | موديعين عيليت | non-Standard | Hebrew | |
60 | מודיעין-מכבים-רעות | mwdyoyn mkbym rowt | موديعين مكبيم رعوت | non-Standard | Hebrew | |
61 | נהריה | nhryh | نهريه | non-Standard | Hebrew | |
62 | נצרת | nSrt | نصرت | non-Standard | Hebrew | |
63 | נצרת עילית | nSrt oylyt | نصرت عيليت | non-Standard | Hebrew | |
64 | נשר | n$r | نشر | non-Standard | Hebrew | |
65 | נס ציונה | ns Sywnh | نس صيونه | non-Standard | Hebrew | |
66 | נתניה | ntnyh | نتنيه | non-Standard | Hebrew | ناتانيا |
67 | נתיבות | ntybwt | نتيبوت | non-Standard | Hebrew | |
68 | אופקים | Awbqym | اوبقيم | non-Standard | Hebrew | |
69 | אור עקיבא | Awr oqybA | اور عقيبا | non-Standard | Hebrew | |
70 | אור יהודה | Awr yhwdh | اور يهوده | non-Standard | Hebrew | |
71 | פתח תקווה | btH tqwwh | بتح تقووه | non-Standard | Hebrew | |
72 | קלנסווה | qlnswh | قلنسوه | non-Standard | Hebrew | |
73 | רעננה | ronnh | رعننه | non-Standard | Hebrew | |
74 | רהט | rht | رهت | non-Standard | Hebrew | |
75 | רמת גן | rAmAt jn | رامات جن | non-Standard | Hebrew | |
76 | רמת השרון | rAmAt h$rwn | رامات هشرون | non-Standard | Hebrew | |
77 | רמלה | rmlh | رمله | non-Standard | Hebrew | |
78 | רחובות | rHwbwt | رحوبوت | non-Standard | Hebrew | |
79 | ראשון לציון | rA$wn lSywn | راشون لصيون | non-Standard | Hebrew | |
80 | ראש העין | rA$ hoyn | راش هعين | non-Standard | Hebrew | |
81 | צפת | Sbt | صبت | non-Standard | Hebrew | |
83 | שדרות | $drwt | شدروت | non-Standard | Hebrew | |
84 | שפרעם | $brom | شبرعم | non-Standard | Hebrew | |
85 | טמרה | tmrh | تمره | non-Standard | Hebrew | |
86 | טייבה | tyybh | تييبه | non-Standard | Hebrew | |
87 | תל אביב | tl Abyb | تل ابيب | non-Standard | Hebrew | |
88 | טבריה | tbryh | تبريه | non-Standard | Hebrew | |
89 | טירה | tyrh | تيره | non-Standard | Hebrew | |
90 | טירת כרמל | tyrt krml | تيرت كرمل | non-Standard | Hebrew | |
91 | אום אל-פחם | !m AlfHm | م الفحم | non-Standard | Hebrew | |
92 | יבנה | ybnh | يبنه | non-Standard | Hebrew | |
93 | יהוד-מונוסון | yhwd mwnwswn | يهود مونوسون | non-Standard | Hebrew | |
94 | יקנעם | yqnom | يقنعم | non-Standard | Hebrew | |
95 | Борисоглебск | bwryswglybsk | بوريسوغليبسك | non-Standard | Russian | |
96 | Бородино | bwrdynw | بورودينو | non-Standard | Russian | |
97 | Боровичи | bwrwfyty | بوروفيتي | non-Standard | Russian | |
98 | Боровск | bwrwfsk | بوروفسك | non-Standard | Russian | |
99 | Борзя | bwrzy | بورزي | non-Standard | Russian | |
100 | Братск | brAtsk | براتسك | non-Standard | Russian | |
101 | Бронницы | brwnnytk | بروننيتك | non-Standard | Russian | |
102 | Брянск | brynsk | برينسك | non-Standard | Russian | |
103 | Будённовск | bwdywnnwfsk | بوديوننوفسك | non-Standard | Russian | |
104 | Бугульма | bwgwlkm! | بوغولګمأ | non-Standard | Russian | |
105 | Новочеркасск | nwfwtyrk!sk | نوفوتيركأسك | non-Standard | Russian | |
106 | Новодвинск | nwfwdfynsk | نوفودفينسك | non-Standard | Russian | |
107 | Новохопёрск | nwfwKwbywrsk | نوفوخوبيورسك | non-Standard | Russian | |
108 | Новокубанск | nwfwkwb!nsk | نوفوكوبأنسك | non-Standard | Russian | |
109 | Новокуйбышевск | nwfwkwybk$yfsk | نوفوكويبګشيفسك | non-Standard | Russian | |
110 | Новокузнецк | nwfwkwznytk | نوفوكوزنيتك | non-Standard | Russian | |
111 | Новомичуринск | nwfwmytwrynsk | نوفوميتورينسك | non-Standard | Russian | |
112 | あおきむら | !wkmrA | أوكِمُرا | non-Standard | Japanese | |
113 | あおもりけん | !!mwrkyn | أوُمورِكين | non-Standard | Japanese | |
114 | あおもりし | !!mwr$y | أوُمورِشِي | non-Standard | Japanese | |
115 | あかいがわむら | !kAygAwAmrA | أكايغاوامُرا | non-Standard | Japanese | |
116 | あかいわし | !kAywA$y | أكايواشِي | non-Standard | Japanese | |
117 | あかしし | !kA$$y | أكاشِشِي | non-Standard | Japanese | |
118 | あがのし | !gAnw$y | أغانوشِي | non-Standard | Japanese | |
119 | あかびらし | !kAbrA$y | أكابِراشِي | non-Standard | Japanese | |
120 | あがまち | !gAmAt$y | أغاماتشِي | non-Standard | Japanese | |
121 | あかむら | !kAmrA | أكامُرا | non-Standard | Japanese | |
122 | つべつちょう | tswbytswt$yw | تسُوبيتسُوتشِيو | non-Standard | Japanese | |
123 | つまごいむら | tswmAgwymُrA | تسُوماغويمُرا | non-Standard | Japanese | |
124 | つやまし | tswyAmA$y | تسوياماشِي | non-Standard | Japanese | |
126 | つるおかし | tswr!kA$y | تسُورُأُكاشِي | non-Standard | Japanese | |
127 | たいじちょう | tAyjt$yw | تايجِتشِيو | non-Standard | Japanese | |
128 | だいせんし | dAysyn$y | دايسينشِي | non-Standard | Japanese | |
129 | だいせんちょう | dAysynt$yw | دايسينتشِيوأ | non-Standard | Japanese | |
130 | たいとうく | tAytw!k | تايتوأُكُ | non-Standard | Japanese | |
131 | だいとうし | dAytw!$y | دايتوأُشِي | non-Standard | Japanese | |
132 | たいないし | tAynAy$y | تاينايشِي | non-Standard | Japanese | |
133 | たいわちょう | tAywAt$yw | تايواتشِيو | non-Standard | Japanese | |
134 | たかいしし | tAkAy$$y | تاكايشِشِي | non-Standard | Japanese | |
135 | たかおかし | tAkAwkA$y | تاكاوكاشِي | non-Standard | Japanese | |
136 | たかぎむら | tAkAgmrA | تاكاغِمُرا | non-Standard | Japanese | |
137 | たかさきし | tAkAsAk$y | تاكاساكِشِي | non-Standard | Japanese | |
138 | たかさごし | tAkAsAgw$y | تاكاساغوشِي | non-Standard | Japanese | |
139 | たかしまし | tAkA$mA$y | تاكاشِماشِي | non-Standard | Japanese | |
140 | たがじょうし | tAgAjyw!$y | تاغاجِيوأُشِي | non-Standard | Japanese | |
135 | たかおかし | tAkAwkA$y | تاكاوكاشِي | non-Standard | Japanese | |
136 | たかぎむら | tAkAgmrA | تاكاغِمُرا | non-Standard | Japanese | |
137 | たかさきし | tAkAsAk$y | تاكاساكِشِي | non-Standard | Japanese | |
138 | たかさごし | tAkAsAgw$y | تاكاساغوشِي | non-Standard | Japanese | |
139 | たかしまし | tAkA$mA$y | تاكاشِماشِي | non-Standard | Japanese | |
140 | たがじょうし | tAgAjyw!$y | تاغاجِيوأُشِي | non-Standard | Japanese | |