Names Diversity Project Kalmasoft Arabic Names Classification Statistics www.kalmasoft.com, info@kalmasoft.com Creation date: May 30, 2020 Work References: Database of Arabic Given Names: www.kalmasoft.com/KLEX/dbgivnm.htm Database of Arabic Surnames: www.kalmasoft.com/KLEX/dbsurnm.htm Database of Unique and Indigenous Names: www.kalmasoft.com/KLEX/dbuniqnm.htm Database of Names of Arabic origin: www.kalmasoft.com/KLEX/dbaorgn.htm Database of Arabic Name Variants: www.kalmasoft.com/KLEX/dbvarom Files metadata: 1. Statistical information file: File name: DivProArabInfo.txt. Contents: statistics and information: File created: Apr 27, 2020. File update: May 30, 2020. 2. Output deliverable file: File name: DivProArab.csv. Contents: final deliverable processed names: File created: Apr 28, 2020. File update: May 30, 2020. Data format: CSV, comma separated values. File encoding: Central European (Windows). 3. Arabic name parts file: File name: DivProArabParts.txt. Contents: separate file for names with Arabic parts: File created: Apr 28, 2020. File update: May 30, 2020. Data format: TAB, Tab delimited. File encoding: Central European (Windows). 4. Arabic names file: File name: DivProArabNames.txt. Contents: separate file for Arab names: File created: Apr 28, 2020. File update: May 30, 2020. Data format: TAB, Tab delimited. File encoding: Central European (Windows). Deliverable data file fields information: ID: identification number. full name: Full name. name# : given name(s). arab: y/n field designates whether the name is Arabic. surname: Surname. gender: gender of first name in the full name list, M: male, F: female, U: unisex. classification: full name culture classification code. A: full Arab name. D: Adopted non-Arab name, single names only. M: potential non-Arab Muslim name, single names only. O: potentially of Arabic origin, single names only. P: partially Arabic, mix of Arabic and non-Arabic names, full names only. country: ISO3166 country code, (AA) Common Arabic name 'Common', (PS) Palestine. zone: countries aggregation marker based on culture. Z0: Algeria,Mauritania,Morocco (Maghreb, Francophone) Z1: Tunisia,Libya (North Africa, Francophone) Z2: Egypt,Israel,Palestine,Jordan (Mediterranean, Anglophone) Z3: Nigeria,Cameroon,Senegal,Chad,Niger,Mali (West Africa, Francophone) Z4: Sudan,Kenya,Ethiopia,Eritrea,Somal,Djibuty (East Africa, Anglophone) Z5: Syria,Turkey,Lebanon,Malta,Armenia,Albania,Bosnia and Herzegovina (Levantine, Francophone) Z6: Saudi Arabia,Bahrain,Iraq,Yemen,Qatar,Emirates,Comoros (Arabic Gulf, Anglophone) Z7: Iran,Afghanistan,Pakistan,Uzbekistan,Turkmenistan,Tajikistan,Ukraine (Asia Minor) Z8: India,Bangladesh,Brunei,Sri Lanka,Nepal,Madagascar (East Asia) Z9: Malaysia,Indonesia,Philippines,Cambodia (South East Asia) Arabic name parts file fields information: ID: identification number. full name: Full name. Parts: Arabic parts found in the name. //////////////////////////////////////////////////////////////////////////////// 1. [Arabic names classification algorithm] --hidden-- 2. [Full name geolocation algorithm, based on imbalanced corpus] --hidden-- //////////////////////////////////////////////////////////////////////////////// 3. [General statistics] Total number of names: 3,790,402 Total full Arabic names: 7,680; (0.2026%) of total names. Partially Arabic names: 84,496; (2.2292%) of total names. Very common full Arabic names: 993; (12.9297%) of total full Arabic names [7,680]. Total potential Arabic parts: 108,157 Total unique Arabic name parts: 11,643; (2.0436%) of total reference Arabic names [569,720]. Total names assigned to zones: 3,470; (45.1823%) of total full Arabic names [7,680]. Number of countries assigned: 26 out of 68 countries. Cultural zones assigned: 8 out of 10 zones. 4. [Distribution of Arabic full names assigned to cultural zones] Zone code Number;Percent Percent Z0 391; (5.0911% of full Arabic names) (0.0103% of total names) Z1 118; (1.5365% of full Arabic names) (0.0031% of total names) Z2 878; (11.4323% of full Arabic names) (0.0232% of total names) Z3 5; (0.0651% of full Arabic names) (0.0001% of total names) Z5 410; (5.3385% of full Arabic names) (0.0108% of total names) Z6 1,075; (13.9974% of full Arabic names) (0.0284% of total names) Z7 395; (5.1432% of full Arabic names) (0.0104% of total names) Z8 198; (2.5781% of full Arabic names) (0.0052% of total names) 5. [Distribution of full Arabic names assigned to unique country, marked 'A'] Country Number;Percent Percent ISO code Emirates 67; (0.8724% of full Arabic names) (0.0018% of total names) AE Afghanistan 18; (0.2344% of full Arabic names) (0.0005% of total names) AF Bangladesh 198; (2.5781% of full Arabic names) (0.0052% of total names) BD Bahrain 26; (0.3385% of full Arabic names) (0.0007% of total names) BH Cameroon 17; (0.2214% of full Arabic names) (0.0004% of total names) CM Algeria 232; (3.0208% of full Arabic names) (0.0061% of total names) DZ Egypt 545; (7.0964% of full Arabic names) (0.0144% of total names) EG Iraq 658; (8.5677% of full Arabic names) (0.0174% of total names) IQ Iran 33; (0.4297% of full Arabic names) (0.0009% of total names) IR Jordan 147; (1.9141% of full Arabic names) (0.0039% of total names) JO Kuwait 193; (2.513% of full Arabic names) (0.0051% of total names) KW Lebanon 225; (2.9297% of full Arabic names) (0.0059% of total names) LB Libya 60; (0.7812% of full Arabic names) (0.0016% of total names) LY Morocco 14; (0.1823% of full Arabic names) (0.0004% of total names) MA Mauritania 125; (1.6276% of full Arabic names) (0.0033% of total names) MR Oman 88; (1.1458% of full Arabic names) (0.0023% of total names) OM Pakistan 330; (4.2969% of full Arabic names) (0.0087% of total names) PK Palestine 147; (1.9141% of full Arabic names) (0.0039% of total names) PS Qatar 4; (0.0521% of full Arabic names) (0.0001% of total names) QA Saudi Arabia 121; (1.5755% of full Arabic names) (0.0032% of total names) SA Senegal 5; (0.0651% of full Arabic names) (0.0001% of total names) SN Syria 165; (2.1484% of full Arabic names) (0.0044% of total names) SY Tunisia 56; (0.7292% of full Arabic names) (0.0015% of total names) TN Turkey 6; (0.0781% of full Arabic names) (0.0002% of total names) TR Yemen 128; (1.6667% of full Arabic names) (0.0034% of total names) YE 6. [Distribution of partially Arabic names per country, one part marked 'P'] --hidden-- 7. [Distribution of Arabic name parts per country] Country Number;Percent Percent Common 64,970; (76.8912% of partial Arab) (1.7141% of total names) Emirates 568; (0.6722% of partial Arab) (0.015% of total names) Afghanistan 145; (0.1716% of partial Arab) (0.0038% of total names) Bangladesh 733; (0.8675% of partial Arab) (0.0193% of total names) Bahrain 316; (0.374% of partial Arab) (0.0083% of total names) Cameroon 280; (0.3314% of partial Arab) (0.0074% of total names) Algeria 2,983; (3.5303% of partial Arab) (0.0787% of total names) Egypt 4,748; (5.6192% of partial Arab) (0.1253% of total names) Iraq 8,166; (9.6644% of partial Arab) (0.2154% of total names) Iran 223; (0.2639% of partial Arab) (0.0059% of total names) Jordan 1,548; (1.832% of partial Arab) (0.0408% of total names) Kuwait 2,300; (2.722% of partial Arab) (0.0607% of total names) Lebanon 3,385; (4.0061% of partial Arab) (0.0893% of total names) Libya 640; (0.7574% of partial Arab) (0.0169% of total names) Morocco 160; (0.1894% of partial Arab) (0.0042% of total names) Mauritania 2,665; (3.154% of partial Arab) (0.0703% of total names) Oman 587; (0.6947% of partial Arab) (0.0155% of total names) Pakistan 1,498; (1.7729% of partial Arab) (0.0395% of total names) Palestine 2,401; (2.8416% of partial Arab) (0.0633% of total names) Qatar 58; (0.0686% of partial Arab) (0.0015% of total names) Saudi Arabia 1,298; (1.5362% of partial Arab) (0.0342% of total names) Sudan 10; (0.0118% of partial Arab) (0.0003% of total names) Senegal 165; (0.1953% of partial Arab) (0.0044% of total names) Syria 3,368; (3.986% of partial Arab) (0.0889% of total names) Tunisia 749; (0.8864% of partial Arab) (0.0198% of total names) Turkey 44; (0.0521% of partial Arab) (0.0012% of total names) Unknown 865; (1.0237% of partial Arab) (0.0228% of total names) Yemen 3,284; (3.8866% of partial Arab) (0.0866% of total names) 8. [Distribution of non-Arab Muslim names per country, marked 'N'] --hidden-- 9. [Distribution of name parts of Arabic origin per country, marked 'O'] --hidden-- 10. [ISO3166 country codes, 68 countries targeted] Code Country name AA Common AE Emirates AF Afghanistan AL Albania AM Armenia AZ Azerbaijan BA Bosnia and Herzegovina BD Bangladesh BF Burkina Faso BH Bahrain CF Central African Republic CI Côte d'Ivoire CM Cameroon DJ Djibouti DZ Algeria EG Egypt ER Eritrea ET Ethiopia ID Indonesia IL Israel IN India IQ Iraq IR Iran JO Jordan KE Kenya KH Cambodia KM Comoros KW Kuwait KZ Kazakhstan LB Lebanon LK Sri Lanka LR Liberia LY Libya MA Morocco MG Madagascar ML Mali MR Mauritania MT Malta MY Malaysia NE Niger NG Nigeria NP Nepal OM Oman PH Philippines PK Pakistan PS Palestine QA Qatar RW Rwanda SA Saudi Arabia SD Sudan SN Senegal SO Somalia SS South Sudan SY Syria TD Chad TH Thailand TJ Tajikistan TM Turkmenistan TN Tunisia TR Turkey TZ Tanzania UA Ukraine UG Uganda UZ Uzbekistan XX Unknown YE Yemen ZM Zambia ZW Zimbabwe 11. [Distribution of full Arabic names assigned to multiple countries, marked 'A'] Countries Number;Percent Percent AA 3,849; (50.1172% of full Arabic names) (0.1015% of total names) AA/AE/SA 1; (0.013% of full Arabic names) (0.0% of total names) AA/BH/AE 1; (0.013% of full Arabic names) (0.0% of total names) AA/PK/IR 3; (0.0391% of full Arabic names) (0.0001% of total names) AA/SA/IQ 1; (0.013% of full Arabic names) (0.0% of total names) AA/SA/YE 2; (0.026% of full Arabic names) (0.0001% of total names) AA/XX/DZ 1; (0.013% of full Arabic names) (0.0% of total names) AA/XX/EG 2; (0.026% of full Arabic names) (0.0001% of total names) AA/XX/IQ 2; (0.026% of full Arabic names) (0.0001% of total names) AE/IQ 2; (0.026% of full Arabic names) (0.0001% of total names) AF/AA/PK 2; (0.026% of full Arabic names) (0.0001% of total names) AF/AA/XX 2; (0.026% of full Arabic names) (0.0001% of total names) AF/PK 2; (0.026% of full Arabic names) (0.0001% of total names) BH/AA/IQ 1; (0.013% of full Arabic names) (0.0% of total names) BH/IQ 1; (0.013% of full Arabic names) (0.0% of total names) DZ/AA/MR 2; (0.026% of full Arabic names) (0.0001% of total names) DZ/MR 6; (0.0781% of full Arabic names) (0.0002% of total names) DZ/XX 2; (0.026% of full Arabic names) (0.0001% of total names) EG/AA/JO 1; (0.013% of full Arabic names) (0.0% of total names) EG/AA/PS 2; (0.026% of full Arabic names) (0.0001% of total names) EG/JO 4; (0.0521% of full Arabic names) (0.0001% of total names) EG/PS 5; (0.0651% of full Arabic names) (0.0001% of total names) IQ/AA/AE 1; (0.013% of full Arabic names) (0.0% of total names) IQ/AA/BH 3; (0.0391% of full Arabic names) (0.0001% of total names) IQ/AE 7; (0.0911% of full Arabic names) (0.0002% of total names) IQ/BH 3; (0.0391% of full Arabic names) (0.0001% of total names) IQ/SA 11; (0.1432% of full Arabic names) (0.0003% of total names) IQ/YE 6; (0.0781% of full Arabic names) (0.0002% of total names) JO/EG 3; (0.0391% of full Arabic names) (0.0001% of total names) JO/PS 2; (0.026% of full Arabic names) (0.0001% of total names) LB/AA/XX 1; (0.013% of full Arabic names) (0.0% of total names) LB/SY 3; (0.0391% of full Arabic names) (0.0001% of total names) LY/TN 1; (0.013% of full Arabic names) (0.0% of total names) MA/AA/DZ 2; (0.026% of full Arabic names) (0.0001% of total names) MA/XX 2; (0.026% of full Arabic names) (0.0001% of total names) MR/DZ 2; (0.026% of full Arabic names) (0.0001% of total names) MR/XX 3; (0.0391% of full Arabic names) (0.0001% of total names) OM/XX 2; (0.026% of full Arabic names) (0.0001% of total names) PK/IR 2; (0.026% of full Arabic names) (0.0001% of total names) PK/XX 3; (0.0391% of full Arabic names) (0.0001% of total names) PS/AA/JO 2; (0.026% of full Arabic names) (0.0001% of total names) PS/EG 8; (0.1042% of full Arabic names) (0.0002% of total names) PS/JO 2; (0.026% of full Arabic names) (0.0001% of total names) SA/AA/BH 1; (0.013% of full Arabic names) (0.0% of total names) SA/AE 5; (0.0651% of full Arabic names) (0.0001% of total names) SA/IQ 7; (0.0911% of full Arabic names) (0.0002% of total names) SA/YE 2; (0.026% of full Arabic names) (0.0001% of total names) SY/AA/LB 2; (0.026% of full Arabic names) (0.0001% of total names) SY/LB 5; (0.0651% of full Arabic names) (0.0001% of total names) TN/AA/LY 1; (0.013% of full Arabic names) (0.0% of total names) XX 61; (0.7943% of full Arabic names) (0.0016% of total names) XX/AA/EG 1; (0.013% of full Arabic names) (0.0% of total names) XX/AA/JO 1; (0.013% of full Arabic names) (0.0% of total names) XX/BH 1; (0.013% of full Arabic names) (0.0% of total names) XX/BH/AA 2; (0.026% of full Arabic names) (0.0001% of total names) XX/EG 5; (0.0651% of full Arabic names) (0.0001% of total names) XX/IQ 2; (0.026% of full Arabic names) (0.0001% of total names) XX/LB 3; (0.0391% of full Arabic names) (0.0001% of total names) XX/PS 1; (0.013% of full Arabic names) (0.0% of total names) XX/SA 1; (0.013% of full Arabic names) (0.0% of total names) YE/IQ/AA 1; (0.013% of full Arabic names) (0.0% of total names) YE/IQ/SA 2; (0.026% of full Arabic names) (0.0001% of total names) YE/SA 1; (0.013% of full Arabic names) (0.0% of total names) YE/XX/AA 4; (0.0521% of full Arabic names) (0.0001% of total names)