MAPSSeman© Lite (PoS Tagger) Specifications

Overall Description

  • A context-sensitive Arabic tagger suited for big corpus Arabic text.
  • A professional productivity tool for tagging Arabic text; it is hybrid meaning it rather does semantic and syntactic analysis (ordinary taggers work on morphology level with heavy reliance on linguistic principles and lexical characteristcs); the system incorporates a solid lexical analyzer (stemmer) that prepares the text to the morphological analyzer which in turn works on morphotactical and contextual probabilities before tagging any token.
  • The system approaches the issue of part of speech tagging using techniques that go beyond the superficial "Linguistic Mechanics" and string manipulation such as stemming, tokenization, morphological analysis or any other classical techniqes leveraging the ideosyncratic meaning in a completely new technology.
  • Supports tagsets as specified by CG, Brill, Penn Treebank, CLAWS, Brown, LOB, Khoja.
  • Export output to JSON, XML, SQL, TXT, for processing purposes; HTML, XLS, PDF for viewing purposes.
  • This tool is designed to be used with large Arabic corpora, however, many simplified features are added to assist novice users making it ideal for use by academic purposes as well.
  • The system is equipped with a powerful tagset editor so users can edit the built-in tagsets or start compiling a completely new tagset of their own.
  • Arabic text transliteration (KATS) for readibilty for non-Arabic speakers.
  • Three Arabic varieties in the input (classic, MSA, colloquial).
  • Verbose dispay of tagged text include some 10 categories (root, clitics, tense, case, mood, voice, form, gloss, etc.)
  • Sliding window adjustment.
  • Over 30 multi-level ganuled highly descriptive NER tagset.
  • Joining identical entities.
  • Related entities extension.
  • lookup gazetteers.

Details (Click on the image to view enlarged version)
Corpus options Mini report generation Single catena entry Arabic varieties

Corpus option

mini report generation

single catena entry

input different Arabic varieties
Versatile corpus tuning options Mini report generation Dual entry option Input different Arabic varieties

Specification Description
Hardware platform x86, 32bit
Operating system Microsoft Windows 8, Windows 7, Vista, NT, XP, 2000
Hard disc free space 1GB minimum
Processor Pentium at 1GHz or higher
Main memory (RAM) 512MB or more
Display size 20 inch or wider

---- Other requirements ----

Web Browser Microsoft Explorer 7 or superior, Mozilla Firefox 3.6 or superior
   

  MAPSSeman© Lite (Tagger) Criteria
Tagging speed 1,000 (token/min) full options selected, 1GHz, 1GB RAM
     
Multi-user This software does not support multi-user environment.  

View interface Output view interface


Named Entity Recognition Arabic Named Entity Recognition


Home » MAPS » MAPS Semantics » MAPSSeman Lite (PoS Tagger) Specifications

Category Software | Reference MSLTAG | Family MAPSSEMANL | Last updated 19/12/2019