MAP – Text Decomposition

MAP – Textual Decomposition Analysis Module

MAP decomposes running text into a streaming sequence of individual words and punctuation. Brackets, parentheses, quotes, etc. are detached and occupy separate lines in the stream.

Items containing or terminating in a period are looked up in the system dictionary to determine whether they are abbreviations, i.e. whether the period is part of the spelling, or end-of-segment markers.

The dictionary has a comprehensive list of abbreviations. Each punctuation mark has its own dictionary entry (which can be viewed with DIC) with relevant JG categorical information attached.

Leave a Reply

Your email address will not be published.