Slide 1 of 32
Notes:
Text is taken through a series of modules and broken down into the components that include sentences, phrases, lexical elements and tokens. Sets of sentences, such as paragraphs, sections of a MedLine abstract and the like are organized into paragraphs. All of these fit into an overarching structure called a Document.