Test 2 Flashcards
Chapters 8, 16, 17, 21 (142 cards)
Natural Language
Unfettered spoken or written language
- Primary means of human communication
Natural Language Processing (NLP)
The use of automated methods to represent the relevant information in text with high validity and reliability.
Patrick Suppes
- Pioneer in computerized learning
“…the challenge to psychological theory made by linguists to provide an adequate theory of language learning may well be regarded as the most significant intellectual challenge to theoretical psychology in this century.”
Bag-of-Words
A language model where text is represented as a collection of words, independent of each other and disregarding word order.
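A minimal sketch of the idea, assuming a simple lowercase, alphanumeric tokenizer (the function name `bag_of_words` and the sample sentences are illustrative, not from the source). Two sentences with the same words in a different order yield the same representation:

```python
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Count word occurrences, ignoring case, punctuation, and word order."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(tokens)


# Illustrative sentences: same words, different order.
a = bag_of_words("The patient denies chest pain.")
b = bag_of_words("Chest pain the patient denies.")
print(a == b)  # True: word order is not part of the representation
```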
Keyword
A word or phrase that conveys special meaning or refers to information that is relevant to such a meaning.
Machine Learning
A computer technique in which information learned from data is used to improve system performance.
NLP Text Processing
- Lexical: Tokenization, part of speech, head, lemma
- Parsing and Chunking
- Semantic Tagging: Semantic role, word sense
- Certain Expressions: Named entities
- Discourse: coreference, discourse segments
NLP Speech Processing
- Phonetic transcription
- Segmentation (punctuation)
- Prosody
Types of NLP: Information Extraction
Methods that process text to capture and organize specific information in the text and also to capture and organize specific relations between the pieces of information.
- Most common form in biomedicine.
Biosurveillance
A public health activity that monitors a population for occurrence of a rare disease or increased occurrence of a common one.
Named-entity Recognition
In language processing, a sub-task of information extraction that seeks to locate and classify atomic elements in text into predefined categories.
Named-entity Normalization
In NLP, the method of linking (normalizing) a named-entity mention, once it has been found in a document, to the appropriate database identifiers.
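Recognition and normalization can be sketched together with a dictionary lookup. This is only a toy illustration: the lexicon, the categories, and the `EX:` identifiers below are hypothetical placeholders, not entries from any real terminology, and practical systems typically use statistical or rule-based recognizers rather than exact string matching.

```python
import re

# Hypothetical toy lexicon: surface form -> (category, placeholder identifier).
# Synonyms share an identifier, which is the point of normalization.
LEXICON = {
    "heart attack": ("Disease", "EX:0001"),
    "myocardial infarction": ("Disease", "EX:0001"),
    "aspirin": ("Drug", "EX:0002"),
}


def find_entities(text):
    """Return (mention, category, identifier) tuples in order of appearance."""
    hits = []
    lowered = text.lower()
    for term, (category, ident) in LEXICON.items():
        for m in re.finditer(r"\b" + re.escape(term) + r"\b", lowered):
            # Recognition: locate the mention and classify it.
            # Normalization: attach the identifier shared by its synonyms.
            hits.append((m.start(), text[m.start():m.end()], category, ident))
    return [(mention, cat, ident) for _, mention, cat, ident in sorted(hits)]


print(find_entities("Aspirin was given after the heart attack."))
```

Because "heart attack" and "myocardial infarction" map to the same placeholder identifier, both mentions normalize to one entity even though their surface forms differ.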
Modifiers of Interest
In NLP, a term used to describe or otherwise modify a named-entity that has been recognized.
Relations Among Named Entities
A characterization of two entities in NLP with respect to the semantic nature of the relationship between them.
Reference Resolution
In NLP, recognizing that two mentions in two different textual locations refer to the same entity.
Question Answering (QA)
A computer-based process whereby a user submits a natural language question that is then automatically answered by returning a specific response.
Text Summarization
Takes one or several documents as input and produces a single, coherent text that synthesizes the main points of the input documents.
Text Generation
Methods that create coherent natural language text from structured data or from textual documents in order to satisfy a communication goal.
Machine Translation
Automatic mapping of text written in one natural language into text of another language.
Text Readability Assessment and Simplification
An application of NLP in which computational methods are used to assess the clarity of writing for a certain audience or to revise the exposition using simpler terminology and sentence construction.
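One common way to assess readability is the Flesch Reading Ease score, where higher values indicate easier text. The sketch below is an assumption-laden illustration, not the method the source describes: the syllable counter is a rough vowel-group heuristic, the sample sentences are invented, and the function expects non-empty text.

```python
import re


def count_syllables(word):
    """Rough heuristic: count groups of consecutive vowels, with a minimum of 1."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_reading_ease(text):
    """Flesch Reading Ease: higher scores indicate easier text.

    Assumes the text contains at least one sentence and one word.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    syllables = sum(count_syllables(w) for w in words)
    return (206.835
            - 1.015 * (len(words) / len(sentences))
            - 84.6 * (syllables / len(words)))


# Invented examples: a plain sentence and a more technical paraphrase.
plain = "The drug may hurt your heart."
clinical = "The medication may precipitate cardiovascular complications."
print(flesch_reading_ease(plain) > flesch_reading_ease(clinical))
```

A simplification step would then aim to rewrite the lower-scoring sentence toward the higher-scoring one, using shorter words and shorter sentences.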
Linguistic Steps in NLP: Morphology
The way words are built up from smaller, meaning-bearing units; the structure of words.
- Various forms of basic words
- Make more words from fewer.
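The idea of words built from smaller units can be sketched with a naive affix splitter. The prefix and suffix lists and the minimum stem length are hypothetical choices for illustration; a splitter this simple will mis-segment many real words (for example, it would strip "re" from "reading"), so it should be read as a picture of the concept rather than a usable morphological analyzer.

```python
# Hypothetical, deliberately tiny affix lists for illustration only.
PREFIXES = ("un", "re")
SUFFIXES = ("ness", "ing", "ed", "s")
MIN_STEM = 3  # assumed minimum number of letters that must remain after stripping


def segment(word):
    """Split a word into [prefix, stem, suffix], omitting parts that are absent."""
    w = word.lower()
    prefix = next(
        (p for p in PREFIXES if w.startswith(p) and len(w) - len(p) >= MIN_STEM), ""
    )
    w = w[len(prefix):]
    suffix = next(
        (s for s in SUFFIXES if w.endswith(s) and len(w) - len(s) >= MIN_STEM), ""
    )
    stem = w[:len(w) - len(suffix)]
    return [part for part in (prefix, stem, suffix) if part]


print(segment("unkindness"))  # ['un', 'kind', 'ness']
print(segment("treated"))     # ['treat', 'ed']
```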
Linguistic Steps in NLP: Syntax
How words are put together to form correct sentences and what structural role each word has.
- Syntax tree assigned by grammar or lexicon.
Linguistic Steps in NLP: Semantics
What words mean and how these meanings combine in sentences to form sentence meanings.
Linguistic Steps in NLP: Pragmatics
How sentences are used in different situations and how use affects the interpretation of the sentence.