Main Issues in NLP
NLP Processes
Why NLP processes?
Algorithm pipeline:
Pipeline scheduling
Reasons for limited effectiveness
Perfect effectiveness?
Process-related reasons for Limited effectiveness
Lack of training data:
Domain transfer of an approach
Error accumulation
Strategies to counter error accumulation
Joint inference algorithms
Pipeline extensions
Practical effectiveness tweaks
Reasons for limited efficiency
Ways to improve memory efficiency
Ways to improve run-time efficiency
Potential memory efficiency issues
Memory consumption in NLP
Storage of inputs and outputs
Storage of algorithms
Indexing of relevant information
Simpler algorithms
Filtering relevant portions of text
Optimal scheduling of pipelines
Intuition
Effects
Parallelization
Text analysis entails “natural” parallelization
Basic parallelization scenario
Homogeneous parallel system architecture
Discussion of the parallelization approaches
Analysis pipelining
Analysis parallelization
Pipeline duplication:
Schedule parallelization