Preface
1. The Basics
The Importance of Language Annotation
The Layers of Linguistic Description
What Is Natural Language Processing?
A Brief History of Corpus Linguistics
What Is a Corpus?
Early Use of Corpora
Corpora Today
Kinds of Annotation
Language Data and Machine Learning
Classification
Clustering
Structured Pattern Induction
The Annotation Development Cycle
Model the Phenomenon
Annotate with the Specification
Train and Test the Algorithms over the Corpus
Evaluate the Results
Revise the Model and Algorithms
Summary
2. Defining Your Goal and Dataset
3. Corpus Analytics
4. Building Your Model and Specification
5. Applying and Adopting Annotation Standards
6. Annotation and Adjudication
7. Training: Machine Learning
8. Testing and Evaluation
9. Revising and Reporting
10. Annotation: TimeML
11. Automatic Annotation: Generating TimeML
12. Afterword: The Future of Annotation
A. List of Available Corpora and Specifications
B. List of Software Resources
C. MAE User Guide
D. MAI User Guide
E. Bibliography
Index