This is a text which surveys the growing field of research known as corpus annotation - an electronic collection of texts. Corpus annotation is a central resource in linguistics, information technology and the processing of human language. The book seeks to show the nature of language and the most effective means of analysing it. A bibliography lists relevant e-mail addresses and Web sites.
List of contributors Preface 1. Introducing corpus annotation Geoffrey Leech 2. Grammatical tagging Geoffrey Leech 3. Syntactic annotation: treebanks Geoffrey Leech and Elizabeth Eyes 4. Semantic annotation Andrew Wilson and Jenny Thomas 5. Discourse annotation: anaphoric relations in corpora Roger Garside, Steve Fligelstone and Simon Botley 6. Further levels of annotation Geoffrey Leech, Anthony McEnery and Martin Wynne 7. A hybrid grammatical tagger: CLAWS4 Roger Garside and Nicholas Smith 8. How to generalise the task of annotation Steve Fligelstone, Mike Pacey and Paul Rayson 9. Improving a tagger Nicholas Smith 10. Retageting a tagger Fernando Sánchez León and Amalio F.Nieto-Serrano 11. The use of syntactic annotation tools: partial and full parsing Jeremy Bateman, Jean Forrest, and Tim Willis 12. Higher-level annotation tools Roger Garside and Paul Rayson 13. A corpus/annotation toolbox Anthony McEnery and Paul Rayson 14. A corpus-based grammar tool Anthony McEnery, John Paul Baker and John Hutchinson 15. The exploitation of multilingual annotated corpora for term extraction Anthony McEnery, Jean-Marc Langé, Michael Oakes and Jean Véronis 16. Cross-linguistic guidelines for the annotation of corpora Peter Kahrel, Ruthanna Barnett and Geoffrey Leech 17. Consistency and accuracy in correcting automatically-tagged corpora John Paul Baker Appendix I: Sources for further information (WWW and e-mail addresses) Appendix II: Abbreviations and acronyms Appendix III: Specimen annotation practices: the C7 and C5 tagsets Bibliography Index
R.G. Garside, Geoffrey Leech, Anthony Mark Mcenery
Reviews for Corpus Annotation: Linguistic Information from Computer Text Corpora
A clear introduction to a number of important debates concerning autobiography and memory --Journal of British Studies