树库属于深加工语料库,是语料库语言学和自然语言处理技术发展到相对成熟阶段的产物。《计算语言学与语言科技原文丛书·树库:句法分析语料库的构建和使用(英文影印版)》主要讲述如何构建树库、如何使用树库,基本反映了近10年间树库研究的整体面貌,是树库研究发展到一定阶段的一个比较全面的总结,起到了承前启后的作用。
《树库:句法分析语料库的构建和使用(英文影印版)》主要论述在建立和使用树库过程中发现的一系列问题,如何处理不同语言的语料库,这些问题对语言学、计算语言学、自然语言、句法及语法的研究也有很大帮助。
导读
Preface
Introduction
Anne Abeillé
1 BUILDING TREEBANKS
2 USING TREEBANKS
Part Ⅰ BUILDING TREEBANKS
ENGLISH TREEBANKS
Chapter1 THE PENN TREEBANK:AN OVERVIEW
Ann Taylor, Mitchell Marcus, Beatrice Santorini
INTRODUTION
1 THE ANNOTATION SCHEMES
2 METHODOLOGY
3 CONCLUSIONS
Chapter2 THOUGHTS ON TWO DECADES OF DRAWING TREES
Geoffrey Sampson
1 HISTORICAL BACKGROUND
2 BUILDING TREEBANKS
3 EXPLOITING THE SUSANNE TREEBANK
4 SMALL IS BEAUTIFUL
5 ANNOTATING A SPOKEN CORPUS
6 USING THE CHRISTINE CORPUS
7 CONCLUSION
Chapter3 BANK OF ENGLISH AND BEYOND
Timo Jarvinen
1 INTRODUCTION
2 ANNOTATING 200 MILLION WORDS
3 ENGCG SYNTAX
4 FDG PARSER
5 CONCLUSION
Chapter4 COMPLETING PARSED CORPORA
Sean Wallis
1 INTRODUCTION
2 CONVENTIONAL POST-CORRECTION
3 A PARADIGM SHIFT: TRANSVERSE CORRECTION
4 CRITIQUE
GERMAN TREEBANKS
Chapter5 SYNTACTIC ANNOTATION OF A GERMAN NEWSPAPER CORPUS
Thorsten Brants, Wojciech Skut, Hans Uszkoreit
1 INTRODUCTION
2 TREEBANK DEVELOPMENT
3 CORPUS ANNOTATION
4 APPLICATIONS
5 CONCLUSIONS
Chapter6 ANNOTATION OF ERROR TYPES FOR A GERMAN
NEWSGROUP CORPUS
Markus Becker, Andrew Bredenkamp, Berthold Crysmann, Judith Klein
1 INTRODUCTION
2 CORPUS DESCRIPTION
3 ANNOTATION STRATEGY
4 ANNOTATION TOOLS
5 EVALUATION
6 FIRST RESULTS
7 CONCLUSION
SLAVIC TREEBANKS
……
Part Ⅱ USING TREEBANKS