Corpus Work at HCRC
An overview is given of work on the creation, collection, preparation, and publication of electronic corpora of written and spoken language undertaken at the Human Communication Research Centre at the Universities of Edinburgh and Glasgow. Four major efforts are described: the HCRC Map Task Corpus, the ECI/MC1, the MLCC project and work on document architectures and processing regimes for SGML-encoded corpora.