In search of representativity in specialised corpora
Categorisation through collocation
Geoffrey Williams | Université de Bretagne Sud, France
In large reference corpora, representativeness is attempted through carefully selected sampling and sheer size. The situation is different with special language corpora, whose very nature limits their size. Their representativity is measured by reference to external selection criteria, generally following bibliographic classifications, which tend to be subjective. In order to overcome subjectivity in specialised corpora, a corpus-directed system of internal selection using lexical criteria is proposed. The aim is not to create rigid boundaries but to see clearly what is actually present in the corpus. The method adopted is demonstrated on a corpus consisting of research articles from specialised journals and conference proceedings in the field of plant biology. Restricted collocational networks are used to isolate prototypical groupings within the corpus. It is shown that audience is an important factor in strong and weak prototypical groupings in theme- and domain-specific corpora. Articles addressing domain specialists through a journal tend to be more central than those presented to a theme-specific discourse community through conference proceedings.
Keywords: specialised corpora, categorisation, collocation, prototypicality
Published online: 18 October 2002
https://doi.org/10.1075/ijcl.7.1.03wil