Article published in:Analyse Lexicale et Syntaxique: Le système INTEX
Edited by Cédrick Fairon
[Lingvisticæ Investigationes 22:1/2] 1999
► pp. 327–340
Parsing a web site as a corpus
GlossaNet is an automated system that monitors Web sites. On dates and at intervals selected by the user, GlossaNet downloads the Web site, converts it to an electronic corpus and uses the intex programs (M. Silberztein 1993) and the linguistic resources of the ladl (electronic dictionaries and libraries of local grammars) to parse it. Once the software has been set up, it automatically repeats the task at regular periods of time (as the Web site is updated). Results, if any, are e-mailed to the user.
Published online: 03 October 2000