Natural Language Processing for Online Applications
Text retrieval, extraction and categorization
This title has been replaced by:
Natural Language Processing for Online Applications: Text retrieval, extraction and categorization. Second revised edition, Peter Jackson and Isabelle Moulinier (2007)
Hardbound – Replaced by new edition
ISBN 9789027249883 (Eur)
ISBN 9781588112491 (USA)
Paperback – Replaced by new edition
ISBN 9789027249890 (Eur)
ISBN 9781588112507 (USA)
Netlibrary e-Book – Replaced by new edition
ISBN 9780585462530
This text covers the emerging technologies of document retrieval, information extraction, and text categorization in a way that highlights commonalities in terms of both general principles and practical issues. It seeks to satisfy a need on the part of technology practitioners in the Internet space, who face difficult decisions about what research has been done and what the best practices are. It is not intended as a vendor guide (such things are quickly out of date), or as a recipe for building applications (such recipes are very context-dependent). But it does identify the key technologies, the issues involved, and the strengths and weaknesses of the various approaches. There is also an emphasis on evaluation in every chapter, both in terms of methodology (how to evaluate) and what controlled experimentation and industrial experience have to tell us.
[Natural Language Processing, 5 (1st)] 2002. x, 226 pp.
Publishing status: Obsolete
© John Benjamins Publishing Company
Table of Contents
Preface | p. ix
1. Natural language processing | p. 1
2. Document retrieval | p. 23
3. Information extraction | p. 75
4. Text categorization | p. 119
5. Towards text mining | p. 173
Index | p. 219
“In general, the book is a very good, concise reference book filled with many theoretical principles and practical guidelines. I recommend this book to anyone who wants to build applications related to text retrieval, information extraction and categorization.”
Zhongdong Zhang, Novator Systems Ltd., Toronto, on Linguist List Vol-14-226, 2003
“The authors had the good idea of not making this book a vendor guide but rather an overview of methodologies and technologies available and the evaluation criteria for the techniques described. I do not believe their goal was to publish a detailed overview but an introduction to the various technologies available. In that regard, the book is very successful and I much appreciate it because key concepts are clearly outlined, which makes it easier to follow the authors through the more complex parts of the book. I would recommend it to anyone who is interested in NLP and its applications to the new challenges brought out by the arrival of the information age.”
Patrick Drouin, University of Montreal, in Terminology 10:1, 2004
“In my view, the book is very practical: certainly, since it is pretty comprehensible and does not go into too profound details, it could serve well as a textbook for an introductory course. However, the book is not intended exclusively as an academic text. It is also aimed at software engineers, project managers, and technology executives who want or need to understand the technology at some level. I think that such people may find it useful, and that it may provoke ideas, discussions, and action in the field of applied research and development.”
Martin Holub, in The Prague Bulletin of Mathematical Linguistics Vol. 83, 2005
“Some special features of the book include solid coverage of evaluation techniques in every chapter, excellent endnotes, and references to exactly the right stuff. However, the most salient feature of this book is the clear and cogent writing. It reads much like a series of well-written review articles and is actually enjoyable to read while not skimping at all on technical detail.”
K. Bretonnel Cohen, University of Colorado, in Language 80(1), 2004
Cited by 99 other publications
Belete, Mequanent Degu, Ayodeji Olalekan Salau, Girma Kassa Alitasb & Tigist Bezabh
Ibañez, Marilyn Minicucci, Reinaldo Roberto Rosa & Lamartine Nogueira Frutuoso Guimarães
Nnamoko, Nonso, Themis Karaminis, Jack Procter, Joseph Barrowclough & Ioannis Korkontzelos
Jbel, Mouad, Imad Hafidi & Abdelmoutalib Metrane
Miok, Kristian, Padraig Corcoran & Irena Spasić
Nuccio, Massimiliano & Sofia Mogno
Ibañez, Marilyn Minicucci, Reinaldo Roberto Rosa & Lamartine Nogueira Frutuoso Guimarães
Ibañez, Marilyn Minicucci, Reinaldo Roberto Rosa & Lamartine Nogueira Frutuoso Guimarães
Nakayama, Minoru, Kouichi Mutsuura & Hiroh Yamamoto
Funkner, Anastasia A. & Sergey V. Kovalchuk
Loutsaris, Michalis Avgerinos & Yannis Charalabidis
Rivolli, Adriano, Jesse Read, Carlos Soares, Bernhard Pfahringer & André C. P. L. F. de Carvalho
Cosh, Kenneth, Sakgasit Ramingwong, Narissara Eiamkanitchat & Lachana Ramingwong
Rahab, Hichem, Abdelhafid Zitouni & Mahieddine Djoudi
Rivolli, Adriano, Carlos Soares & André C. P. L. F. de Carvalho
Rivolli, Adriano, Carlos Soares & André C. P. L. F. de Carvalho
Banerjee, Binayak, Tania Sarkar, Pratap Chakraborty & Alok Ranjan Pal
Chadha, Sanchit, Antuan Byalik, Eli Tilevich & Alla Rozovskaya
Nabhan, Rabih Joseph
Rivolli, Adriano, Larissa C. Parker & André C. P. L. F. de Carvalho
Cohen, Kevin Bretonnel, Benjamin Glass, Hansel M. Greiner, Katherine Holland-Bouley, Shannon Standridge, Ravindra Arya, Robert Faist, Diego Morita, Francesco Mangano, Brian Connolly, Tracy Glauser & John Pestian
Indu, M & K V Kavitha
Byalik, Antuan, Sanchit Chadha & Eli Tilevich
Byalik, Antuan, Sanchit Chadha & Eli Tilevich
Li, Simon, Kamrun Nahar & Benjamin C. M. Fung
Tahir, Muhammad Atif, Emdad Khan & Adel Al Salem
Pablo, Zelinna Cynthia, Nathaniel Oco, Ma. Divina Gracia Roldan, Charibeth Cheng & Rachel Edita Roxas
Amolochitis, Emmanouil, Ioannis T. Christou, Zheng-Hua Tan & Ramjee Prasad
Lesmo, Leonardo, Alessandro Mazzei, Monica Palmirani & Daniele P. Radicioni
Perea-Ortega, José M., Arturo Montejo-Ráez, M. Teresa Martín-Valdivia & L. Alfonso Ureña-López
Perea-Ortega, José M., Arturo Montejo-Ráez, M. Teresa Martín-Valdivia & L. Alfonso Ureña-López
Anchieta, Rafael T., Rogerio F. de Sousa & Raimundo S. Moura
Kimbrough, Steven O., Thomas Y. Lee & Ulku Oktem
Leopold, Henrik, Sergey Smirnov & Jan Mendling
Pestian, John P., Pawel Matykiewicz, Michelle Linn-Gust, Brett South, Ozlem Uzuner, Jan Wiebe, K. Bretonnel Cohen, John Hurdle & Christopher Brew
Wnuk, Krzysztof, Martin Höst & Björn Regnell
Boella, Marco, Francesca Romana Romani, Anjela Al-Raies, Cristina Solimando & Giuliano Lancioni
Hou, Wen-Juan & Jia-Hao Tsao
Kim, KyungTae, Sungahn Ko, Niklas Elmqvist & David S. Ebert
Rattanyu, Kanlaya & Makoto Mizukawa
Mala, Piotr
Spangler, W. S., J. T. Kreulen, Y. Chen, L. Proctor, A. Alba, A. Lelescu & A. Behal
Wu, Qin, Eddie Fuller & Cun-Quan Zhang
Chen, Ying, Scott Spangler, Jeffrey Kreulen, Stephen Boyer, Thomas D. Griffin, Alfredo Alba, Amit Behal, Bin He, Linda Kato, Ana Lelescu, Cheryl Kieliszewski, Xian Wu & Li Zhang
Irmak, Utku, Vadim von Brzeski & Reiner Kraft
Larson, Martha, Eamonn Newman & Gareth J. F. Jones
MacFarlane, Katrinna & Violeta Holmes
Nau, Dana S.
Norouzzadeh, Mohammad S., Ayoub Bagheri & Mohammad H. Saraee
Alonso, Omar, Premkumar T. Devanbu & Michael Gertz
Gallo, Ignazio & Elisabetta Binaghi
Motta, Eduardo, Alexandre Andreatta & Sean Siqueira
Nakayama, Minoru & Yosiyuki Takahasi
Spangler, Scott, Larry Proctor & Ying Chen
Stasko, John, Carsten Görg & Zhicheng Liu
Valette, Mathieu & Monique Slodzian
Antunes, Bruno, Nuno Seco & Paulo Gomes
Behal, Amit, Ying Chen, Cheryl Kieliszewski, Ana Lelescu, Bin He, Jie Cui, Jeffrey Kreulen, James Rhodes & W. Scott Spangler
Caporaso, J. Gregory, William A. Baumgartner, David A. Randolph, K. Bretonnel Cohen & Lawrence Hunter
Jankowski, Andrzej & Andrzej Skowron
Jo, Taeho & Malrey Lee
Netisopakul, Ponrudee & Norapan Siriumpunkul
Rasmussen, Steen, Diana Mangalagiu, Hans Ziock, Johan Bollen & Gordon Keating
Soni, Ankit, Nees Jan van Eck & Uzay Kaymak
Stasko, John, Carsten Gorg, Zhicheng Liu & Kanupriya Singhal
Voloshynovska, Iryna & Nadiya Andreychuk
von Brzeski, Vadim, Utku Irmak & Reiner Kraft
Meng, Xian-Jun, Qing-Cai Chen, Xiao-Long Wang & Xiao-Hong Yang
Banville, Debra L.
Banville, Debra L.
Conrad, Jack G. & Cindy P. Schriber
Fosdick, Howard
Hunter, Lawrence & K. Bretonnel Cohen
Jackson, P. & F. Schilder
Mikeal, Adam, Cody Green, Alexey Maslov, Scott Phillips & John Leggett
Morioka, Nobuyuki & Ashesh Mahidadia
Radovanović, Miloš & Mirjana Ivanović
van Diggelen, Jurriaan, Robbert-Jan Beun, Frank Dignum, Rogier M. van Eijk & John-Jules Meyer
Wang, Xiaoting, Peng Zhu, Giovanni Felici & Evangelos Triantaphyllou
Zhang, Dell & Wee Sun Lee
Chaudiron, Stéphane
Dale, R., Li Lei, H. de Vries, M. Gardiner & M. Tilbrook
Davies, John, Marko Grobelnik & Dunja Mladenić
Natarajan, J., D. Berrar, C. J. Hack & W. Dubitzky
Natt och Dag, Johan & Vincenzo Gervasi
Paz-Trillo, Christian, Renata Wassermann & Paula P. Braga
Saric, F., J. Snajder, B.D. Basic & H. Eklic
Schulze‐Kremer, Steffen & Barry Smith
Wang, Wei, Diep Bich Do & Xuemin Lin
Dag, J.N., V. Gervasi, S. Brinkkemper & B. Regnell
Dale, Robert, Rafael Calvo & Marc Tilbrook
Hartley, James, Eric Sotto & Claire Fox
Gao, Xiangzhu, San Murugesan & B. Lo
Dale, Robert, Cecile Paris & Marc Tilbrook
Jackson, Peter, Khalid Al-Kofahi, Alex Tyrrell & Arun Vachher
Mladenić, Dunja & Marko Grobelnik
Portscher, Edwin, James Geller & Richard Scherl
[no author supplied]
This list is based on CrossRef data as of 20 September 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
Subjects
Main BIC Subject
CF: Linguistics
Main BISAC Subject
LAN009000: LANGUAGE ARTS & DISCIPLINES / Linguistics / General