Natural Language Processing for Online Applications
Text retrieval, extraction and categorization
Second revised edition
This text covers the technologies of document retrieval, information extraction, and text categorization in a way that highlights their commonalities in both general principles and practical concerns. It assumes some mathematical background on the part of the reader, but the chapters typically begin with a non-mathematical account of the key issues. Current research topics are covered only to the extent that they inform current applications; detailed coverage of longer-term research and more theoretical treatments should be sought elsewhere. Many pointers at the ends of the chapters let the reader explore the literature. Throughout, the book maintains a strong emphasis on evaluation, covering both methodology and the results of controlled experimentation in every chapter.
This title replaces:
Natural Language Processing for Online Applications: Text retrieval, extraction and categorization, Peter Jackson and Isabelle Moulinier (2002)
[Natural Language Processing, 5] 2007. x, 232 pp.
Publishing status: Available
Published online on 1 July 2008
© John Benjamins Publishing Company
Table of Contents
Preface to the 2nd edition | p. ix
Chapter 1. Natural language processing | p. 1
1.1 What is NLP?
1.2 NLP and linguistics
1.3 Linguistic tools
1.4 Plan of the book
Chapter 2. Document retrieval | p. 23
2.1 Information retrieval
2.2 Indexing technology
2.3 Query processing
2.4 Evaluating search engines
2.5 Attempts to enhance search performance
2.6 The future of Web searching
Chapter 3. Information extraction | p. 69
3.1 The message understanding conferences
3.2 Regular expressions
3.3 Finite automata in FASTUS
3.4 Context-free grammars
3.5 Limitations of current technology and future research
3.6 Summary of information extraction
Chapter 4. Text categorization | p. 113
4.1 Overview of categorization tasks
4.2 Handcrafted rule based methods
4.3 Inductive learning for text classification
4.4 Nearest neighbor algorithms
4.5 Combining classifiers
4.6 Evaluation of text categorization systems
Chapter 5. Text mining | p. 163
5.1 What is text mining?
5.2 Resolving reference and coreference
5.3 Automatic summarization
5.4 Testing of automatic summarization programs
5.5 Prospects for text mining and NLP
Index | p. 227
Cited by 74 other publications (CrossRef data as of 25 October 2024)
Subjects
Main BIC Subject
UYQL: Natural language & machine translation
Main BISAC Subject
COM042000: COMPUTERS / Natural Language Processing