Statistical Significance Testing for Natural Language Processing

Statistical Significance Testing for Natural Language Processing
Author :
Publisher : Springer Nature
Total Pages : 98
Release :
ISBN-10 : 9783031021749
ISBN-13 : 3031021746
Rating : 4/5 (49 Downloads)

Book Synopsis Statistical Significance Testing for Natural Language Processing by : Rotem Dror

Download or read book Statistical Significance Testing for Natural Language Processing written by Rotem Dror and published by Springer Nature. This book was released on 2022-06-01 with total page 98 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data-driven experimental analysis has become the main evaluation tool of Natural Language Processing (NLP) algorithms. In fact, in the last decade, it has become rare to see an NLP paper, particularly one that proposes a new algorithm, that does not include extensive experimental analysis, and the number of involved tasks, datasets, domains, and languages is constantly growing. This emphasis on empirical results highlights the role of statistical significance testing in NLP research: If we, as a community, rely on empirical evaluation to validate our hypotheses and reveal the correct language processing mechanisms, we better be sure that our results are not coincidental. The goal of this book is to discuss the main aspects of statistical significance testing in NLP. Our guiding assumption throughout the book is that the basic question NLP researchers and engineers deal with is whether or not one algorithm can be considered better than another one. This question drives the field forward as it allows the constant progress of developing better technology for language processing challenges. In practice, researchers and engineers would like to draw the right conclusion from a limited set of experiments, and this conclusion should hold for other experiments with datasets they do not have at their disposal or that they cannot perform due to limited time and resources. The book hence discusses the opportunities and challenges in using statistical significance testing in NLP, from the point of view of experimental comparison between two algorithms. We cover topics such as choosing an appropriate significance test for the major NLP tasks, dealing with the unique aspects of significance testing for non-convex deep neural networks, accounting for a large number of comparisons between two NLP algorithms in a statistically valid manner (multiple hypothesis testing), and, finally, the unique challenges yielded by the nature of the data and practices of the field.


Statistical Significance Testing for Natural Language Processing Related Books

Statistical Significance Testing for Natural Language Processing
Language: en
Pages: 98
Authors: Rotem Dror
Categories: Computers
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

DOWNLOAD EBOOK

Data-driven experimental analysis has become the main evaluation tool of Natural Language Processing (NLP) algorithms. In fact, in the last decade, it has becom
Foundations of Statistical Natural Language Processing
Language: en
Pages: 719
Authors: Christopher Manning
Categories: Language Arts & Disciplines
Type: BOOK - Published: 1999-05-28 - Publisher: MIT Press

DOWNLOAD EBOOK

Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction
Validity, Reliability, and Significance
Language: en
Pages: 179
Authors: Stefan Riezler
Categories:
Type: BOOK - Published: - Publisher: Springer Nature

DOWNLOAD EBOOK

Biomedical Natural Language Processing
Language: en
Pages: 174
Authors: Kevin Bretonnel Cohen
Categories: Computers
Type: BOOK - Published: 2014-02-15 - Publisher: John Benjamins Publishing Company

DOWNLOAD EBOOK

Biomedical Natural Language Processing is a comprehensive tour through the classic and current work in the field. It discusses all subjects from both a rule-bas
Pretrained Transformers for Text Ranking
Language: en
Pages: 307
Authors: Jimmy Lin
Categories: Computers
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

DOWNLOAD EBOOK

The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in response to a query. Although the most common formulation of text ra