Good-Turing frequency estimation in a finite population.
Hwang, Wen-Han; Lin, Chih-Wei; Shen, Tsung-Jen.
Affiliation
  • Hwang WH; Institute of Statistics and Department of Applied Mathematics, National Chung Hsing University, Taichung 40227, Taiwan.
Biom J ; 57(2): 321-39, 2015 Mar.
Article in En | MEDLINE | ID: mdl-25394337
Good-Turing frequency estimation (Good, ) is a simple, effective method for predicting detection probabilities of objects of both observed and unobserved classes based on observed frequencies of classes in a sample. The method has been used widely in several disciplines, such as information retrieval, computational linguistics, text recognition, and ecological diversity estimation. Nevertheless, existing studies assume sampling with replacement or sampling from an infinite population, which might be inappropriate for many practical applications. In light of this limitation, this article presents a modification of the Good-Turing estimation method to account for finite population sampling. We provide three practical extensions of the modified method, and we examine performance of the modified method and its extensions in simulation experiments.
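For orientation, the classical estimator that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration of the standard unsmoothed Good-Turing formulas, which assume sampling with replacement or an infinite population. It is not the finite-population modification proposed in the article, whose formulas are not given in this abstract. The function name `good_turing` and the toy sample are assumptions made for illustration.

```python
from collections import Counter


def good_turing(sample):
    """Classical (unsmoothed) Good-Turing estimates for a list of class labels.

    Hypothetical helper for illustration; not the article's modified method.
    Returns (missing_mass, probs), where missing_mass is the estimated total
    probability of all unobserved classes and probs maps each observed class
    to its estimated detection probability.
    """
    n = len(sample)
    counts = Counter(sample)                 # class -> times observed (r)
    freq_of_freq = Counter(counts.values())  # r -> number of classes seen r times (f_r)

    # Estimated total probability of unobserved classes: f_1 / n.
    missing_mass = freq_of_freq.get(1, 0) / n

    # Adjusted count for a class seen r times: r* = (r + 1) * f_{r+1} / f_r.
    adjusted = {
        r: (r + 1) * freq_of_freq.get(r + 1, 0) / f_r
        for r, f_r in freq_of_freq.items()
    }

    # Estimated detection probability of each observed class: r* / n.
    probs = {cls: adjusted[r] / n for cls, r in counts.items()}
    return missing_mass, probs


if __name__ == "__main__":
    mass, probs = good_turing(list("aabbbcdde"))
    print("unobserved mass:", mass)
    print("per-class estimates:", probs)
```

In this unsmoothed form the estimates for observed classes and the unobserved mass always sum to one, but the most frequent class receives an estimate of zero because no class is seen one more time than it. Practical implementations therefore smooth the frequency-of-frequency counts first.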

Full text: 1 Database: MEDLINE Main subject: Statistics as Topic Language: En Journal: Biom J Year: 2015 Type: Article Affiliation country: Taiwan
