Jump to content

Strong and weak sampling

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by OAbot (talk | contribs) at 19:27, 30 December 2020 (Open access bot: doi added to citation with #oabot.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Strong and weak sampling are two sampling approach[1] in Statistics, and are popular in computational cognitive science and language learning[2]. In strong sampling, it is assumed that the data are intentionally generated as positive examples of a concept[3], while in weak sampling, it is assumed that the data are generated without any restrictions.[4]

Formal Definition

In strong sampling, we assume observation is randomly sampled from the true hypothesis:

In weak sampling, we assume observations randomly sampled and then classified:

Consequence: Posterior computation under Weak Sampling

Therefore the likelihood will be "ignored".

References

  1. ^ Xu, Fei. "Bayesian word learning Sensitivity to sampling in Bayesian word learning" (PDF). MIT. Developmental Science.
  2. ^ Hsu, Anne. "Sampling assumptions in language learning 1 Running head: SAMPLING ASSUMPTIONS IN LANGUAGE LEARNING Sampling assumptions affect use of indirect negative evidence in language learning". ResearchGate.
  3. ^ Navarro, Danielle. "Lecture 20: Strong vs weak sampling" (PDF). Computational Cognitive Science.
  4. ^ Navarro, Daniel (2012). "Sampling assumptions in inductive generalization". Cognitive Science. 36 (2): 187–223. doi:10.1111/j.1551-6709.2011.01212.x. PMID 22141440.