@mShuaiZhao 2018-01-31T12:56:06.000000Z 字数 4107 阅读 1027

Week03. Statistics with R Part01

Coursera 2018.01

Defining Probability

random process

In a random process we know what outcomes could happen, but we don't know which particular outcome will happen.
probability
- $P(A)$ = probability of event A
- There are several possible interpretations of probability but they (almost) completely agree on the mathematical rules probability must follow:
  
  $0 \le P(A) \le 1$
frequentist interpretation

The probability of an outcome is the proportion of times the outcome would occur if we observed the random process an infinite number of times.
bayesian interpretation
- A Bayesian interprets probability as a subjective degree of belief.
- Largely popularized by revolutionary advance in computational technology and methods during the last twenty years.
law of large numbers

die n.骰子
- 大数定律
  law of large numbers states that as more observations are collected, the proportion of occurrences with a particular outcome converges to the probability of that outcome.
- 赌徒谬论
  The common misunderstanding of the law of large numbers is that random processes are supposed to compensate for whatever happened in the past. This is called the gambler's fallacy, or the law of averages.

disjoint (mutually exclusive)

互斥事件
- disjoint (mutually exclusive) events cannot happen at the same time.
  - the outcome of a single coin toss cannot be a head and a tail.
  - a student con't both fail and pass a class.
  - a single card drawn from a deck cannot be an ace and a queen
- non-disjoint events can happen at the same time
  - a student can get an A in Stats and A in Econ in the same semester
union of disjoint events

互斥事件的组合

For disjoint events $A$ and $B$ ,

$P(A \ or \ B) = P(A) + P(B)$
union of non-disjoint events

For non-disjoint events $A$ and $B$ ,

$P(A \ or \ B) = P(A) + P(B) - P(A \ and \ B)$
general addition rule
sample space

a sample space is a collection of all possible outcomes of a trial
probability distributions
概率分布
- a probability distribution lists all possible outcomes in the sample space, and the probabilities with which they occur.
- rules
  1. the events listed must be disjoint
  2. each probability must be between 0 and 1
  3. the probabilities must total 1
complementary events
互补事件

complementary events are two mutually exclusive events whose probabilities add up to 1.
disjoint vs. complementary

independence

two processes are independent if knowing the outcome of one provides no useful information about the outcome of the other.
checking for independence:

$P(A | B) = P(A)$ , then $A$ and $B$ are independent.
practice
determining dependence based on sample data
Product rule for independent events:

If $A$ and $B$ are independent, $P(A and B) = P(A) \times P(B)$

见课件pdf吧.

image_1c50ptg5thohscrhh71rg2akm3b.png-336.6kB

一枚硬币，抛一次，正面朝上和反面朝上是互斥但相关的事件；
抛两次，两次的结果之间是不相关的。

条件概率

study
marginal

边缘概率，因为在contingency table的边缘。
joint

联合概率，关键词是and
conditional

条件概率
- 利用贝叶斯定理(Bayes' theorem)来计算
Product rule for independent events:

If $A$ and $B$ are independent, $P(A \ and \ B) = P(A) \times P(B)$
independence and conditional probabilities

Generically, if $P(A|B) = P(A)$ then the events A and B are said to be independent.
- Conceptually: Giving B doesn’t tell us anything about A.
- Mathematically: If events A and B are independent, $P(A \ and \ B) = P(A) × P(B)$ . Then,
  
  $\begin{align*} P(A | B) =\frac{P(A \ and \ B)}{P(B)}=\frac{P(A) \times P(B)}{P(B)} = P(A) \end{align*}$

image_1c51968vhc8nveqsr49qgva1g.png-830.5kB

image_1c51971cpuqn1qg01t2rpv8m7a1t.png-997.7kB