Coursera Machine Learning Foundations, Week 4 Notes

Posted on 2019-04-02 in Study Notes

Feasibility of Learning

Learning is Impossible?

in-sample → out-of-sample

In a big sample (N large), the sample fraction v is probably close to the population fraction u (within ϵ): ℙ[|v−u|>ϵ] ≤ 2exp(−2ϵ²N). This is called Hoeffding’s Inequality, and it applies to marbles, coin flips, and polling.

The statement v = u is probably approximately correct (PAC):

valid for all N and ϵ
does not depend on u, so there is no need to know u
a larger sample size N or a looser gap ϵ → higher probability that v ≈ u

If N is large, we can probably infer the unknown u from the known v.
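As a rough check of this claim, the sketch below draws many samples of marbles and compares how often v misses u by more than ϵ with the Hoeffding bound. The values of `u`, `n`, `eps`, and the number of trials are hypothetical choices for illustration, not taken from the lecture.

```python
import math
import random


def hoeffding_bound(eps, n):
    """Right-hand side of Hoeffding's inequality: 2 * exp(-2 * eps^2 * N)."""
    return 2.0 * math.exp(-2.0 * eps ** 2 * n)


def violation_rate(u, n, eps, trials, rng):
    """Fraction of trials in which the sample fraction v misses u by more than eps."""
    bad = 0
    for _ in range(trials):
        # Draw N marbles; each one is "orange" with probability u.
        v = sum(rng.random() < u for _ in range(n)) / n
        if abs(v - u) > eps:
            bad += 1
    return bad / trials


rng = random.Random(0)  # fixed seed so the run is repeatable
u, n, eps = 0.4, 100, 0.1  # hypothetical bin fraction, sample size, tolerance
rate = violation_rate(u, n, eps, trials=2000, rng=rng)
bound = hoeffding_bound(eps, n)
print(f"empirical P[|v-u|>eps] = {rate:.4f}, Hoeffding bound = {bound:.4f}")
```

The empirical rate should sit well below the bound, since Hoeffding is a worst-case guarantee that holds for every u.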

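For any fixed h, we can probably infer the unknown $E_{out}(h)=\underset{x\sim P}{\mathcal{E}}[h(x)\ne f(x)]$ from the known $E_{in}(h)=\frac{1}{N}\sum_{n=1}^{N}[h(x_n)\ne f(x_n)]$, where $[\cdot]$ is 1 when the condition holds and 0 otherwise.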
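To make the two error measures concrete, here is a minimal sketch with a made-up target `f`, a made-up fixed hypothesis `h`, and P taken to be uniform on [0, 1]. All three are assumptions for illustration. E_in is the mistake rate on the N sampled points, and E_out is approximated by the mistake rate on a much larger fresh draw.

```python
import random


def f(x):
    """Hypothetical target: +1 to the right of 0.5, -1 otherwise."""
    return 1 if x > 0.5 else -1


def h(x):
    """Hypothetical fixed hypothesis: a slightly misplaced threshold at 0.6."""
    return 1 if x > 0.6 else -1


def error_rate(hyp, target, xs):
    """Average of the indicator [hyp(x) != target(x)] over the points xs."""
    return sum(hyp(x) != target(x) for x in xs) / len(xs)


rng = random.Random(1)
sample = [rng.random() for _ in range(200)]      # N = 200 points, x ~ Uniform[0, 1]
fresh = [rng.random() for _ in range(100_000)]   # large fresh draw to approximate E_out

e_in = error_rate(h, f, sample)
e_out_estimate = error_rate(h, f, fresh)  # the true E_out is 0.1 for this setup
print(e_in, e_out_estimate)
```

In this setup h and f disagree only on the interval (0.5, 0.6], so the true E_out is 0.1, and E_in on 200 points should land near it.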

If E_in(h) ≈ E_out(h) and $E_{in}(h)$ is small ⇒ $E_{out}(h)$ is small ⇒ $h \approx f$ with respect to P.

If E_in(h) is small for the fixed h and A picks that h as g → g = f is PAC.

If A is forced to pick THE h as g → E_in(h) is almost always not small → g ≠ f is PAC.

real learning:

A shall make choices $\in \mathbb{H}$ (like PLA) rather than being forced to pick one h.

BAD sample: $E_{out}=\frac{1}{2}$, but getting all heads ($E_{in}=0$)
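How likely is such a BAD sample once there are many coins to choose from? The sketch below computes the chance that at least one fair coin out of several comes up heads on every flip. The numbers 150 coins and 5 flips are illustrative assumptions.

```python
def prob_some_coin_all_heads(num_coins, num_flips):
    """P[at least one of num_coins fair coins shows heads on all num_flips flips]."""
    p_one = 0.5 ** num_flips  # probability that a single coin gets all heads
    return 1.0 - (1.0 - p_one) ** num_coins


print(prob_some_coin_all_heads(1, 5))    # a single coin: 1/32
print(prob_some_coin_all_heads(150, 5))  # 150 coins: above 0.99
```

A BAD sample is rare for one coin, but with many coins it becomes almost certain that some coin looks perfect in-sample, which is why freedom of choice makes things worse.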

BAD data for one h: E_out(h) and E_in(h) are far apart

BAD data for many h ⇔ no freedom of choice by A ⇔ there exists some h such that E_out(h) and E_in(h) are far apart

If |ℍ| = M is finite and N is large enough, then for whatever g is picked by A, E_out(g) ≈ E_in(g).
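The reasoning behind this is a union bound over the M hypotheses. A sketch, assuming Hoeffding's inequality holds for each fixed $h_m$ and writing BAD for "E_in and E_out differ by more than ϵ":

$$
\begin{aligned}
\mathbb{P}[\text{BAD data for some } h \in \mathbb{H}]
&\le \sum_{m=1}^{M} \mathbb{P}\big[\,|E_{in}(h_m)-E_{out}(h_m)|>\epsilon\,\big] \\
&\le 2M\exp(-2\epsilon^{2}N)
\end{aligned}
$$

Because the data is then probably not BAD for any h, it is in particular not BAD for the g that A picks, so for finite M the right-hand side shrinks to 0 as N grows.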

If A finds one g with E_in(g) ≈ 0, there is a PAC guarantee that E_out(g) ≈ 0 ⇒ learning is possible.
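To get a feel for "N large enough", the sketch below solves the finite-hypothesis bound 2M·exp(−2ϵ²N) ≤ δ for N. The function names and the values of `m`, `eps`, and `delta` are hypothetical and chosen only for illustration.

```python
import math


def samples_needed(m, eps, delta):
    """Smallest N with 2 * M * exp(-2 * eps^2 * N) <= delta."""
    return math.ceil(math.log(2.0 * m / delta) / (2.0 * eps ** 2))


def bad_probability_bound(m, eps, n):
    """Union-bound form of Hoeffding's inequality for M hypotheses."""
    return 2.0 * m * math.exp(-2.0 * eps ** 2 * n)


m, eps, delta = 100, 0.1, 0.05  # hypothetical values
n = samples_needed(m, eps, delta)
print(n, bad_probability_bound(m, eps, n))
```

The required N grows only logarithmically in M, which is why a finite hypothesis set of moderate size is still learnable with a reasonable amount of data.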

M=∞? - see you in the next lectures~

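The homework was really hard. It took me an hour and a half to barely pass, especially the last few problems, which require implementing PLA and the pocket algorithm.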