@shaobaobaoer 2020-04-11T10:37:44.000000Z

A brief Note about the 4th chapter of 《NN & DL》


ref: Explaining and Harnessing Adversarial Examples (Goodfellow et al., 2015)
http://neuralnetworksanddeeplearning.com/chap4.html

I ran into some difficulty while writing my diploma thesis. I have learned a lot about adversarial example (A.E.) methods, but I still find it hard to explain why adversarial examples work at all.

I also noticed gaps in my background knowledge when reading the paper on FGSM. Its author is a very prestigious researcher, Goodfellow, the father of GAN.

A visual proof that neural nets can compute any function

One of the most striking facts about neural networks is that they can compute any function at all. That is, suppose someone hands you some complicated, wiggly function, $f(x)$:


$f(x) = 0.2 + 0.4x^2 + 0.3x\sin(15x) + 0.05\cos(50x)$

The function may be much more complex, or may have multiple inputs and outputs. In that case we can collect the inputs and outputs into vectors and treat the function as a mapping in a higher-dimensional space, $f(x_1, x_2, \ldots)$.

To sum up, this result tells us that neural networks have a kind of universality.

The universality theorem is well known by people who use neural networks. But why it's true is not so widely understood.

Two caveats

When you use the statement "a NN can compute any function", you should keep two caveats in mind.

First, it doesn't mean a NN can compute any function exactly. It means a NN can approximate the function as closely as we want. How does the NN achieve this? By increasing the number of hidden neurons.

Second, the functions being approximated must be continuous. If a function is discontinuous, i.e. it makes sudden, sharp jumps (for example a step function), then it won't in general be possible to approximate it with a NN. In practice, though, when we meet such functions, a continuous approximation computed by a NN is often good enough.
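The first caveat can be stated a little more precisely. The symbols $g$ (the network output) and $\epsilon$ (the desired accuracy) are my own notation here: for a continuous $f(x)$ and any $\epsilon > 0$, a network with enough hidden neurons satisfies

$$
|g(x) - f(x)| < \epsilon \quad \text{for every input } x \text{ in the range of interest.}
$$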

One input, one output

We start with a very simple NN: it has one hidden layer, and its hidden neurons use the sigmoid function.


If we increase the weight, the curve becomes much steeper, while if we change the bias, the location of the inflection point moves.

Let's see what happens when the weight equals 999 and the bias equals -400.

[Figure: high_weight_function.jpg]

Remarkably, the resulting curve looks almost like a discontinuous step function, yet it is just the continuous function $\sigma(999x - 400)$.

Another useful quantity is the parameter $s = -b/w$. Mathematically, it tells us where the inflection point (the step) is. For the weight 999 and bias -400 used above, $s = 400/999 \approx 0.4$.
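The effect of the weight and bias can be checked numerically. Below is a minimal sketch in plain Python, assuming a single sigmoid hidden neuron with output $\sigma(wx + b)$. The helper names `hidden_output` and `step_position` are made up for illustration.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function: avoids overflow in exp for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def hidden_output(x, w, b):
    # Output of one sigmoid hidden neuron with weight w and bias b.
    return sigmoid(w * x + b)

def step_position(w, b):
    # s = -b / w: the input at which the neuron output crosses 0.5.
    return -b / w

w, b = 999.0, -400.0
s = step_position(w, b)
for x in (0.30, 0.39, s, 0.41, 0.50):
    print(f"x={x:.4f}  output={hidden_output(x, w, b):.6f}")
```

With such a large weight the output is close to 0 just left of $s$ and close to 1 just right of it, which is why the curve looks like a step.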

Now we can make the curve a bit more complex. What happens when we use two hidden neurons, each with its own group of parameters?


We find that the curve now has additional inflection points, because the output consists of two parts, $w_1 a_1 + w_2 a_2$.

However, this is still not enough, because the curve is monotonically increasing. If we want part of the curve to decrease, we can make one of the weights negative.
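A negative weight is what lets two step-like neurons form a "bump". The sketch below assumes two sigmoid hidden neurons with steps at `s1` and `s2` and output weights `h` and `-h`. The function name `bump` and the default steepness `w=1000.0` are my own choices for illustration.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def bump(x, s1, s2, h, w=1000.0):
    """Two-neuron bump: roughly h for s1 < x < s2, roughly 0 elsewhere."""
    a1 = sigmoid(w * (x - s1))   # steps up at s1 (bias b = -w * s1)
    a2 = sigmoid(w * (x - s2))   # steps up at s2 (bias b = -w * s2)
    return h * a1 + (-h) * a2    # output weights w1 = h, w2 = -h

for x in (0.2, 0.5, 0.8):
    print(x, bump(x, s1=0.4, s2=0.6, h=0.8))
```

The first neuron raises the output by `h` at `s1`, and the second one, through its negative weight, brings it back down at `s2`.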

So as we add more hidden neurons, the curve gains more inflection points and gets closer and closer to the given function $f(x)$.
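To see the approximation improve as neurons are added, here is a rough sketch that sums many bumps on $[0, 1]$. It makes some simplifying assumptions of mine: the output neuron is linear (no final sigmoid), each bump's height is the value of the target at the midpoint of its interval, and the steepness `w=1e5` is arbitrary. The target is the wiggly $f(x)$ from the beginning of this note.

```python
import math

def f(x):
    # The wiggly target function from the start of the note.
    return 0.2 + 0.4 * x**2 + 0.3 * x * math.sin(15 * x) + 0.05 * math.cos(50 * x)

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def approx(x, n_bumps, w=1e5):
    """Sum of n_bumps bumps on [0, 1]; this uses 2 * n_bumps hidden neurons."""
    total = 0.0
    width = 1.0 / n_bumps
    for i in range(n_bumps):
        s1, s2 = i * width, (i + 1) * width
        h = f((s1 + s2) / 2)  # bump height = target value at the interval midpoint
        total += h * sigmoid(w * (x - s1)) - h * sigmoid(w * (x - s2))
    return total

def max_error(n_bumps, n_samples=1000):
    # Sample strictly inside (0, 1); at exactly 0 and 1 the outermost steps
    # are only half-way up, so those two points are left out.
    xs = [(k + 0.5) / n_samples for k in range(n_samples)]
    return max(abs(approx(x, n_bumps) - f(x)) for x in xs)

for n in (5, 20, 80):
    print(n, "bumps -> max error", round(max_error(n), 4))
```

The printed maximum error shrinks as the number of bumps grows, which is the "just add more hidden neurons" idea in numbers.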


Multi-input situation

So if we have multiple inputs (e.g. 2), what happens to our graph?


If we could build tower functions (functions that are nonzero only over a small region of the input plane), then we could use them to approximate arbitrary functions, just by adding up many towers of different heights and in different locations.
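One way to build a tower for two inputs is sketched below. It follows the idea of combining a bump in the $x$ direction with a bump in the $y$ direction and then thresholding the sum with one more sigmoid neuron. The function name `tower`, the steepness `w`, and the threshold scale `h` are illustrative assumptions of mine.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def tower(x, y, x1, x2, y1, y2, w=1000.0, h=50.0):
    """Roughly 1 inside the rectangle (x1, x2) x (y1, y2), roughly 0 outside."""
    # First hidden layer: one step up and one step down per input direction.
    bx = sigmoid(w * (x - x1)) - sigmoid(w * (x - x2))  # ~1 when x1 < x < x2
    by = sigmoid(w * (y - y1)) - sigmoid(w * (y - y2))  # ~1 when y1 < y < y2
    # Second-layer neuron with weights h and bias -3h/2: it switches on only
    # when both bumps are on (2h - 1.5h > 0, while 1h - 1.5h < 0).
    return sigmoid(h * (bx + by) - 1.5 * h)

print(tower(0.5, 0.5, 0.4, 0.6, 0.4, 0.6))  # inside the rectangle
print(tower(0.5, 0.9, 0.4, 0.6, 0.4, 0.6))  # inside in x only
print(tower(0.9, 0.9, 0.4, 0.6, 0.4, 0.6))  # outside in both directions
```

Multiplying each tower by a height and summing towers over a grid of rectangles would then give a piecewise-constant approximation of a two-input function.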


Why do adversarial examples exist?

Szegedy:2013

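The highly nonlinear nature of neural networks leads to the existence of adversarial examples, together with overfitting caused by insufficient model averaging and insufficient regularization in purely supervised learning models.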

Explain: X

Goodfellow:2015

Linear behavior in high-dimensional spaces is the real cause of adversarial examples.

Explain: FGSM
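The linear argument can be tried on a toy model. FGSM perturbs the input by $\eta = \epsilon \cdot \mathrm{sign}(\nabla_x J(\theta, x, y))$. For a linear score $w^\top x + b$ this shifts the score by $\epsilon \lVert w \rVert_1$, which grows with the input dimension even though each coordinate changes by only $\epsilon$. The sketch below assumes a logistic regression model with cross-entropy loss. The weights, the bias, and the helper names are made up by me for illustration and are not taken from the paper's experiments.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def sign(v):
    # Returns -1, 0 or 1.
    return (v > 0) - (v < 0)

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def fgsm_logistic(x, y, w, b, eps):
    """One FGSM step for logistic regression, label y in {0, 1}.

    For cross-entropy loss the gradient w.r.t. the input is
    (sigmoid(w.x + b) - y) * w.
    """
    p = sigmoid(dot(w, x) + b)
    grad = [(p - y) * wi for wi in w]
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]

def logit_after_attack(n, eps=0.01):
    # Hypothetical toy model: weights of +1 / -1, clean logit fixed at 2.0.
    w = [1.0 if i % 2 == 0 else -1.0 for i in range(n)]
    x = [0.0] * n
    b = 2.0
    x_adv = fgsm_logistic(x, 1, w, b, eps)
    return dot(w, x_adv) + b

for n in (10, 100, 1000):
    print(n, "dimensions -> logit after attack:", logit_after_attack(n))
```

In this toy setup the same per-coordinate $\epsilon = 0.01$ barely moves the logit in 10 dimensions but pushes it below zero in 1000 dimensions, flipping the predicted class.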