mecㆍviewer v1.4 :: lecture6-nn.pdf

T&C LAB-AI

Robotics

Neural Network

Lecture 3

Jeong-Yean Yang

2020/10/22

T&C LAB-AI

Neural Network
Basic Questions Before Learning it

T&C LAB-AI

Robotics

We Learn Regression Model

Question: How we do regression for Next Data?

• It is NOT a linear and It is NOT a squared function
• It looks like a sine function but it is NOT.
• How we do? 

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Multiple Lines?

T&C LAB-AI

Robotics

If we divide Three Ranges,

We use Regression

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

( 1)

( 2)

( 3)

(

) ||

(

) ||

(

) ||

N R

i R

a x



















Who Determines Three or More Region?

Three Regions are Correct?

???

T&C LAB-AI

Robotics

How can we do it?

T&C LAB-AI

Robotics

If we add all lines, is it good?

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

(

) ||

(

) ||

(

) ||

i R

a x



















T&C LAB-AI

Robotics

The Sum of Lines is Always a Line

• Given condition)

– Region,R =R1+R2+R3
– We cannot determine Proper Region, R1, R2, and R3.
– We must use R.

• Then, if we use ax+b for every Region, R,

• Thus, we need another Nonlinear function for

( 1)

( 2)

( 3)

(

) ||

(

) ||

(

) ||

N R

i R

a x



















i R

a x b









 

 





ˆy

T&C LAB-AI

Robotics

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Some Function shapes like,

• Linear function cannot satisfy specific Region
• We Must use every Region  Function must not be Line

 Non linear function

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7



T&C LAB-AI

Robotics

Some function shapes like

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7



• Non linear functions are the candidate for Neural Network
• Even sine or cosine functions works.

T&C LAB-AI

Robotics

Some function shapes like

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7



• Triangle curves also work
• It is also non linear  Remind linearity condition

– Triangle curve is NOT equal that av1+bv2 = v

T&C LAB-AI

Robotics

Remind Boundary Decision

with Linear Function requires Sign func.

• Sign function is also a nonlinear function Phi,

( )

(

)

sign ax b

 







T&C LAB-AI

Robotics

Which Nonlinear Functions are

the Best for Kernel?

• No answer, Definitely

-5

-4

-3

-2

-1

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

exp

RBF

Radial Basis Function













 

























-5

-4

-3

-2

-1

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Sigmoidal

Function



 



-5

-4

-3

-2

-1

0.5

1.5

2.5

3.5

4.5

(Rectifier linear Unit)

ax x

LU Function





  





T&C LAB-AI

Robotics

Define Kernel Function

• These functions are called Kernel function, K

• All Input data are thought as the results of
• Is it good for every cases?

– 1. In most cases, the results are bad.

One kernel function cannot satisfy all possible cases

– 2. Thus, we use many kernel functions  Modern Learnings

K(X, Z)

(

( )

)

(Z)







( ) :

function is a dot product of map data

X Input vector

Nonlinear function

Kernel



( )



linear

T&C LAB-AI

Robotics

Kernel Function K or

• Definition of Kernel function is a dot product of

– But in many books kernel function K and nonlinear function

Pi are in mixed usages.

• Basic of Kernel trick

– K is a dot product of Pi’s, thus K is Scalar function

– Nonlinear function Pi, simplifies high dimensional input

into unknown feature space

– Through Non linear Pi, Kernel can simplifies and linearize a

given problem.



Scalar

   



T&C LAB-AI

Kernel Functions are thought as
Nonlinear Regression

T&C LAB-AI

Robotics

Kernel Function as Regression

• Use Some function like Gaussian function

• Gaussian function( Probabilistic Density Function)

• Radial-basis function

( )

exp

( , )

pdf x







 





































2

( )

exp

( )

RBF x





























T&C LAB-AI

Robotics

Regression with RBF 1

• Objective Function, J is,

ax b













( )

exp





























































 



( )

exp









 























 





































T&C LAB-AI

Robotics

Tip for operation like (x-b)^2

• X=linspace(s,t,number)

– Ex) x= linspace(0,5,6) =[ 0,1,2,3,4,5]

• Matrix multiplication has two types.

• Matlab has two operations

– Matrix = A*B Hadarmard Product = A.*B

• loop.sys also has two operations

– Matrix = A*B Hadarmard Product = A.mul(B)

A B





 







 







 





 







 





 



General Matrix

multiplication

Hadamard Product

T&C LAB-AI

Robotics

ex/ml/l6rbf1









( )

exp















































( )



 







 



 





exp

( )























 







































 



T&C LAB-AI

Robotics

Ex/ml/l6rbf1

Blue: y=0.5x+0.3

Red: RBF function

• c is the initial center of RBF.
• The result differs with various Initial guess of C.

– Guess C = 0, 0.5, 1, and so on.

ERRORS!

T&C LAB-AI

Robotics

Regression with RBF 2

• Objective Function, J is,

ax b













( )

exp



















 















































( )









 













 







T&C LAB-AI

Robotics

Ex/ml/l6rbf2

• Result Comparison

– Use guess, c= 0

• Weight, w works for the better estimation!!

l6rbf1

l6rbf2

T&C LAB-AI

Robotics

Regression with RBF 3

• Objective Function, J is,

ax b













( )

exp



















 

























































( )

(

)

w w









 









 













 







T&C LAB-AI

Robotics

Ex/ml/l6rbf3









( )

exp



















 

























































( )

w w





















 













 





T&C LAB-AI

Robotics

Comparison

between ex/ml/l6rbf2 and ex/ml/l6rbf3

• Result Comparison

– Use guess, c= 1

• Both weight, w and bias w0 work for the better

estimation.

l6rbf2

l6rbf3

T&C LAB-AI

Robotics

Definition of Weight and Bias Parameter

• Only Kernel function

• Use Weight, w

• Use Bias, w0



2

( )











2

( )















2

( )











T&C LAB-AI

Neural Network
Equation

T&C LAB-AI

Robotics

Weight

• Non linear function Phi, strengthens some value

in every input space

• Weight is a linear combination of Phi,



( )



( )



( )





(wX)



linear

nonliear

T&C LAB-AI

Robotics

Weight

• Linear weight
• Nonlinear weight

• Linear weight can be simplified as in Homogeneous

Transform

( )

 

  

(

)

 





( )











 

 



















(X )

(X)

(X )









  

   

 











  

   

 





  

   

 







Example

T&C LAB-AI

Robotics

Define Discrimination Function

• Input X
• Output Y:

– During learning, output of is NOT well learned
 Approximation of Y =

• Weight : our goal parameter

– like a and b in linear regression

• Cost function, J is a function of Error

i R









( )

 

( )

  





 





 

Minimization

Gradient Descent Method

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Data(xi,yi)

T&C LAB-AI

Robotics

Neural Network Layer

• Input=X and Output= Y
• More Layers are used for creation of hyper space

0.2

0.1 1

0.6





 



  

 



 



  

 



 



 

  

 



 

  

 



  

 







Linear



(Z)



( )

(

)

W X









Z: Hidden Layer

6 3



3 6



T&C LAB-AI

Robotics

Hidden Layers maps

from Lower to Higher level

• Remind

– Face detection by

Haar feature

• Stage 0 maps

dominant feature

• Stage 21 maps

features in more
detailed ways

• Some stages(or

layers) seem
meaningless

 Output is meaningful

Raw image

Detailed Feature

T&C LAB-AI

Robotics

Build Space Shuttle

Your resources, Money, time, labor, and so on

math

cycle

motor

control

Program

turbine

Physics

…

Aviation Manufact-

uring

Fever

Space shuttle

Hidden or Middle layers can

be more than input or output

T&C LAB-AI

Robotics

Sigmoidal Function-based Neural Network

-5

-4

-3

-2

-1

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9



2

Sigmoidal

(

)

Function



 







 

  













   

(

)

W X

W Z







(

)

(

) 0

(

)

) X

(

) 0

(

) (

)

(

)

i D

J W W

J dw

y W

W X

y Z











 

 























 









 























 

 

 













T&C LAB-AI

Background of
Vector and matrix Differentiation

T&C LAB-AI

Robotics

Differentiation of Neural Network

• Remind Differentiation for Gradient Descent Method

(

) 0

(

)

) X

i D

y W

W X































 









 



Vector

Matrix

• Question:

“Differentiation Vector with Matrix”, Is it Possible?

It looks like

Matrix Multiplication

T&C LAB-AI

Robotics

Differentiation Scalar, Vector, Matrix

Scalar

Vector

Matrix

Scalar

Vector

Matrix




ˆy











ˆy

ˆx

• Unfortunately,

Differentiation Matrix by Matrix is Impossible

T&C LAB-AI

Robotics

Differentiation of Matrix 1

• Lemma 1

• Proof

m n n

A x













a x





(0 0 ..

.. 0 0)

a x







   

  







for all i=1...m and j=1... n













T&C LAB-AI

Robotics

Differentiation of Matrix 2

• Lemma 2

• Lemma 3

m n n

A x





y x

x z



 











m n n

A x





if c

y Ax

y y

x A y



y Ax

y A











y Ax









x A y

x A









T&C LAB-AI

Robotics

Differentiation of Matrix 3

• Lemma 4

• Proof

x Ax



x A











a x x

a x

















T&C LAB-AI

Robotics

Differentiation of Matrix 4

• Lemma 5

• Proof

( ),

( )

y z x

x z

y x

scalar

















c y

c x

y z

x z



 







 











y x

x y



T&C LAB-AI

Robotics

Differentiation of Matrix 5

• Lemma 6

• Lemma7

x x

scalar







y Ax



c y

c x

y z

x z

y A

x A

y A



 







 

















T&C LAB-AI

Robotics

Hadamard Product

• Matrix Multiplication ( what you have learned)

• Hadamard Product (Matlab  A.*B)

m a a n

m n

A B





11 11

12 12

21 21

22 22

...

m n

A B

a b





 





 





 







 





 





 





...

m n

mn mn

a b













 













T&C LAB-AI

Robotics

Neural Network Multiplication Problems

• You cannot differentiate NN directly

(

) 0

(

)

) X

i D

y W

W X





























 







i j

k k

A B

a b







i j

ij ij

A B

a b





Matrix Multiplication

Hadamard Multiplication

Neural Network

Multiplication

T&C LAB-AI

Neural Network Weight Update

T&C LAB-AI

Robotics

Weight Update by Gradient Descent Method

is the key for Neural Network

But, Differentiation is Complex

We learn Basic Structure first

Extend above Neural Network Structure

T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





([

]

)

[

]

I W

 


Network Model by Matrix Expression I

Neural Network

( )

 

x y

Data

e 





T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





Network Model by Matrix Expression I

( )

 

x y

Data

e 







Learn

T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





Network Model by Matrix Expression I

( )

 



[x I]

w11

w12

w13

w14

w15

w21

w22

w23

w24

w25

w11
+w21

w12
+w22

w13
+w23

w14
+w24

w15
+w25

2w11
+w21

2w12
+w22

2w13
+w23

2w14
+w24

2w15
+w25

3w11
+w21

3w12
+w22

3w13
+w23

3w14
+w24

3w15
+w25

4w11
+w21

4w12
+w22

4w13
+w23

4w14
+w24

4w15
+w25

T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





Network Model by Matrix Expression I

( )

 



w11
+w21

w12
+w22

w13
+w23

w14
+w24

w15
+w25

2w11
+w21

2w12
+w22

2w13
+w23

2w14
+w24

2w15
+w25

3w11
+w21

3w12
+w22

3w13
+w23

3w14
+w24

3w15
+w25

4w11
+w21

4w12
+w22

4w13
+w23

4w14
+w24

4w15
+w25













 













T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





Network Model by Matrix Expression I

( )

 



w11
+w21

w12
+w22

w13
+w23

w14
+w24

w15
+w25

2w11
+w21

2w12
+w22

2w13
+w23

2w14
+w24

2w15
+w25

3w11
+w21

3w12
+w22

3w13
+w23

3w14
+w24

3w15
+w25

4w11
+w21

4w12
+w22

4w13
+w23

4w14
+w24

4w15
+w25



























w2,1

w2,2

w2,3

w2,4

w2,5

w2,6

T&C LAB-AI

Robotics

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





Network Model by Matrix Expression I

( )

 



2,1

2,2

2,3

2,4

2,5

2,6

2,1

2,2

2,3

2,4

2,5

2,6

2,1

2,2

(

)

(

)

(

)

(

)

(

)

(







 



 



 



 







 



 



 



 







 



 

2,3

2,4

2,5

2,6

2,1

2,2

2,3

2,4

2,5

2,6

)



















 



 















 



 



 



 







T&C LAB-AI

Robotics

Differentiation with W2

e e















(

1) 1

[

]

[

]

[

]

[

]

I W

Transpose

Vector



 



 





 













 











Lemma

x x









T&C LAB-AI

Robotics

Differentiation with W1

2,1

2,2

2,3

2,4

2,5

2,6

2,1

2,2

2,3

(

)

(

)

(

)

(

)

(

)

(

)











 







 



 



 



 



  







 



 



 

 



2,4

2,5

2,6

2,1

2,2

2,3

2,4

2,5

2,6

2,1

2,2

2,3

2,4

)



 









 



 



 



 



 







 



 



 



 



2,5

2,6

)





T&C LAB-AI

Robotics

Differentiation with W1

2,1

(

)

(

)

'(2

'(3











 



















 





  



 



 



2,1

'(4



 



Can you find the PATTERN?

T&C LAB-AI

Robotics

Differentiation with W1

2, j

(

)

(

)

'(2

'(3











 



















 





  



 



 



2, j

'(4

(

)

(

)

'(2



 











 



















 





  



 



2, j

)

'(3

)

'(4

)



 



 



T&C LAB-AI

Robotics

Differentiation with W1

2, j

(

)

'(2

'(3

'(4

)

(

)

x w











 



  



 



 



 





 















2, j

)

'(2

)

'(3

)

'(4

)

x w







  



 



 



 





 





T&C LAB-AI

Robotics

Differentiation with W1

















[

)

k k

x e

x w

x e w

x eW

x w









 



  

 















 









 















 









 











Hadamard

multiplication

i j

k k

A B

a b







i j

ij ij

A B

a b





T&C LAB-AI

Robotics

Neural Network Example









[

]

[

]

e e

Matrix

Vector











 





 





n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





( )

 



T&C LAB-AI

Robotics

Python Example: l3sig

n x h

(h+1)x 1



[

]





2 h



n h



(

[

]

 

(

1) 1

 

n 1





( )

 



[

]





T&C LAB-AI

Robotics

Python Example: l3sig

Blue: y

Red: Y(est)

J during 2000 iterations

Jittering?

Why?

You can see

from l3sig.py

T&C LAB-AI

Robotics

If we increase Hidden Space?

Hidden Space Increases Estimation Performance

• h=20  h=200
• What happens?

Hidden layer increases the DOF of Estimation Results