Generative vs. Discriminative Deep Belief Netwok for ...

1 downloads 0 Views 2MB Size Report
Mar 1, 2017 - GDBN model. 1. The probabilistic generative model;. Presenter: Haris Ahmad Khan — Mar 01th. , 2017. Generative vs. Discriminative DBN.
Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VISAPP 2017

Generative vs. Discriminative Deep Belief Netwok for 3D Object Categorization Nabila Zrira*, Mohamed Hannat*, El Houssine Bouyakhf* and Haris Ahmad Khan** *LIMIARF, Mohammed V University, Faculty of Sciences Rabat, Morocco **NTNU, Norwegian University of Science and Technology, Gjøvik, Norway

01 March 2017 Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

1/29

Outline

1

Introduction

2

Method Overview

3

3D Object Categorization

4

Experimental Results

5

Conclusion and Future Work

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Context 3D Object categorization

RGB-Depth images

Kinect Camera

Point Cloud

PCL descriptors

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Neural Network

Generative vs. Discriminative DBN

3/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Problematic

How to ensure a good 3D object categorization approach in real-world object environment?

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

4/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Contribution

A new 3D object categorization pipeline based on VFH and GDBN/DDBN;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

5/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Contribution

A new 3D object categorization pipeline based on VFH and GDBN/DDBN; The comparison of GDBN and DDBN architectures;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

5/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Contribution

A new 3D object categorization pipeline based on VFH and GDBN/DDBN; The comparison of GDBN and DDBN architectures; The use of CD, PCD and FEPCD sampling.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

5/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Our proposed approach

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

6/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VFH descriptor

Global descriptor

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

7/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VFH descriptor

Global descriptor Computed for a whole cluster of object

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

7/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VFH descriptor

Global descriptor Computed for a whole cluster of object Consists of two components: 1 2

a surface shape component; a viewpoint direction component.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

7/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VFH descriptor

Global descriptor Computed for a whole cluster of object Consists of two components: 1 2

a surface shape component; a viewpoint direction component.

VFH size is equal to 308 bins

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

7/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

VFH descriptor: example

(a)

(b)

Figure : (a) 3D point cloud of food box. (b) VFH descriptor of food box point cloud.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

8/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Restricted Boltzmann Machines (RBMs) RBM model 1 First layer: contains visible units (x); 2

Second layer: contains hidden units (h);

Figure : Generative RBM model.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

9/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Restricted Boltzmann Machines (RBMs)

The energy function of an RBM is defined as: 0

0

0

E(x, h) = −b x − c h − h W x

(1)

where: W represents the symmetric interaction term between x and h; b and c are vectors that store the visible (input) and hidden biases (respectively).

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

10/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

GDBN model 1 The probabilistic generative model;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

11/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

GDBN model 1 The probabilistic generative model; 2

Training sequence of RBMs;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

11/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

GDBN model 1 The probabilistic generative model; 2

Training sequence of RBMs;

3

DBN with l hidden layers trains l RBMs;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

11/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

GDBN model 1 The probabilistic generative model; 2

Training sequence of RBMs;

3

DBN with l hidden layers trains l RBMs;

4

Layer-wise algorithm;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

11/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

GDBN model 1 The probabilistic generative model; 2

Training sequence of RBMs;

3

DBN with l hidden layers trains l RBMs;

4

Layer-wise algorithm;

5

”Fine-tuning” the resulting weights of all layers together.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

11/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Generative Deep Belief Network (GDBN)

Figure : Generative DBN architecture (GDBN).

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

12/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Restricted Boltzmann Machines (DRBMs)

DRBM model DRBM trains a density model by means of a particular RBM consisting of two sets of visible elements (inputs x = (x1 , ..., xd ) and the target y).

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

13/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Restricted Boltzmann Machines (DRBMs) 0

0

0

0

0

E(y, x, h) = −h W x − b x − c h − d ~y − h U ~y

(2)

with Θ = (W, b, c, d, U ) is the set of parameters and ~y = (1y=i )ci=1 for C classes.

Figure : Discriminative RBM model (DRBM). Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

14/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Deep Belief Network (DDBN)

DDBN model 1 DRBM in the last layer is used as a classifier;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

15/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Deep Belief Network (DDBN)

DDBN model 1 DRBM in the last layer is used as a classifier; 2

DDBN trains a joint density model through DRBM;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

15/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Deep Belief Network (DDBN)

DDBN model 1 DRBM in the last layer is used as a classifier; 2

DDBN trains a joint density model through DRBM;

3

The label which contains the least energy is selected;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

15/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Deep Belief Network (DDBN)

DDBN model 1 DRBM in the last layer is used as a classifier; 2

DDBN trains a joint density model through DRBM;

3

The label which contains the least energy is selected;

4

The back-propagation technique for optimal classification.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

15/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Discriminative Deep Belief Network (DDBN)

Figure : Discriminative DBN architecture (DDBN).

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

16/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: CD Contrastive Divergence (CD) - Start with a training vector on the visible units;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

17/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: CD Contrastive Divergence (CD) - Start with a training vector on the visible units; - Update all the hidden units;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

17/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: CD Contrastive Divergence (CD) - Start with a training vector on the visible units; - Update all the hidden units; - Update all the visible units;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

17/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: CD Contrastive Divergence (CD) -

Start with a training vector on the visible units; Update all the hidden units; Update all the visible units; Update the hidden units again.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

17/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: PCD

Persistent Contrastive Divergence (PCD) - PCD method is proposed so that only the last chain state is used in the preceding update step.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

18/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: FEPCD

Free Energy in Persistent Contrastive Divergence (FEPCD) - A standard for the goodness of chain;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

19/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: FEPCD

Free Energy in Persistent Contrastive Divergence (FEPCD) - A standard for the goodness of chain; - Free energy is used as a measure to acquire best samples;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

19/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Training in GRBM/DRBM: FEPCD

Free Energy in Persistent Contrastive Divergence (FEPCD) - A standard for the goodness of chain; - Free energy is used as a measure to acquire best samples; - Precisely calculate the gradient of log probability from training data.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

19/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Washington RGBD Dataset

A large 3D dataset; A collection of 300 common household objects; Objects are organized into 51 categories; All object views using Kinect camera; 640x480 RGB and depth images at 30 Hz.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

20/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Washington RGBD Dataset

Figure : A sample of selected point clouds from Washington RGBD dataset. Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

21/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

GDBN/DDBN characteristics

Characteristics Hidden layers Hidden layer units Learn rates Epochs Input layer units

Value 3 300-300-1500 0.3 200 size of VFH descriptor (308)

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

22/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Mean squared normalized error

n

1X ˆ M SE = (Yi − Yi )2 n

(3)

i=1

With: Yˆi is a vector of n predictions Yi is the vector of observed values.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

23/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Mean squared normalized error Best Training Performance is 0.013614 at epoch 200

Best Training Performance is 0.10789 at epoch 200

Best Training Performance is 0.013379 at epoch 200

0

10

Train Best

Mean Squared Error (mse)

Mean Squared Error (mse)

Train Best

Mean Squared Error (mse)

0

10

0

10

−1

10

Train Best

−1

10

−1

10

−2

−2

10 0

50

100

150

200

10 0

50

200 Epochs

(CD-GDBN)

150

200

0

−2

Best Training Performance is 0.0052535 at epoch 200 0

100

150

200

−2

10

−1

10

−2

10

−3

10 0

50

200 Epochs

(CD-DDBN)

Train Best

Mean Squared Error (mse)

−1

10

10 50

200

10

−3

10

150

Train Best

Mean Squared Error (mse)

−1

10

100

(FEPCD-GDBN)

Best Training Performance is 0.0095101 at epoch 200 0

10 Train Best

0

50

200 Epochs

(PCD-GDBN)

Best Training Performance is 0.065007 at epoch 200 0

10

Mean Squared Error (mse)

100

200 Epochs

100

150

200 Epochs

(PCD-DDBN)

Presenter: Haris Ahmad Khan — Mar 01th , 2017

200

0

50

100

150

200

200 Epochs

(FEPCD-DDBN)

Generative vs. Discriminative DBN

24/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Classification performance Error GDBN-CD GDBN-PCD GDBN-FEPCD

DDBN-CD DDBN-PCD DDBN-FEPCD

0.6549 0.0250 0.0206

Acc. 34.51% 97.5% 97.9%

Before

After

Acc.

0.3810 0.4491 0.4053

0.3155 0.0201 0.0111

68.45% 97.98% 98.89%

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

25/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Comparison to other methods Methods CNN [1] RGBD dataset [2] Depth Kernel [3] DDBN-CD DDBN-PCD DDBN-FEPCD

Accuracy rates 89.4% 64.7% 78.8% 68.45% 97.98% 98.89%

M. Schwarz, H. Schulz, and S. Behnke, “Rgb-d object recognition and pose estimation based on pre-trained convolutional neural network features,” in Robotics and Automation (ICRA), 2015 IEEE International Conference on, pp. 1329–1335, IEEE, 2015. K. Lai, L. Bo, X. Ren, and D. Fox, “A large-scale hierarchical multi-view rgb-d object dataset,” in Robotics and Automation (ICRA), 2011 IEEE International Conference on, pp. 1817–1824, IEEE, 2011. L. Bo, X. Ren, and D. Fox, “Depth kernel descriptors for object recognition,” in Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pp. 821–826, IEEE, 2011. Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

26/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Conclusion

3D object categorization using geometric features extracted from VFH descriptor and learned with GDBN/DDBN;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

27/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Conclusion

3D object categorization using geometric features extracted from VFH descriptor and learned with GDBN/DDBN; GDBN is the probabilistic model with many RBMs which are trained sequentially;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

27/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Conclusion

3D object categorization using geometric features extracted from VFH descriptor and learned with GDBN/DDBN; GDBN is the probabilistic model with many RBMs which are trained sequentially; DDBN is constructed from DRBM which is based on RBM and the joint distribution model;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

27/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Conclusion

3D object categorization using geometric features extracted from VFH descriptor and learned with GDBN/DDBN; GDBN is the probabilistic model with many RBMs which are trained sequentially; DDBN is constructed from DRBM which is based on RBM and the joint distribution model; The use of PCD is better than CD, and FEPCD outperforms PCD;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

27/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Conclusion

3D object categorization using geometric features extracted from VFH descriptor and learned with GDBN/DDBN; GDBN is the probabilistic model with many RBMs which are trained sequentially; DDBN is constructed from DRBM which is based on RBM and the joint distribution model; The use of PCD is better than CD, and FEPCD outperforms PCD; The experimental results using DDBN are encouraging.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

27/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Future work

We will utilize a hybrid deep architecture that combines the advantage of generative and discriminative models;

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

28/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Future work

We will utilize a hybrid deep architecture that combines the advantage of generative and discriminative models; We will attempt to embed our algorithm in TurtleBot2.

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

28/29

Introduction Method Overview 3D Object Categorization Experimental Results Conclusion and Future Work

Thank you for your attention! Any questions

Presenter: Haris Ahmad Khan — Mar 01th , 2017

Generative vs. Discriminative DBN

29/29