Solution :-
a) Size of the filter is 7-by-7-by-X. Number X is the number of convolutional layers we would like to apply in the first stage. Suppose we take X as 30 filters. So we are going to convolve 30 number of 7x7 filters on the given input. X = 30
b) Size of the feature map can be given by

where W = input size and F = filter size and P = zero padding size and S = Stride.
Here W = 32 , F = 7 and P = 0 , S = 1.
So size of one feature map is 26x26
c) If we have 32 filters, then we will have 32 feature maps after applying first layer.
d) We can apply Rectified Linear Unit (ReLU) as a non linear stage.

e) Another convolutional layer's filter size can be 3-by-3-by-15.
f) We will use 2x2 pooling with stride = 2... So matrix will look like after applying pooing as..
(Here we apply max-pooling operation )
| 5 | 4 | 10 |
| 7 | 4 | 10 |
By applying pooling we will have reduced feature size without loosing the important information. Reduced feature size will help us to have further Network with less required computation and feasible to train in certain time.
Let's design a convolutional neural network together. Suppose the size of the input image is 32-by-32-by-1...
9. (10 pts) Consider a convolutional neural network whose inputs are an RGB color image of size 39 x 39 (i.e., the input volume size is 39 x 39 x 3). The network has three convolutional layers and the parameters of each convolutional layer are given in Figure 1. Compute the output volume size of the feature maps at each convolutional layer. Conv1 Conv2 Conv3 Filter size = 3 X 3 # filters = 10 Stride = 1 Padding =...
For a 2-D convolutional neural network in the following figure softmax Fully connected 4 Fully connected 128 Max pooling 2x2, stride 2x2 Conv kernal 4x4, stride 2x2, filter 32, valid padding Conv kernal 8x8, stride 4x4, filter 16, valid padding Input 84x84x1 (a) How many weights and biases are there in this neural network? Please specify the number of weights and biases for each layer respectively. (b) Please describe the shapes of the outputs of each layer. (e.g. for input...
A Convolutional Neural Network (CNN) has 3 consecutive 5×5 convolutional layers with a stride of 1 without pooling. What is the size of the receptive field for a neuron (the set of input image pixels which activate the neuron) in the 3rd convolutional layer? A. 5. B. 9. C. 13. D. 125.
from PIL import Image import random # NOTE: Feel free to add in any constant values you find useful to use BLACK = (0, 0, 0) WHITE = (255, 255, 255) # NOTE: Feel free to add in any helper functions to organize your code but # do NOT rename any existing functions (or else, autograder # won't be able to find them!!) # NOTE: The following function is already completed for you as an example # You can use...