Question

Given a figure visualising the decision boundaries learned by a SVM, explain which type of kernel function is used. Describe the hyper-parameter(s) of that kernel function and explain the meaning of those hyper-parameter(s).

OD 0990 Training dataset Decision boundaries learned by SVM

0 0
Add a comment Improve this question Transcribed image text
Answer #1

The Support Vector Machine is a supervised learning algorithm mostly used for classification purposes but can also be used for regression purposes. The main idea is that the algorithm tries to find the optimal hyperplane that can be used to classify new data points, based on the labeled data (training data) the hyperplane is a simple line, in two dimensions.

Usually the learning algorithm tries to learn the most common characteristics (which distinguishes one class from another) of the class and the classification is based on the representative characteristics learned (so classification is based on differences between classes). The SVM is working the other way around. The most similar examples are found between classes. Those are the support vectors.

It is important to select the right kernel, because if the transformation is wrong, then the model can have very bad results. As a thumb rule, always check if you have linear data, and use linear SVM (linear kernel) in that case. Linear SVM is a parametric model but it's not an RBF kernel SVM, so the latter's complexity grows with the size of the training set. Not only is it more expensive to train an RBF kernel SVM, you also have to keep the kernel matrix around, and the projection into this higher-dimensional "infinite" space where the data becomes linearly separable is also more expensive during prediction. You also have more hyperparameters to tune, so the selection of models is more expensive too. And lastly, it's easier to overfit a complex model.

A hyper parameter in machine learning is a parameter which is used to control the learning process by its value. By contrast, other parameter values (usually node weights) are derived via training. Hyper parameters can be classified as model hyperparameters that can not be deduced when fitting the machine to the training set because they refer to the selection task of the model or the hyper parameters algorithm That in principle has no influence on model performance but has an impact on the speed and quality of the learning process. The topology and size of a neural network is an example of a hyper parameter model. Examples of hyper parameters for algorithms are learning rate and mini-batch size.

Add a comment
Know the answer?
Add Answer to:
Given a figure visualising the decision boundaries learned by a SVM, explain which type of kernel...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • 1 (10 points) Multiple Choice 1. Which of the following statement(s) is/are true regarding the SVM...

    1 (10 points) Multiple Choice 1. Which of the following statement(s) is/are true regarding the SVM classifier? A. The margin definition in the SVM formulation can be considered as a r larization term to prevent overfitting B. Any function can be used a kernel function. C. Using a valid kernel, an SVM classifier can be trained without knowing the feature values for each sample D. The so-called "support vectors" refer to the positive and negative planes egu- 2. Which of...

  • Supply and Demand v Given what you have learned about 1) demand (and its determinants) and...

    Supply and Demand v Given what you have learned about 1) demand (and its determinants) and quantity demanded and 2) supply (and its determinants) and quantity supplied work with your group members to analyze this hypothetical situation: There is a shortage of eggs. Consumers are substituting what they perceive to be "healthy" sandwich spreads for traditional mayonnaise. As a result, the makers of mayonnaise are introducing new sandwich spreads into the market The price of traditional mayonnaise is falling. Why?...

  • hi all three questions are multiple choice can you please help wit answer thank you Which...

    hi all three questions are multiple choice can you please help wit answer thank you Which one of the following options is NOT a reason for massively-parallel computing to be used to improve scientific research outcomes? With massively-parallel computing, it is not possible to save time when performing a simulation. With massively-parallel computing, it becomes more feasible to explore parameter space. With massively-parallel computing, it is possible to perform simulations at much higher resolution. With massively-parallel computing, it is possible...

  • PLease answer the questions given, I dont need just the conclusion of anova test Three brands...

    PLease answer the questions given, I dont need just the conclusion of anova test Three brands of gasoline (Shell, BP, and Marathon) were deliberately selected for an observational study. The study was conducted to understand which brand of gasoline among the three selected is cheaper across the metropolitan areas of mid-west. A gas station for each brand of gas from 10 metropolitan areas were randomly selected for the study. It is a well-known fact that, the price drivers pay for...

  • 1. (2 points) If you were given a microscope slide with an unknown type of muscle...

    1. (2 points) If you were given a microscope slide with an unknown type of muscle tissue, what features would you look for to determine whether the tissue was smooth muscle or skeletal muscle? Give 2 characteristics that would allow you to distinguish between these tissue types. Be sure to state which type of muscle has the characteristics you describe.) 2. Give 3 examples of different types of connective tissues found within the body and briefly note one function of...

  • Item 7 In the case below, the original source material is given along with a sample...

    Item 7 In the case below, the original source material is given along with a sample of student work. Determine the type of plagiarism by clicking the appropriate radio button. Original Source Material Student Version Major changes within organizations are usually initiated by those who are in power. Such decision-makers sponsor the change and then appoint someone else - perhaps the director of training - to be responsible for implementing and managing change. Whether the appointed change agent is in...

  • Application: 1) After reading the missing report and examining the evidence, which linear equation should be...

    Application: 1) After reading the missing report and examining the evidence, which linear equation should be used in order to predict the height of the victim? 2) Graph the linear equation that should be used to determine the height of the victim on the axis below. 4Y 200 180t 160 140 120 100 80 60 40 20 75 60 45 30 15 Bone Length (cm) Height (cm) Examine the evidence and compare the data to your graph. Do you think...

  • The assignment requires reading and understanding the data model used by Cassandra and representing a given ER model with Cassandra “tables”. The decision of which tables to define is driven by the qu...

    The assignment requires reading and understanding the data model used by Cassandra and representing a given ER model with Cassandra “tables”. The decision of which tables to define is driven by the queries that need to be answered. A software company is interested in developing a Video Sharing application. They have designed a conceptual model of the data they need to handle in order to offer users these main functions: 1. The creation of channels where they can upload videos....

  • Which of the following statements describe a Type II error? astion 11 yet swered rked out...

    Which of the following statements describe a Type II error? astion 11 yet swered rked out of Flag estion Select one: a. Stating that there was an effect when actually there was no effect. O b. Stating that there was no effect when in fact there was an effect. c. Saying that a person is guilty as charged when in fact the person is innocent O d. A researcher rejects a true null hypothesis. RE ion 12 Confidence intervals are...

  • (Write or type in answers, except for problem #15, which should be entered as a small...

    (Write or type in answers, except for problem #15, which should be entered as a small program, tested, and submitted in Repl.it) 1. Write a Python statement to define a list named temps using the following elements, in order: 95, 100, 77, 54, 103, 82 2. a) What is the length of the list temps? b) What Python function can be used to obtain the length of the list? Use it in a statement to obtain the length of temps...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT