Home Certification A Foundation Program in Data Science Quiz Answers – Grey Campus

A Foundation Program in Data Science Quiz Answers – Grey Campus

1257
0
A Foundation Program in Data Science Quiz Answers - Grey Campus

Enroll Here: A Foundation Program in Data Science Certification

Question 1: Fill in the blanks with the correct option(s): Logistic regression is a ____________ regression technique that is used to model data having a ________ outcome

  • linear, numeric
  • linear, binary
  • nonlinear, numeric
  • nonlinear, binary

Question 2: Which of the following is NOT a supervised learning?

  • PCA
  • Decision Tree
  • Linear Regression
  • Naive Bayesian

Question 3: Which of the following is the method to find the best fit line for data in Linear Regression?

  • Least Square Error
  • Maximum Likelihood
  • Logarithmic Loss
  • Both A and B

Question 4: Which of the following assumption in regression modelling impacts the trade-off between under-fitting and over-fitting the most?

  • The polynomial degree
  • Whether we learn the weights by matrix inversion or gradient descent
  • The use of a constant-term
  • None of the above

Question 5: Which one of the following statements is true regarding residuals in regression analysis?

  • Mean of residuals is always zero
  • Mean of residuals is always less than zero
  • Mean of residuals is always greater than zero
  • There is no such rule for residuals.

Question 6: Q7. Which of the one is true about Heteroskedasticity?

  • Linear Regression with varying error terms
  • Linear Regression with constant error terms
  • Linear Regression with zero error terms
  • None of these

Question 7: Q10. To test linear relationship of y(dependent) and x(independent) continuous variables, which of the following plot best suited?

  • Scatter plot
  • Bar chart
  • Histograms
  • None of these

Question 8: Which of the following is true about “Ridge” or “Lasso” regression methods in case of feature selection?

  • Ridge regression uses subset selection of features
  • Lasso regression uses subset selection of features
  • Both use subset selection of features
  • None of above

Question 9: Which of the following options is true regarding “Regression” and “Correlation”? Note: y is the dependent variable and x is an independent variable.

  • The relationship is symmetric between x and y in both.
  • The relationship is not symmetric between x and y in both.
  • The relationship is not symmetric between x and y in case of correlation but in case of regression it is symmetric.
  • The relationship is symmetric between x and y in case of correlation but in case of regression it is not symmetric.

Question 10: Which of the following methods does not have a closed form solution for its coefficients?

  • Ridge regression
  • Lasso
  • Both Ridge and Lasso
  • None of both

Question 11: Which of the following step/assumption in regression modeling impacts the trade-off between under-fitting and over-fitting the most?

  • The polynomial degree
  • Whether we learn the weights by matrix inversion or gradient descent
  • The use of a constant-term
  • None of the above

Question 12 : Let’s say a “Linear regression” model perfectly fits the training data (train error is zero). Now, Which of the following statement is true?

  • You will always have test error zero
  • You can not have test error zero
  • None of the above
  • Both A and B

Question 13: Which of the following indicates a fairly strong relationship between X and Y?

  • Correlation coefficient = 0.9
  • The p-value for the null hypothesis Beta coefficient =0 is 0.0001
  • The t-statistic for the null hypothesis Beta coefficient=0 is 30
  • None of these

Question 14: Which of the following algorithm are not an example of an ensemble learning algorithm?

  • Random Forest
  • Extra Trees
  • Gradient Boosting
  • Decision Trees

Question 15: Which of the following is/are true while applying bagging to regression trees? 1.We build the N regression with N bootstrap sample. 2.We take the average the of N regression tree. 3. Each tree has a high variance with low bias.

  • 1 and 2
  • 2 and 3
  • 1 and 3
  • 1,2 and 3

Question 16: How to select best hyperparameters in tree based models?

  • Measure performance over training data
  • Measure performance over validation data
  • Both of these
  • None of these

Question 17: What are tree based classifiers?

  • Classifiers which form a tree with each attribute at one level.
  • Classifiers which perform series of condition checking with one attribute at a time.
  • Both the options given above.
  • None of the above

Question 18: How will you counter over-fitting in decision tree?

  • By pruning the longer rules
  • By creating new rules
  • Both By pruning the longer rules’ and ‘ By creating new rules’
  • None of the option

Question 19: Which of the following sentence(s) is/are correct?

  • In pre-pruning a tree is ‘pruned’ by halting its construction early.
  • A pruning set of class labeled tuples is used to estimate cost complexity.
  • The best pruned tree is the one that minimizes the number of encoding bits.
  • All of the above

Question 20: Which one of these is not a tree based learner?

  • CART
  • ID3
  • Bayesian Classifier
  • Random Forest

LEAVE A REPLY

Please enter your comment!
Please enter your name here