CSE MCQs :: Data Science Basics and Toolbox MCQs

  1. Which of the following can be used to create sub-samples using a maximum dissimilarity approach?
    A. minDissim
    B. maxDissim
    C. inmaxDissim
    D. All of the Mentioned
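
The idea of maximum dissimilarity sampling named in the question can be illustrated with a minimal Python sketch. It is a greedy illustration of the concept only, not the implementation of any of the listed functions; the function name `max_dissimilarity_sample`, the Euclidean distance, and the toy points are all assumptions made for this example.

```python
import math

def max_dissimilarity_sample(points, start_index, n_select):
    """Greedily pick n_select point indices, starting from start_index.

    At each step, the candidate whose distance to its nearest
    already-selected point is largest gets added to the sample.
    """
    selected = [start_index]
    remaining = [i for i in range(len(points)) if i != start_index]
    while remaining and len(selected) < n_select:
        # Dissimilarity of a candidate to the set = distance to the closest selected point.
        best = max(
            remaining,
            key=lambda i: min(math.dist(points[i], points[j]) for j in selected),
        )
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data (hypothetical): three points clustered near the origin, two far away.
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (0.0, 0.2), (5.0, 0.0)]
picked = max_dissimilarity_sample(points, start_index=0, n_select=3)
print(picked)  # [0, 2, 4]
```

The sketch skips the near-duplicate points around the origin and spreads the sample across the space, which is the purpose of this kind of sub-sampling.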

  3. Which of the following can be used to impute data sets based only on information in the training set?
    A. postProcess
    B. preProcess
    C. process
    D. All of the Mentioned
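
Imputing "based only on information in the training set" means the fill-in values are learned from the training rows and then reused unchanged on new data. The following Python sketch shows that pattern with a simple median imputer; the helper names `fit_median_imputer` and `apply_imputer`, the use of `None` for missing values, and the toy data are assumptions for illustration and do not reproduce any of the listed functions.

```python
from statistics import median

def fit_median_imputer(train_rows):
    """Learn one median per column from the training rows, ignoring None."""
    n_cols = len(train_rows[0])
    return [
        median(row[c] for row in train_rows if row[c] is not None)
        for c in range(n_cols)
    ]

def apply_imputer(rows, medians):
    """Replace each None with the median learned from the training set."""
    return [
        [medians[c] if value is None else value for c, value in enumerate(row)]
        for row in rows
    ]

# Hypothetical data: None marks a missing value.
train = [[1.0, 10.0], [3.0, None], [None, 30.0], [5.0, 20.0]]
test = [[None, 100.0], [2.0, None]]

medians = fit_median_imputer(train)        # statistics come from train only
test_filled = apply_imputer(test, medians) # test values never influence the medians
print(medians)      # [3.0, 20.0]
print(test_filled)  # [[3.0, 100.0], [2.0, 20.0]]
```

Keeping the fit and apply steps separate is what prevents information from the test set leaking into the imputation.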

  5. Which of the following models includes a backwards elimination feature selection routine?
    A. MCV
    B. MARS
    C. MCRS
    D. All of the Mentioned

  7. Which of the following metrics is used for a categorical outcome?
    A. RMSE
    B. RSquared
    C. Accuracy
    D. All of the Mentioned

  9. Which of the following functions provides unsupervised prediction?
    A. cl_forecast
    B. cl_nowcast
    C. cl_precast
    D. None of the Mentioned

  11. Which of the following libraries is used for boosting generalized additive models?
    A. gamBoost
    B. gbm
    C. ada
    D. All of the Mentioned

  13. Which of the following is a characteristic of the best machine learning method?
    A. Fast
    B. Accurate
    C. Scalable
    D. All of the Mentioned

  15. Point out the correct statement:
    A. Asymptotics are usually used for inference
    B. caret includes several functions to pre-process the predictor data
    C. The function dummyVars can be used to generate a complete set of dummy variables from one or more factors
    D. All of the Mentioned
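
To make "a complete set of dummy variables" concrete, here is a minimal Python sketch that expands one factor into one 0/1 column per level. It illustrates the encoding only and is not the R function mentioned in the options; the helper name `dummy_columns` and the colour data are assumptions for this example.

```python
def dummy_columns(values):
    """Expand one categorical factor into a complete set of 0/1 columns.

    Returns the sorted levels and, for each input value, a row with a 1
    in the position of its level and 0 elsewhere.
    """
    levels = sorted(set(values))
    rows = [[1 if value == level else 0 for level in levels] for value in values]
    return levels, rows

# Hypothetical factor with three levels.
colours = ["red", "green", "red", "blue"]
levels, encoded = dummy_columns(colours)
print(levels)   # ['blue', 'green', 'red']
print(encoded)  # [[0, 0, 1], [0, 1, 0], [0, 0, 1], [1, 0, 0]]
```

A complete set keeps one column for every level, so each row sums to 1. Combined with an intercept column, that produces an exact linear dependency.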

  17. Point out the correct statement:
    A. findLinearColumns will also return a vector of column positions that can be removed to eliminate the linear dependencies
    B. findLinearCombos will return a list that enumerates dependencies
    C. The function findLinearRows can be used to generate a complete set of row variables from one factor
    D. None of the Mentioned
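
The linear dependencies these options refer to can be detected with ordinary row reduction. The Python sketch below is a conceptual illustration under stated assumptions, not the algorithm behind any function named above: the helper `dependent_columns` is hypothetical, it uses exact `Fraction` arithmetic, and it only reports columns that are exact linear combinations of columns to their left.

```python
from fractions import Fraction

def dependent_columns(matrix):
    """Return indices of columns that are linear combinations of earlier columns.

    Performs Gaussian elimination; a column that yields no pivot is
    linearly dependent on the pivot columns before it.
    """
    rows = [[Fraction(v) for v in row] for row in matrix]
    n_rows, n_cols = len(rows), len(rows[0])
    pivot_row = 0
    dependent = []
    for col in range(n_cols):
        # Look for a nonzero entry in this column at or below the current pivot row.
        pivot = next((r for r in range(pivot_row, n_rows) if rows[r][col] != 0), None)
        if pivot is None:
            dependent.append(col)
            continue
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        # Eliminate this column from all rows below the pivot.
        for r in range(pivot_row + 1, n_rows):
            factor = rows[r][col] / rows[pivot_row][col]
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[pivot_row])]
        pivot_row += 1
    return dependent

# Hypothetical design matrix: intercept plus two dummy columns that sum to the intercept.
design = [
    [1, 1, 0],
    [1, 0, 1],
    [1, 1, 0],
    [1, 0, 1],
]
removable = dependent_columns(design)
print(removable)  # [2]
```

Dropping the reported column leaves a matrix with linearly independent columns in this example.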

  19. Point out the correct statement:
    A. The difference between the class centroids and the overall centroid is used to measure the variable influence
    B. The Bagged Trees output contains variable usage statistics
    C. Boosted Trees uses a different approach from a single tree
    D. None of the Mentioned