Q61 Which sklearn class is used to implement logistic regression?
log_regression()
LogisticRegression()
reg_log()
log_reg()
Q62 How do you calculate the precision score in a classification task using sklearn?
precision_score()
precision()
calc_precision()
precision_calc()
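For reference, a minimal sketch of the sklearn workflow these two questions cover, fitting `LogisticRegression` and scoring with `precision_score` (the synthetic dataset and all variable names here are illustrative, not part of the quiz):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score
from sklearn.model_selection import train_test_split

# Illustrative synthetic binary-classification data
X, y = make_classification(n_samples=200, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Fit logistic regression, then compute precision on the held-out split
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(precision_score(y_test, clf.predict(X_test)))  # a value in [0, 1]
```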
Q63 Which library is used for implementing decision trees in Python?
numpy
pandas
sklearn
matplotlib
Q64 A classification model has high accuracy but low recall. What does this indicate?
Model is overfitting
Model has high false negatives
Model has high false positives
Data is imbalanced
Q65 A logistic regression model is overfitting on training data. What can be done to mitigate this?
Reduce the training data size
Increase the number of features
Apply regularization
Use higher learning rate
Q66 A decision tree model has a very high depth and performs poorly on test data. What could be the reason?
Overfitting
Underfitting
Insufficient training data
Wrong cost function
Q67 What is a decision tree in machine learning?
A model for regression tasks
A model for clustering
A tree-like structure for decision making
A method for dimensionality reduction
Q68 What is the purpose of pruning in decision trees?
To increase the depth of the tree
To reduce the size of the tree
To improve training speed
To prevent overfitting
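One concrete way sklearn exposes pruning is cost-complexity pruning via the `ccp_alpha` parameter of `DecisionTreeClassifier`; the sketch below (using the iris dataset purely as an example) shows that a nonzero `ccp_alpha` yields a smaller tree:

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Unpruned tree vs. a cost-complexity-pruned tree (ccp_alpha chosen for illustration)
full = DecisionTreeClassifier(random_state=0).fit(X, y)
pruned = DecisionTreeClassifier(random_state=0, ccp_alpha=0.02).fit(X, y)

# Pruning reduces the number of nodes in the tree
print(full.tree_.node_count, pruned.tree_.node_count)
```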
Q69 How does a decision tree split data at each node?
By maximizing accuracy
By minimizing distance between data points
By maximizing information gain
By using random splits
Q70 What is the main advantage of using Random Forest over a single Decision Tree?
It reduces the complexity of the model
It reduces overfitting
It increases the depth of trees
It uses fewer features
Q71 What is the role of entropy in decision trees?
It measures the homogeneity of data
It measures model accuracy
It measures the distance between data points
It calculates the tree depth
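Entropy can be computed directly from class proportions as H = −Σ p·log₂(p); a short self-contained sketch (the `entropy` helper is written here for illustration, it is not an sklearn function):

```python
import math

def entropy(labels):
    """Shannon entropy of a list of class labels, in bits."""
    n = len(labels)
    counts = {c: labels.count(c) for c in set(labels)}
    return sum(-(k / n) * math.log2(k / n) for k in counts.values())

print(entropy([0, 0, 0, 0]))  # 0.0 — perfectly homogeneous node
print(entropy([0, 0, 1, 1]))  # 1.0 — maximally mixed node
```

A pure node has entropy 0, which is why splits that maximize information gain drive nodes toward homogeneity.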
Q72 Which sklearn class is used to implement decision trees?
DecisionTree()
TreeDecision()
DecisionTreeClassifier()
ClassifierTree()
Q73 How can you visualize a decision tree in Python using sklearn?
visualize_tree()
plot_tree()
draw_tree()
show_tree()
Q74 Which sklearn class is used to implement a Random Forest classifier?
RandomForest()
ForestClassifier()
RandomForestClassifier()
RandomClass()
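For reference, a minimal `RandomForestClassifier` sketch (iris data used purely as an example); the ensemble's individual trees are available via `estimators_`:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# n_estimators sets the number of trees in the forest
rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
print(len(rf.estimators_))  # 100 individual decision trees
```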
Q75 How does the max_depth parameter in a decision tree model affect its performance?
Controls the number of features
Controls the depth of the tree
Controls the number of samples
Controls the number of trees
Q76 A decision tree is overfitting the training data. What can you do to resolve this?
Increase the tree depth
Use fewer features
Prune the tree
Increase the training data
Q77 A Random Forest model gives inconsistent predictions across different datasets. What could be the cause?
Not enough trees
Too many features
Overfitting in individual trees
No randomness in the dataset
Q78 A decision tree has poor performance on test data but excellent performance on training data. What is the issue?
High variance
Underfitting
Overfitting
Data imbalance
Q79 What is the primary goal of clustering algorithms?
To label data
To classify data
To group similar data points
To reduce dimensionality
Q80 Which of the following is an example of a density-based clustering algorithm?
K-Means
DBSCAN
Hierarchical clustering
Agglomerative clustering
Q81 What is the key difference between K-Means and Hierarchical clustering?
K-Means requires a predefined number of clusters
Hierarchical clustering is faster
K-Means doesn't need distance metrics
Hierarchical clustering cannot be used for large datasets
Q82 Which clustering algorithm does not require specifying the number of clusters in advance?
K-Means
DBSCAN
K-Nearest Neighbors
PCA
Q83 Which sklearn class is used to implement K-Means clustering?
kmeans_clustering()
KMeans()
cluster_means()
cluster_KMeans()
Q84 How do you specify the number of clusters in K-Means clustering using sklearn?
num_clusters()
cluster_count()
n_clusters
num_clust
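A minimal sketch tying these two questions together: `KMeans` with the `n_clusters` parameter (the blob data is synthetic, for illustration only):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic 2-D data with 3 well-separated groups
X, _ = make_blobs(n_samples=150, centers=3, random_state=0)

# n_clusters is how the number of clusters is specified
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print(km.cluster_centers_.shape)  # (3, 2): one 2-D centroid per cluster
```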
Q85 Which class is used to implement the DBSCAN algorithm in sklearn?
dbscan_clustering()
DBSCAN()
density_cluster()
cluster_density
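For reference, a `DBSCAN` sketch showing its two key parameters, `eps` and `min_samples` (the values and synthetic data here are illustrative; points labeled −1 are noise, which is relevant to Q88 below):

```python
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_blobs

# Synthetic data with 3 compact groups
X, _ = make_blobs(n_samples=150, centers=3, cluster_std=0.5, random_state=0)

# eps: neighborhood radius; min_samples: points needed to form a dense region
db = DBSCAN(eps=0.5, min_samples=5).fit(X)

# Label -1 marks noise points, so exclude it when counting clusters
n_clusters = len(set(db.labels_)) - (1 if -1 in db.labels_ else 0)
print(n_clusters)
```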
Q86 In K-Means, what is the effect of using a large number of clusters (n_clusters)?
Increases overfitting
Increases the number of iterations
Increases randomness
Improves accuracy
Q87 A K-Means model produces clusters of very different sizes. What could be the reason?
Wrong distance metric
Data is not scaled
Overfitting
High dimensionality
Q88 A DBSCAN model labels a large number of points as noise. What could be the cause?
High epsilon value
Low min_samples value
Low epsilon value
High min_samples value
Q89 A hierarchical clustering algorithm produces a large number of small clusters. What could resolve this issue?
Increase the number of clusters
Reduce the number of clusters
Increase the linkage criterion
Use a different distance metric
Q90 What is the purpose of dimensionality reduction in machine learning?
To increase model complexity
To remove redundant features
To reduce model accuracy
To create more features