
Q31 Text preprocessing slows down significantly when applied to large datasets. What is a potential fix?
Use faster tokenization methods
Use smaller datasets
Skip normalization
Disable stemming
Q32 What is the primary purpose of POS tagging in NLP?
To identify stopwords
To label each word with its grammatical role
To tokenize text
To generate embeddings
Q33 Which of the following is a common POS tagging technique?
Rule-based
Bag-of-Words
Transformer-based
Embedding-based
Q34 Which Python library provides the pos_tag function for tagging words?
spaCy
nltk
TextBlob
pandas
Q35 What is the main challenge of POS tagging for ambiguous words like "can"?
Lack of training data
Ambiguity in context
Complex tokenization
Non-standard text
Q36 How does POS tagging assist in Named Entity Recognition (NER)?
It identifies word context
It detects sentence structure
It assigns roles to entities
It identifies grammatical errors
Q37 Which POS tagging method uses hidden states to model word sequences?
Rule-based
Hidden Markov Model
Bag-of-Words
Embedding-based
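
A minimal sketch of the HMM approach from Q37, using nltk's HiddenMarkovModelTrainer on a slice of the tagged Treebank sample (assumes the treebank corpus has been downloaded; hidden states are the POS tags, observations are the words):

```python
import nltk
from nltk.corpus import treebank
from nltk.tag import hmm

nltk.download("treebank")

# Supervised HMM training: hidden states are POS tags, observed symbols are words.
train_sents = treebank.tagged_sents()[:3000]
tagger = hmm.HiddenMarkovModelTrainer().train_supervised(train_sents)

print(tagger.tag(["The", "dog", "can", "run"]))
```
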
Q38 How do you perform POS tagging using spaCy in Python?
nlp.pos(text)
nlp(text).pos_
nlp(text)
nlp.pos_tags(text)
Q39 Which attribute of spaCy tokens can be used to get the POS tag?
text
lemma_
pos_
tag_
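
To make Q38 and Q39 concrete, a short sketch using spaCy: run text through the loaded pipeline with nlp(text), then read each token's pos_ (coarse) or tag_ (fine-grained) attribute. Assumes the en_core_web_sm model is installed:

```python
import spacy

# Assumes the small English model: python -m spacy download en_core_web_sm
nlp = spacy.load("en_core_web_sm")

doc = nlp("She can open the can")
for token in doc:
    # pos_ is the coarse universal POS tag; tag_ is the fine-grained tag.
    print(token.text, token.pos_, token.tag_)
```
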
Q40 How do you display the detailed POS tags of a sentence using nltk?
pos_tag(sentence)
pos_tag(word_tokenize(sentence))
tag(sentence)
tokenize(sentence)
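
And the nltk equivalent from Q40 (also the answer to Q34): pos_tag expects a list of tokens, so the sentence goes through word_tokenize first. Exact resource names for the downloads can vary slightly across nltk versions:

```python
import nltk
from nltk import pos_tag, word_tokenize

# One-time downloads; names may differ slightly by nltk version.
nltk.download("punkt")
nltk.download("averaged_perceptron_tagger")

sentence = "The striped bats are hanging on their feet"
print(pos_tag(word_tokenize(sentence)))
# e.g. [('The', 'DT'), ('striped', 'JJ'), ('bats', 'NNS'), ...]
```
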
Q41 A POS tagging model incorrectly tags all nouns as verbs. What could be a likely issue?
Incorrect tokenization
Insufficient training data
Incorrect tagging logic
Normalization errors
Q42 A POS tagging system struggles with unseen words in a test dataset. What should you use?
Rule-based methods
Pre-trained embeddings
Bag-of-Words
Word frequency analysis
Q43 A POS tagging pipeline fails to distinguish between “book” as a noun and a verb. What should you improve?
Tagging rules
Context modeling
Tokenization
Dataset size
Q44 What is the primary goal of Named Entity Recognition (NER)?
Identify grammatical errors
Classify entities into predefined categories
Generate embeddings
Tokenize text
Q45 Which of the following is a commonly recognized entity type in NER?
Noun
Location
Verb
Adjective
Q46 How does context affect the performance of NER models?
Context doesn’t affect performance
Improves recognition of ambiguous entities
Reduces performance
No impact
Q47 Which algorithm is commonly used for NER tasks?
Decision Tree
K-Means
Conditional Random Fields (CRF)
Linear Regression
Q48 What is the role of a gazetteer in NER?
Provides training data
Generates embeddings
Lists predefined entities
Tokenizes text
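
As a concrete example of gazetteer-style lookup (Q48), spaCy's EntityRuler matches entities against a predefined pattern list; the patterns below are illustrative:

```python
import spacy

nlp = spacy.blank("en")

# EntityRuler behaves like a gazetteer: a predefined list of entity patterns.
ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns([
    {"label": "GPE", "pattern": "San Francisco"},
    {"label": "ORG", "pattern": "Acme Corp"},
])

doc = nlp("Acme Corp opened an office in San Francisco.")
print([(ent.text, ent.label_) for ent in doc.ents])
```
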
Q49 Which neural network architecture is commonly paired with CRF for NER?
RNN
CNN
LSTM
Transformer
Q50 Which Python library provides a pretrained NER model?
nltk
spaCy
TextBlob
pandas
Q51 How do you extract named entities using spaCy?
doc.entities
doc.ents
doc.tokens
doc.entity_types
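
A minimal sketch for Q50 and Q51: load a pretrained spaCy pipeline and read doc.ents (assumes en_core_web_sm is installed):

```python
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Apple is looking at buying a U.K. startup for $1 billion.")

# doc.ents holds the recognized entity spans with their labels.
for ent in doc.ents:
    print(ent.text, ent.label_)
```
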
Q52 How do you train a custom NER model using spaCy?
Update the pipeline
Modify stopwords
Train a new word2vec model
Manually tag data
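
For Q52, a hedged sketch of updating the pipeline in spaCy 3: wrap annotated text in Example objects and pass them to nlp.update. The training sentence and entity offsets here are illustrative:

```python
import random
import spacy
from spacy.training import Example

nlp = spacy.load("en_core_web_sm")

# Illustrative annotation: character offsets of each entity plus its label.
TRAIN_DATA = [
    ("Acme Corp shipped the order", {"entities": [(0, 9, "ORG")]}),
]

optimizer = nlp.resume_training()
for _ in range(10):
    random.shuffle(TRAIN_DATA)
    for text, annotations in TRAIN_DATA:
        example = Example.from_dict(nlp.make_doc(text), annotations)
        nlp.update([example], sgd=optimizer)
```
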
Q53 An NER model incorrectly tags all city names as organizations. What is a likely issue?
Poor tokenization
Ambiguous training data
Incorrect embeddings
Low batch size
Q54 An NER model fails to recognize new entities in a specific domain. What should you do?
Use a gazetteer
Ignore domain data
Train on unrelated datasets
Reduce model size
Q55 An NER model struggles to generalize across different datasets. What technique can help?
Train a larger model
Use domain adaptation
Skip embedding layers
Reduce training data
Q56 What is the main purpose of word embeddings in NLP?
To tokenize text
To capture semantic meaning of words
To remove stopwords
To perform lemmatization
Q57 Which method does GloVe use to learn word embeddings?
Probabilistic models
Matrix factorization
Recurrent networks
Transformers
Q58 What is the difference between Word2Vec and GloVe?
Word2Vec is count-based, GloVe is predictive
Word2Vec uses local context, GloVe uses global context
Word2Vec uses global statistics
GloVe ignores word frequency
Q59 Which of the following training modes is available in Word2Vec?
Skip-gram
Bag-of-Words
LSTM
Transformer
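
To ground Q58 and Q59: gensim's Word2Vec learns from local context windows, and sg=1 selects skip-gram (sg=0 gives CBOW). The toy corpus here is illustrative:

```python
from gensim.models import Word2Vec

# Tiny illustrative corpus: each sentence is a list of tokens.
sentences = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
]

# sg=1 -> skip-gram; sg=0 -> CBOW. window sets the local context size.
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1)

print(model.wv["cat"].shape)         # (50,)
print(model.wv.most_similar("cat"))  # nearest neighbours in embedding space
```
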
Q60 Why are pre-trained embeddings like GloVe preferred over training from scratch?
They are less accurate
They reduce training time
They ignore rare words
They work with any language
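
Finally, a sketch of Q60's point: loading pretrained GloVe vectors through gensim.downloader skips training entirely (glove-wiki-gigaword-100 is one of gensim's hosted datasets; the first call downloads and caches it):

```python
import gensim.downloader as api

# First call downloads and caches the pretrained vectors (~130 MB).
glove = api.load("glove-wiki-gigaword-100")

# Semantic similarity with zero training time.
print(glove.most_similar("king", topn=3))
print(glove.similarity("cat", "dog"))
```
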