Import the WordNetLemmatizer class from nltk.stem. Create a list alpha_only that contains only alphabetical characters.

动态未结 0 119

小小草 LV1, posted September 23, 2023, 23:40

Here's the completed code:

```
# Import WordNetLemmatizer (and Counter, which the bag-of-words step needs)
from collections import Counter
from nltk.stem import WordNetLemmatizer

# `tokens` and `english_stops` are assumed to be defined earlier in the exercise

# Retain alphabetic words: alpha_only
alpha_only = [t for t in tokens if t.isalpha()]

# Remove all stop words: no_stops
no_stops = [t for t in alpha_only if t not in english_stops]

# Instantiate the WordNetLemmatizer
wordnet_lemmatizer = WordNetLemmatizer()

# Lemmatize all tokens into a new list: lemmatized
lemmatized = [wordnet_lemmatizer.lemmatize(t) for t in no_stops]

# Create the bag-of-words: bow
bow = Counter(lemmatized)

# Print the 10 most common tokens
print(bow.most_common(10))
```

The code imports `WordNetLemmatizer` from `nltk.stem`. It then creates a list called `alpha_only` that keeps only the tokens made up entirely of alphabetic characters.
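To see what the filtering and counting steps do without downloading any NLTK data, here is a minimal sketch using only the standard library. The sample `tokens` list and the tiny `english_stops` set are made up for illustration (they are not NLTK's tokenizer output or stop-word list), and the lemmatization step is left out because `WordNetLemmatizer` needs the WordNet corpus to be installed.

```python
from collections import Counter

# Hypothetical sample input; in the exercise, `tokens` comes from a tokenizer.
tokens = ["the", "cats", "sat", "on", "the", "mat", ",", "3", "cats", "slept", "."]

# Tiny illustrative stop-word set (an assumption, not NLTK's English list).
english_stops = {"the", "on"}

# Keep only tokens made up entirely of letters; this drops ",", "3", and ".".
alpha_only = [t for t in tokens if t.isalpha()]

# Drop the stop words.
no_stops = [t for t in alpha_only if t not in english_stops]

# Count the remaining tokens to build the bag-of-words.
bow = Counter(no_stops)

# Print the single most common token with its count.
print(bow.most_common(1))  # [('cats', 2)]
```

Note that `str.isalpha()` tests each whole token, so punctuation and number tokens are removed entirely rather than having characters stripped out of them.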