AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #19

shelbywhite · 2023-03-22T23:27:09Z

Trying to run this code on Google Colab and seeing this error now. Simply just trying to use the demo provided in this repo, but now it's throwing the following error:

AttributeError Traceback (most recent call last)
in
3 # Fit the Concept model to the images and vocabulary
4 concept_model = ConceptModel()
----> 5 concepts = concept_model.fit_transform(img_names, docs=selected_nouns)
6
7 # Get the predicted probabilities for each concept cluster for each image

1 frames
/usr/local/lib/python3.9/dist-packages/concept/_model.py in _extract_textual_representation(self, docs)
400 # Extract vocabulary from the documents
401 self.vectorizer_model.fit(docs)
--> 402 words = self.vectorizer_model.get_feature_names()
403
404 # Embed the documents and extract similarity between concept clusters and words

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names'

MaartenGr · 2023-03-24T05:25:44Z

Ah, I believe that is an issue with the scikit-learn version. I believe that if you install a sklearn version pre 1.0, then it should work.

renswilderom · 2023-09-02T06:42:09Z

Hello Maarten, I had the same issue. Installing a sklearn version older than 1.0 will probably work indeed.

What I understand from this SO post, is that get_feature_names is depreciated and replaced by get_feature_names_out() from sklearn version 1.0 and higher.

MaartenGr · 2023-09-02T09:01:22Z

Also, I would advise using BERTopic instead as that has more options for multi-modal topic modeling.

renswilderom · 2023-09-02T09:16:39Z

OK - thanks for the tip. I was already using BERTopic for text, but didn't know it had this multimodal feature. Great!

BingBing20230401 · 2024-04-19T05:18:10Z

thanks.!! it solved my problem too!

--What I understand from this SO post, is that get_feature_names is depreciated and replaced by get_feature_names_out() from sklearn version 1.0 and higher.

bjornekstrom · 2024-10-29T08:30:28Z

Would it be possible to make this work with a newer version of scikit-learn?

Edit: I edited get_feature_names() to get_feature_names_out() for noun passing to Concept to work with newer versions of scikit-learn. I take it that the version of pip needs to be updated but now the _model.py version in the repo works. A pull request has been created. #24

bjornekstrom · 2024-11-05T17:59:28Z

This has been solved through #24.

aysedeniz09 mentioned this issue Jun 22, 2023

AttributeError: 'ConceptModel' object has no attribute 'image_cluster_df' #22

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #19

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #19

shelbywhite commented Mar 22, 2023

MaartenGr commented Mar 24, 2023

renswilderom commented Sep 2, 2023

MaartenGr commented Sep 2, 2023

renswilderom commented Sep 2, 2023

BingBing20230401 commented Apr 19, 2024

bjornekstrom commented Oct 29, 2024 •

edited

Loading

bjornekstrom commented Nov 5, 2024

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #19

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #19

Comments

shelbywhite commented Mar 22, 2023

MaartenGr commented Mar 24, 2023

renswilderom commented Sep 2, 2023

MaartenGr commented Sep 2, 2023

renswilderom commented Sep 2, 2023

BingBing20230401 commented Apr 19, 2024

bjornekstrom commented Oct 29, 2024 • edited Loading

bjornekstrom commented Nov 5, 2024

bjornekstrom commented Oct 29, 2024 •

edited

Loading