Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorrect suggestion to replace == True with is True for pandas dataframe #17009

Closed
unknown-eps opened this issue Mar 27, 2025 · 2 comments
Closed

Comments

@unknown-eps
Copy link

Summary

https://play.ruff.rs/c4303e07-49f1-456d-b48c-6a6a9b9bfcd5

import pandas as pd

temp =pd.DataFrame({'a':[1,2,3],'b':['True','True','False']})

filter_temp = temp[temp['b'] == True] #Correct

ruff_suggested = temp[temp['b'] is True] # Will give error on running

Ruff suggests the second one

Version

No response

@Daverball
Copy link
Contributor

Daverball commented Mar 27, 2025

This requires type inference and multi-file analysis, both of which are a little ways off still.

This boils down to, that the rule should not trigger when x == True returns something other than bool.

I tend to use .eq(True) instead of == True in pandas, it reads better and won't trigger this rule. Usually you don't even really need that though and can just directly use temp['b'] as the filter, since eq(True) just gives you back the same series, if all the values are already boolean, so you only need it when you have mixed values.

@dylwil3
Copy link
Collaborator

dylwil3 commented Mar 27, 2025

Sorry you ran into this! This is a known issue, and it's why the rule auto-fix is marked as unsafe (see the documentation for the rule)

@dylwil3 dylwil3 closed this as completed Mar 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants