Issue with crawling ImmoScout24: window.IS24 property resultList is missing #458
Comments
Okay, it seems like for "gewerbe-flaechen" there is no resultList...
Yeah - there's no resultList, but there is a https://stackoverflow.com/questions/67203717/beautifulsoup-how-to-get-data-from-window-initial-state It would probably be possible to extend the Immoscout crawler to check if Are you a Python developer? Do you want to give that a go?
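One reading of the suggestion above is a fallback: ask the browser for `window.IS24.resultList` first, and only if that is missing try `window.__INITIAL_STATE__`. The following is a minimal sketch of that idea, not the project's actual crawler code. `fetch_result_data` and `StubDriver` are hypothetical names; the only driver method assumed is Selenium's `execute_script`, and whether `__INITIAL_STATE__` holds the same listing data as `resultList` is an assumption that would need checking against the live page.

```python
def fetch_result_data(driver):
    """Return (source_name, data) from the first page global that exists.

    `driver` is any object with a Selenium-style execute_script(script) method.
    """
    # Guard against window.IS24 itself being undefined.
    result = driver.execute_script(
        "return (window.IS24 && window.IS24.resultList) || null;"
    )
    if result is not None:
        return "resultList", result

    # Assumption: some search pages expose their data here instead.
    state = driver.execute_script("return window.__INITIAL_STATE__ || null;")
    if state is not None:
        return "initial_state", state

    return None, None


class StubDriver:
    """Hypothetical stand-in for a real browser driver, for illustration only."""

    def __init__(self, initial_state=None, result_list=None):
        self.initial_state = initial_state
        self.result_list = result_list

    def execute_script(self, script):
        if "__INITIAL_STATE__" in script:
            return self.initial_state
        return self.result_list


source, data = fetch_result_data(StubDriver(initial_state={"entries": []}))
```

Returning the source name alongside the data lets the caller pick the matching parser, since the two objects are unlikely to share a layout.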
Hey @codders, thank you for your reply! I've managed to get the INITIAL_STATE as a string, but somehow I'm not able to convert it to a functional dict/JSON. Do you have any advice?
What kind of error do you get? How are you parsing it?
I've tried that:

    logger.info("Trying to get __INITIAL_STATE__")
    data = re.search(r"window\.__INITIAL_STATE__=(.*?);", search_url)
    if data is not None:
        data = data.group(1)
        data = json.loads(data)
        print(json.dumps(data, indent=4))

but I'm still dealing with "IS24 bot detection has identified our script as a bot - we've been blocked" so I cannot check if it's really working :D
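Two things in the attempt above may explain the conversion trouble: the regex is run against `search_url` rather than the page's HTML, and the non-greedy `(.*?);` stops at the first semicolon, which can cut the JSON short if any string value contains one. A hedged alternative is to locate the assignment and let `json.JSONDecoder.raw_decode` read exactly one JSON value from that position. This sketch assumes the page assigns plain JSON (not a JavaScript literal with `undefined` or functions); `extract_initial_state` and `sample_html` are illustrative names, not part of the crawler.

```python
import json
import re


def extract_initial_state(html: str):
    """Return the value assigned to window.__INITIAL_STATE__, or None."""
    # Find the start of the assignment, tolerating spaces around "=".
    match = re.search(r"window\.__INITIAL_STATE__\s*=\s*", html)
    if match is None:
        return None
    try:
        # raw_decode parses one JSON value starting at the given index and
        # ignores whatever follows it (the trailing ";" and the rest of the page).
        data, _end = json.JSONDecoder().raw_decode(html, match.end())
    except json.JSONDecodeError:
        return None
    return data


# Made-up sample page; note the ";" inside a string value.
sample_html = (
    '<script>window.__INITIAL_STATE__ = '
    '{"title": "a;b", "items": [1, 2]};</script>'
)
state = extract_initial_state(sample_html)
```

This only helps once the real page source is available, so it does not address the bot-detection block mentioned above.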
Hey there,
I debugged the code for the ImmoScout24 crawler, and it seems like the following script returns "null":
result_json = self.get_driver_force().execute_script('return window.IS24.resultList;')
I've checked in the browser, and it looks like window.IS24.resultList is not there anymore.
Does anyone have a working solution?
Thanks