Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ability to re-scrape URLs and replace old files #73

Open
faucetlol opened this issue Jan 2, 2023 · 3 comments
Open

Ability to re-scrape URLs and replace old files #73

faucetlol opened this issue Jan 2, 2023 · 3 comments

Comments

@faucetlol
Copy link
Contributor

I ended up making the mistake of scraping an artist's DeviantArt account that had its art as "Watchers-Only Deviations" which resulted in it only downloading blurry preview images.

image

Watching the artist to gain access to the files is easy enough, but there doesn't seem to be any way that I can see to re-scrape the URL and replace the old blurry files with the regular submissions. I thought about just deleting them from the database, but it seems to violate foreign key restraints and I didn't really feel like breaking everything was a good idea.

Not really of great importance to me since I'm not even sure I'll upload any of these files anyway, but could be a nice-to-have. I've experienced artists uploading edited versions or different resolutions after I've scraped their gallery before, but it's rare that happens anyways.

@Earlopain
Copy link
Owner

Makes sense. It would actually be interesting to somehow keep both versions of the image around.

Less interesting for the DeviantArt samples, but more so when the artist uploads an updated version.

@Earlopain
Copy link
Owner

Can you link the artist you've got in that screenshot? Would be nice to have something to properly test with.

@faucetlol
Copy link
Contributor Author

I believe it was https://www.deviantart.com/zetsuboucchi

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants