I’ve installed your plugin but pressing any of the buttons doesn’t seem to change the string under the “Scraper User Agent” setting. I’ve tried putting my user-agent in there myself but no luck, I still get a “http error 403:Forbidden” error when I try to scrape.
Thank you, hopefully that can let me scrape NHentai.net again. They recently added Anubis and ever since then I haven’t been able to scrape, though the Anubis pop-up hasn’t been there in a week or two. I’ve been trying to find a way to get around it for a couple months.
with anubis, you need a completely different approach, in fact you’ll have much much better luck without a useragent set (or using the default golang UA)
Damn, so there’s no way to just solve the Anubis challenge on my browser and then have the scraper mimic me?
So what, will NHentai need a whole new scraper, because currently it’s just a YML file, but it sounds like it will need a Python script to solve the POW challenge from Anubis. That’s all I understand so far. I know some Python and YAML but I’ve never made something like this before
No, anubis doesn’t challenge you if you don’t pass in a Mozilla User-Agent, PoW isn’t that difficult to break, you’ll probably just have issues with other scrapers instead
I swear it was using Anubis a week or two ago, it would give the splash screen (or loading screen, whatever it should be called) until very recently. I emailed NHentai about the issue, since they provide download links for the galleries I figured preventing normal users from scraping was not intended, maybe they listened to my email.
I submitted an issue when it first became a problem but I can’t find my issue when searching through them. I made a throwaway GitHub account because I didn’t want to connect my real one with a porn app, so I can’t login to find my submission. It’s just gone, so can’t trust that a new one will get addressed.