NHentai scraping

I’ve installed your plugin but pressing any of the buttons doesn’t seem to change the string under the “Scraper User Agent” setting. I’ve tried putting my user-agent in there myself but no luck, I still get a “http error 403:Forbidden” error when I try to scrape.

How do I pass in my login cookies to Stash?

You can add cookies by modifying your scraper definitions

You can reference the DMM scraper for the syntax

Thank you, hopefully that can let me scrape NHentai.net again. They recently added Anubis and ever since then I haven’t been able to scrape, though the Anubis pop-up hasn’t been there in a week or two. I’ve been trying to find a way to get around it for a couple months.

with anubis, you need a completely different approach, in fact you’ll have much much better luck without a useragent set (or using the default golang UA)

Damn, so there’s no way to just solve the Anubis challenge on my browser and then have the scraper mimic me?
So what, will NHentai need a whole new scraper, because currently it’s just a YML file, but it sounds like it will need a Python script to solve the POW challenge from Anubis. That’s all I understand so far. I know some Python and YAML but I’ve never made something like this before

No, anubis doesn’t challenge you if you don’t pass in a Mozilla User-Agent, PoW isn’t that difficult to break, you’ll probably just have issues with other scrapers instead

NHentai is not using anubis it’s using CF. Please open up an issue where it’s appropiate GitHub - stashapp/CommunityScrapers: This is a public repository containing scrapers created by the Stash Community.

I swear it was using Anubis a week or two ago, it would give the splash screen (or loading screen, whatever it should be called) until very recently. I emailed NHentai about the issue, since they provide download links for the galleries I figured preventing normal users from scraping was not intended, maybe they listened to my email.

I submitted an issue when it first became a problem but I can’t find my issue when searching through them. I made a throwaway GitHub account because I didn’t want to connect my real one with a porn app, so I can’t login to find my submission. It’s just gone, so can’t trust that a new one will get addressed.

Unfortunately the current NHentai scraper doesn’t have a Python file, so I can’t just slap cloudscraper onto it and see if that helps.I’ll read through the other issues with the cloudflare label and your write-up on DDOS guards and see if I can put something together myself.