FanScrape

DogmaDragon · May 30, 2025, 2:24am


	Summary	A fork of dc_onlyfans_fansdb with better scraper support.
	Repository	https://github.com/toddhow/FanScrape
	Source URL	https://toddhow.github.io/FanScrape/stable/index.yml
	Install	How to install a scraper?

FanScrape (FansDB)

[!note]
This is a fork of dc_onlyfans_fansdb.

This script is a companion to the OnlyFans/Fansly data scrapers by DIGITALCRIMINAL and derivatives.
These tools download posts from OnlyFans/Fansly and save their metadata to ‘user_data.db’ SQLite files.

[!note]
This script requires python3, stashapp-tools, and sqlite3.

Scraper Support

UltimaHoarder/UltimaScraper

datawhores/OF-Scraper

[!important]
If you are using datawhores/OF-Scraper, you have two choices:
either change your scraper config, or add a setting to the config.json file.
The OF-Scraper options you need to change are listed below.

"dir_format": "{sitename}/{model_username}/{responsetype}/{value}/{mediatype}",
"metadata": "{save_location}/{sitename}/{model_username}/Metadata",

Installation

[!warning]
Breaking Change:
As of commit 76f80f4, this code requires the Python module markdown to be installed where Stash runs (this could be inside your Docker container). A command such as one of the following will likely work:
pip install markdown
or
pip3.12 install markdown --break-system-packages

Managed

Go to Settings > Metadata Providers > Available Scrapers.
Click the Add Source button.
Insert the following values into their corresponding fields:

Name: FanScrape
Source URL: https://toddhow.github.io/FanScrape/stable/index.yml
Local Path: fanscrape

Select FanScrape, then press the Install button.

Manual

Instructions for manually installing scrapers can be found in Stash’s documentation.

Scenes

The post information for scenes will be scraped from the metadata database based on file name.

Currently the scraper returns the following information for scenes:

Title
Details
Date
Code
Studio
URLs
Performers
Tags

Please refer to Post Metadata for more information.

Galleries

The post information for galleries will be scraped from the metadata database based on directory.

[!important]
Since galleries are matched on directory, each post should be contained in a separate directory.

Currently the scraper returns the following information for galleries:

Title
Details
Date
Studio
URLs
Performers
Tags

Please refer to Post Metadata for more information.

Post Metadata

Title

In all cases, the title will be truncated on word boundaries (if possible) up to the configured max_title_length in config.json (default 64 characters).

When post contains no text: <username> - <post_date> [(<index_in_post>)]
Example: jonsnow - 2023-10-16 (2)
When the first line of the post text contains fewer than six (6) characters: <first_line> - <post_date> [(<index_in_post>)]
Example: Hi! - 2023-10-16
When first line of post text does not contain alpha-numeric characters: <first_line> - <post_date> [(<index_in_post>)]
Example: ❤️❤️❤️❤️❤️❤️❤️❤️ - 2023-10-16 (4)
Else: <first_line> [(<index_in_post>)]
Example: Lorem ipsum dolor sit amet, consectetur adipiscing elit.
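
The title rules above can be sketched in Python as follows. This is a minimal illustration, not the scraper's actual code: the function names `truncate_on_word_boundary` and `build_title` are hypothetical, and the sketch assumes that the `(<index_in_post>)` suffix counts toward `max_title_length`, which the README does not state.

```python
def truncate_on_word_boundary(text: str, max_length: int) -> str:
    """Cut text to at most max_length characters, preferring a word boundary."""
    if len(text) <= max_length:
        return text
    cut = text[:max_length]
    # If the cut landed mid-word, back up to the last space (when there is one).
    if not text[max_length].isspace() and " " in cut:
        cut = cut[: cut.rfind(" ")]
    return cut.rstrip()


def build_title(username, post_date, post_text, index=None, max_title_length=64):
    """Apply the four title rules; all names here are hypothetical."""
    lines = post_text.strip().splitlines()
    first_line = lines[0].strip() if lines else ""
    if not first_line:
        # Post contains no text.
        base = f"{username} - {post_date}"
    elif len(first_line) < 6 or not any(ch.isalnum() for ch in first_line):
        # First line is very short or has no alphanumeric characters.
        base = f"{first_line} - {post_date}"
    else:
        base = first_line
    suffix = f" ({index})" if index is not None else ""
    # Assumption: the index suffix counts toward the length limit.
    return truncate_on_word_boundary(base, max_title_length - len(suffix)) + suffix


print(build_title("jonsnow", "2023-10-16", "", index=2))
print(build_title("jonsnow", "2023-10-16", "Hi!"))
```

The two calls reproduce the first two examples in the list: a post without text falls back to the username and date, and a very short first line gets the date appended.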

Details

The details will contain the entirety of the post text.

Date

The date will contain the date on which the post was created.

Code

The code will contain the unique file id based on the value in either <link> or <linked> columns of the medias table.

Studio

The creator studio name will be set to the following: <username> (<network>) e.g. jonsnow (OnlyFans)
The creator studio URL will be set to the following:

OnlyFans: https://onlyfans.com/<username>
Fansly: https://fansly.com/<username>

The parent studio name will be set to the following: <network> (network) e.g. Fansly (network)
The parent studio URL will be set to the following:

OnlyFans: https://onlyfans.com/
Fansly: https://fansly.com/

URLs

For scenes and galleries, the URL will be set to the following:

OnlyFans: https://onlyfans.com/<post_id>/<username>
Fansly: https://fansly.com/post/<post_id>
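
As a rough sketch, the studio and URL patterns described above could be assembled like this. The helper names `studio_info` and `post_url` and the shape of the returned dictionary are hypothetical, chosen only for illustration; the string patterns themselves come from the lists above.

```python
# Base URLs per network, as listed in the Studio section.
NETWORK_BASE_URLS = {
    "OnlyFans": "https://onlyfans.com/",
    "Fansly": "https://fansly.com/",
}


def studio_info(username: str, network: str) -> dict:
    """Build the creator studio and its parent (network) studio."""
    base = NETWORK_BASE_URLS[network]
    return {
        "name": f"{username} ({network})",
        "url": f"{base}{username}",
        "parent": {"name": f"{network} (network)", "url": base},
    }


def post_url(username: str, network: str, post_id: str) -> str:
    """Build the scene/gallery URL; the pattern differs per network."""
    if network == "OnlyFans":
        return f"https://onlyfans.com/{post_id}/{username}"
    if network == "Fansly":
        return f"https://fansly.com/post/{post_id}"
    raise ValueError(f"unsupported network: {network}")


print(studio_info("jonsnow", "OnlyFans")["name"])
print(post_url("jonsnow", "Fansly", "12345"))
```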

Performers

The performer username is taken from the name of the folder immediately following “OnlyFans” or “Fansly” in the file path.

Example:
D:\stash-library\of-scraper\OnlyFans\<username>\...
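
A minimal sketch of this lookup, assuming the path is simply split on separators and the component after the network folder is returned. The function name `username_from_path` is hypothetical, and matching the network folder case-insensitively is an assumption of this sketch; the real scraper may be stricter.

```python
import re

# Assumption: compared case-insensitively; the real scraper may differ.
NETWORKS = ("onlyfans", "fansly")


def username_from_path(path: str):
    """Return the folder name right after the network folder, or None."""
    # Split on both Windows and POSIX separators.
    parts = [p for p in re.split(r"[\\/]+", path) if p]
    for i, part in enumerate(parts[:-1]):
        if part.lower() in NETWORKS:
            return parts[i + 1]
    return None


print(username_from_path(r"D:\stash-library\of-scraper\OnlyFans\jonsnow\Posts\1.jpg"))
```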

[!note]
The only performer that is being matched is the “owner” of the profile.

The scraper will try to resolve performer names by searching for performers with an alias matching the username.

By default, the scraper will search recursively from the performer directory for .jpg and .png files and base64 encode up to three (3) images for use as a performer image. These files are (by default) cached for 5 minutes by saving the base64 encoded images to disk to speed up bulk scraping.
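
The caching behavior can be illustrated with a short sketch. It is not the scraper's implementation: the function names and the layout of the cache file are hypothetical, and for brevity the encoded images are stored directly in a single JSON index rather than in a separate cache directory.

```python
import base64
import json
import time
from pathlib import Path


def encode_images(performer_dir, max_images=3):
    """Recursively find .jpg/.png files and base64 encode up to max_images."""
    paths = sorted(
        p for p in Path(performer_dir).rglob("*")
        if p.is_file() and p.suffix.lower() in {".jpg", ".png"}
    )
    return [base64.b64encode(p.read_bytes()).decode("ascii") for p in paths[:max_images]]


def cached_images(performer_dir, cache_file, cache_time=300, max_images=3, now=None):
    """Return cached images while they are fresh, else re-encode and store them."""
    now = time.time() if now is None else now
    cache_path = Path(cache_file)
    cache = json.loads(cache_path.read_text()) if cache_path.exists() else {}
    entry = cache.get(str(performer_dir))
    if entry and now - entry["created"] < cache_time:
        return entry["images"]  # Cache hit: skip the directory scan.
    images = encode_images(performer_dir, max_images)
    # Hypothetical cache layout: one entry per performer directory.
    cache[str(performer_dir)] = {"created": now, "images": images}
    cache_path.parent.mkdir(parents=True, exist_ok=True)
    cache_path.write_text(json.dumps(cache))
    return images
```

During a bulk scrape, repeated calls for the same performer within `cache_time` seconds reuse the stored result instead of walking the directory tree again.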

If desired, this behavior can be tweaked by changing these values in config.json:

  "max_performer_images": 3   # Maximum performer images to generate.
  "cache_time": 300           # Image expiration time (in seconds).
  "cache_dir": "cache"        # Directory to store cached base64 encoded images.
  "cache_file": "cache.json"  # File to store cache information in.

Configuration

[!important]
If you have enabled password protection on your Stash instance, filling in the apikey is required.

On first run, the scraper will write a default config.json file if it does not already exist.

Additionally, the cache_dir and cache_file will be created if they do not yet exist.

The values in the default config are as follows:

{
    "stash_connection": {
        "scheme": "http",
        "host": "localhost",
        "port": 9999,
        "apikey": ""
    },
    "max_title_length": 64,                 # Maximum length for scene/gallery titles.
    "tag_messages": True,                   # Whether to tag messages.
    "tag_messages_name": "[FS: Messages]",  # Name of tag for messages.
    "max_performer_images": 3,              # Maximum performer images to generate.
    "cache_time": 300,                      # Image expiration time (in seconds).
    "cache_dir": "cache",                   # Directory to store cached base64 encoded images.
    "cache_file": "cache.json",             # File to store cache information in.
    "meta_base_path": None,                 # Base path to search for 'user_data.db' files.
    "direct_db": {
        "override": False,
        "db_format": "/path/to/the/{network}/{username}/Metadata/user_data.db", # Format of the database path.
    },  # Allow overriding the database path.
}
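
The following sketch shows one way the first-run behavior and two of these settings might be used. The helper names `load_config`, `stash_endpoint`, and `direct_db_path` are hypothetical, and expanding `db_format` with `str.format` is an assumption. Note that the listing above uses Python literals (`True`, `None`) and comments; `json.dumps` writes them as `true` and `null`, and JSON itself has no comment syntax.

```python
import json
from pathlib import Path

# Defaults copied from the listing above.
DEFAULT_CONFIG = {
    "stash_connection": {"scheme": "http", "host": "localhost", "port": 9999, "apikey": ""},
    "max_title_length": 64,
    "tag_messages": True,
    "tag_messages_name": "[FS: Messages]",
    "max_performer_images": 3,
    "cache_time": 300,
    "cache_dir": "cache",
    "cache_file": "cache.json",
    "meta_base_path": None,
    "direct_db": {
        "override": False,
        "db_format": "/path/to/the/{network}/{username}/Metadata/user_data.db",
    },
}


def load_config(path):
    """Write the default config on first run, then load whatever is on disk."""
    path = Path(path)
    if not path.exists():
        path.write_text(json.dumps(DEFAULT_CONFIG, indent=4))
    return json.loads(path.read_text())


def stash_endpoint(config):
    """Build the GraphQL endpoint URL from the stash_connection block."""
    conn = config["stash_connection"]
    return f"{conn['scheme']}://{conn['host']}:{conn['port']}/graphql"


def direct_db_path(config, network, username):
    """Expand db_format when the override is enabled, else return None."""
    direct = config["direct_db"]
    if not direct["override"]:
        return None
    return direct["db_format"].format(network=network, username=username)
```

With the defaults, `stash_endpoint` yields `http://localhost:9999/graphql`, and `direct_db_path` returns `None` until `override` is switched on.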

Thanks

Thank you to WithoutPants for originally writing the script, and to xantor for maintaining the script as well as writing the README.
Thanks also to Jakan-Kink, who has been making significant updates since the beginning of August 2024.

Ehoalid · May 30, 2025, 2:57am

Hate to be the dummy, but I can’t get any of the galleries to scrape.

I copied and pasted the required fields into my ofscraper config and even started with a new folder to download to.

That didn’t work. I then saw that each gallery needs to be in its own folder, so I added {post_id} to the end of the directory format, but that doesn’t help either.

2025-05-29 20:44:45  Debug  Scraper script finished
2025-05-29 20:44:45  Error  [Scrape / FanScrape] Could not find metadata for gallery: /pornShare/onlyfans_ui_data/downloads/Onlyfans/bacon_bee/Posts/Free/Images/15197776
2025-05-29 20:44:45  Info   [Scrape / FanScrape] /pornShare/onlyfans_ui_data/downloads/Onlyfans/bacon_bee/Posts/Free/Images/15197776
2025-05-29 20:44:45  Info   [Scrape / FanScrape] Using database: /pornShare/onlyfans_ui_data/downloads/Onlyfans/bacon_bee/Metadata/user_data.db for /pornShare/onlyfans_ui_data/downloads/Onlyfans/bacon_bee/Posts/Free/Images/15197776
2025-05-29 20:44:45  Debug  [Scrape / FanScrape] Script runtime: after metadata db: 14.050444833002985 seconds
2025-05-29 20:44:31  Debug  [Scrape / FanScrape] Script runtime: after scene path: 0.0075932820327579975 seconds
2025-05-29 20:44:31  Debug  [Scrape / FanScrape] Script runtime: after scene path: 0.007424882147461176 seconds
2025-05-29 20:44:31  Debug  [Scrape / FanScrape] Using stash (v0.28.1-0) endpoint at http://localhost:9999/graphql
2025-05-29 20:44:30  Debug  Scraper script </usr/bin/python3 fanscrape.py queryGallery> started

Video files scrape correctly, but not galleries. What do I need to do?

smith113 · November 28, 2025, 5:49pm

Is there a summary of which of the two is better?

DogmaDragon · November 28, 2025, 7:48pm

Haven’t used any of them, sorry.