A tool for scraping and processing content from ENS forums.
.
├── data/ # Data storage directory
│ └── output/ # Scraped data output
├── src/ # Source code
│ └── scrapers/ # Scraper scripts
└── logs/ # Log files
- Install dependencies:
  npm install
- Make sure you have the required URLs in a waymore.txt file.
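Before a run, it can help to sanity-check the URL list. The sketch below is not part of this repository. It assumes waymore.txt holds one URL per line and sits in the project root, and the names parseUrlList and loadUrls are hypothetical. It keeps only unique http(s) URLs and silently skips anything that does not parse.

```javascript
const fs = require("node:fs");

// Assumption: the file holds one URL per line; blank lines are ignored.
// Returns the unique http(s) URLs in their original order.
function parseUrlList(text) {
  const seen = new Set();
  for (const line of text.split(/\r?\n/)) {
    const candidate = line.trim();
    if (candidate === "") continue;
    try {
      const url = new URL(candidate);
      if (url.protocol === "http:" || url.protocol === "https:") {
        seen.add(url.href);
      }
    } catch {
      // Not a parseable absolute URL; skip it.
    }
  }
  return [...seen];
}

// Hypothetical helper: the default path is an assumption about where
// waymore.txt lives relative to the working directory.
function loadUrls(path = "waymore.txt") {
  return parseUrlList(fs.readFileSync(path, "utf8"));
}
```

Calling loadUrls() from the project root and checking that the returned array is non-empty is a quick way to confirm the file is in place before starting the scraper.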
Run the standard scraper:
npm run ui
A bookmarklet is also available for manual scraping:
- Create a new bookmark in your browser
- Set the name to "ENS Forum Scraper"
- Copy the content of src/scrapers/bookmarklet.js as the URL
- Navigate to the forum page you want to scrape
- Click the bookmarklet
Scraped data is saved in the data/output directory in text format with the following structure:
=== BEGINNING OF TOPIC ===
Title: [Topic Title]
Author: [Author Name]
Date: [Post Date]
[Post Content]
=== END OF POST ===
Author: [Reply Author]
Date: [Reply Date]
[Reply Content]
=== END OF POST ===
====== END OF TOPIC ======
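For downstream processing, the text format above can be read back into objects. The following is a minimal sketch, not code from this repository. The function name parseTopics and the returned object shape are hypothetical, and it assumes each marker sits on its own line exactly as shown and that the Author and Date lines come before the post content.

```javascript
// Parses scraper output into [{ title, posts: [{ author, date, content }] }].
function parseTopics(text) {
  const topics = [];
  let topic = null; // topic currently being read
  let post = null;  // post currently being read

  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trimEnd();

    if (line === "=== BEGINNING OF TOPIC ===") {
      topic = { title: "", posts: [] };
      post = null;
    } else if (topic === null) {
      continue; // ignore anything outside a topic
    } else if (line === "====== END OF TOPIC ======") {
      topics.push(topic);
      topic = null;
      post = null;
    } else if (line === "=== END OF POST ===") {
      if (post !== null) {
        topic.posts.push({
          author: post.author,
          date: post.date,
          content: post.body.join("\n").trim(),
        });
      }
      post = null;
    } else if (post === null && topic.posts.length === 0 && line.startsWith("Title: ")) {
      // Only the first post of a topic carries a title.
      topic.title = line.slice("Title: ".length);
    } else {
      if (post === null) {
        post = { author: "", date: "", body: [], inHeader: true };
      }
      if (post.inHeader && line === "") {
        continue; // blank line before the content starts
      } else if (post.inHeader && line.startsWith("Author: ")) {
        post.author = line.slice("Author: ".length);
      } else if (post.inHeader && line.startsWith("Date: ")) {
        post.date = line.slice("Date: ".length);
      } else {
        // First non-header line: everything up to the post marker is content.
        post.inHeader = false;
        post.body.push(line);
      }
    }
  }
  return topics;
}
```

Header fields are only recognised before the content begins, so a line inside a post that happens to start with "Author: " stays part of that post's content. A topic that is missing its closing marker is dropped rather than returned half-parsed.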