You're right. It does require an RPC endpoint for mainnet (but it can be a slow, public one if you're scraping a different chain). There are a couple of issues here:
- It should fail WAY more gracefully.
- The docs should state this requirement.
I wrote this issue: #9 (comment)
I guess there are two ways to handle this:
- Require the end user to have an RPC for Ethereum mainnet, or
- Assume that, if there is no RPC for mainnet and the scraper is running against a different chain, there is no downloadable manifest for that chain, and proceed with a rebuild from block zero.
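
To make the two options concrete, here is a rough sketch of the startup decision. It is not the scraper's actual code: `plan_startup`, `StartupPlan`, `MissingMainnetRpcError`, and the `require_mainnet` switch are all hypothetical names, and it assumes the manifest can only be fetched when a mainnet RPC is configured.

```python
from dataclasses import dataclass
from typing import Optional

MAINNET = "mainnet"  # hypothetical chain name used for the comparison below


class MissingMainnetRpcError(RuntimeError):
    """Raised with a readable message instead of crashing later on."""


@dataclass
class StartupPlan:
    download_manifest: bool        # try to fetch the manifest first
    rebuild_from_block_zero: bool  # build the index from scratch


def plan_startup(chain: str, mainnet_rpc: Optional[str],
                 require_mainnet: bool) -> StartupPlan:
    """Decide how the scraper starts (hypothetical helper).

    require_mainnet=True  models the first option: always demand a mainnet RPC.
    require_mainnet=False models the second option: fall back to a full rebuild.
    """
    if mainnet_rpc:
        # Assumption: with a mainnet RPC the manifest can be looked up.
        return StartupPlan(download_manifest=True, rebuild_from_block_zero=False)

    if require_mainnet or chain == MAINNET:
        # Fail early and clearly rather than deep inside the scraper.
        raise MissingMainnetRpcError(
            f"No RPC endpoint configured for Ethereum mainnet "
            f"(needed while scraping '{chain}'). A slow, public one is fine."
        )

    # No mainnet RPC and a different chain: assume there is no
    # downloadable manifest and rebuild from block zero.
    return StartupPlan(download_manifest=False, rebuild_from_block_zero=True)
```

Either way the missing endpoint is caught up front, which would also cover the "fail more gracefully" point.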