Skip to content

40x errors not counted against request limits #73

@eaton

Description

@eaton

Need to do some additional testing, bit it appears that 404 errors and other failures allow unthrottled requests to continue hitting the scraped server. This includes 429 errors, which leads to the counterproductive outcome of rate limit errors triggering unthrottled crawling.

Not sure if this is an issue in the underlying Crawlee implementation of rate limiting, or a place where we're overriding some edge case behavior, but it should be investigated.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions