Skip to content
@scrapinghub

Scrapinghub

Turn web content into useful data

Pinned Loading

  1. splash Public

    Lightweight, scriptable browser as a service with an HTTP API

    Python 4.1k 513

  2. dateparser Public

    python parser for human readable dates

    Python 2.6k 470

  3. python-scrapinghub Public

    A client interface for Scrapinghub's API

    Python 205 63

  4. extruct Public

    Extract embedded metadata from HTML markup

    Python 893 116

  5. spidermon Public

    Scrapy Extension for monitoring spiders execution.

    Python 539 100

  6. python-crfsuite Public

    A python binding for crfsuite

    Python 772 222

Repositories

Showing 10 of 183 repositories
  • shub Public

    Scrapinghub Command Line Client

    Python 131 BSD-3-Clause 80 46 (7 issues need help) 14 Updated Mar 6, 2025
  • shub-workflow Public
    Python 13 BSD-3-Clause 15 2 2 Updated Feb 26, 2025
  • python-scrapinghub Public

    A client interface for Scrapinghub's API

    Python 205 BSD-3-Clause 63 25 4 Updated Feb 21, 2025
  • price-parser Public

    Extract price amount and currency symbol from a raw text string

    Python 322 BSD-3-Clause 51 16 (4 issues need help) 9 Updated Feb 13, 2025
  • scrapinghub-entrypoint-scrapy Public

    Scrapy entrypoint for Scrapinghub job runner

    Python 25 BSD-3-Clause 16 8 1 Updated Feb 12, 2025
  • scrapy-poet Public

    Page Object pattern for Scrapy

    Python 120 BSD-3-Clause 28 12 (1 issue needs help) 4 Updated Feb 12, 2025
  • scrapinghub-stack-scrapy Public

    Software stack with latest Scrapy and updated deps

    Dockerfile 63 BSD-3-Clause 20 2 2 Updated Feb 11, 2025
  • web-poet Public

    Web scraping Page Objects core library

    Python 96 BSD-3-Clause 15 18 (1 issue needs help) 13 Updated Feb 10, 2025
  • andi Public

    Library for annotation-based dependency injection

    Python 22 BSD-3-Clause 5 4 1 Updated Feb 7, 2025
  • frontera Public

    A scalable frontier for web crawlers

    Python 1,309 BSD-3-Clause 218 79 (8 issues need help) 19 Updated Feb 5, 2025