Skip to content
@apify

Apify

We're making the web more programmable.

Pinned Loading

  1. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 12.9k 562

  2. proxy-chain proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript 804 138

  3. apify-client-js apify-client-js Public

    Apify API client for JavaScript / Node.js.

    JavaScript 61 24

  4. apify-sdk-js apify-sdk-js Public

    Apify SDK monorepo

    TypeScript 108 29

  5. got-scraping got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript 422 32

  6. fingerprint-suite fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 802 83

Repositories

Showing 10 of 123 repositories
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee’s past year of commit activity
    TypeScript 12,911 Apache-2.0 562 104 (1 issue needs help) 6 Updated Jul 2, 2024
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee-python’s past year of commit activity
    Python 27 Apache-2.0 1 37 4 Updated Jul 2, 2024
  • actor-vector-database-integrations Public

    Transfer data from Apify Actors to vector databases (Pinecone, Chroma)

    apify/actor-vector-database-integrations’s past year of commit activity
    Python 1 Apache-2.0 1 0 0 Updated Jul 2, 2024
  • docusaurus-plugin-typedoc-api Public Forked from milesj/docusaurus-plugin-typedoc-api

    Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.

    apify/docusaurus-plugin-typedoc-api’s past year of commit activity
    TypeScript 0 23 0 0 Updated Jul 2, 2024
  • openapi Public

    An OpenAPI specification for the Apify API.

    apify/openapi’s past year of commit activity
    JavaScript 1 MIT 0 10 2 Updated Jul 2, 2024
  • apify-docs Public

    This project is the home of Apify's documentation.

    apify/apify-docs’s past year of commit activity
    API Blueprint 22 Apache-2.0 69 69 22 Updated Jul 2, 2024
  • actor-templates Public

    This project is the 🏠 home of Apify actor template projects to help users quickly get started.

    apify/actor-templates’s past year of commit activity
    Python 21 14 7 0 Updated Jul 2, 2024
  • apify-sdk-python Public

    The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

    apify/apify-sdk-python’s past year of commit activity
    Python 113 Apache-2.0 8 19 8 Updated Jul 1, 2024
  • apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    apify/apify-cli’s past year of commit activity
    TypeScript 117 17 25 (1 issue needs help) 5 Updated Jul 1, 2024
  • apify-shared-js Public

    Utilities and constants shared across Apify projects.

    apify/apify-shared-js’s past year of commit activity
    TypeScript 11 Apache-2.0 9 5 8 Updated Jul 1, 2024