Apify

We're making the web more programmable.

Apify is the largest ecosystem where developers build, deploy, and publish data extraction and web automation tools. We call them Actors.

Learn About Apify 🧑‍🎓

  • Find hundreds of ready-made Actors for your web scraping or automation project on Apify Store.
  • Learn everything about web scraping and automation with our free courses that will turn you into an expert scraping developer.
  • Publish your web scrapers as paid Actors on the Apify platform, attract people who need these solutions, and get regular passive income.
  • Watch our livestreams and video content on the Apify YouTube channel.
  • Learn more through tutorials and thought leadership content about web scraping on Apify Blog and Crawlee Blog.

Repositories 💻

  • Crawlee: A web scraping and browser automation library for Node.js and Python to build reliable scrapers.
  • Apify CLI: The Apify command-line interface helps you create, develop, build, and run Apify Actors, and manage the Apify cloud platform.
  • proxy-chain: Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
  • fingerprint-suite: Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
  • got-scraping: HTTP client made for scraping based on got.
  • 👉 Check out more of our repositories here. 👈

We are hiring! 🕸️

Check out the open positions at Apify and help us make the web more programmable.

Pinned

  1. crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python · 5.1k stars · 333 forks

  2. crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript · 16.5k stars · 736 forks

  3. apify-cli Public

    The Apify command-line interface helps you create, develop, build, and run Apify Actors, and manage the Apify cloud platform.

    TypeScript · 128 stars · 20 forks

  4. proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript · 868 stars · 145 forks

  5. got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript · 577 stars · 48 forks

  6. fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript · 1.1k stars · 117 forks

Repositories

Showing 10 of 138 repositories
  • apify-cli Public

    The Apify command-line interface helps you create, develop, build, and run Apify Actors, and manage the Apify cloud platform.

    TypeScript · 128 stars · 20 forks · 37 issues (1 issue needs help) · 5 pull requests · Updated Jan 18, 2025
  • workflows Public

    Apify's reusable GitHub workflows

    Python · 7 stars · 4 forks · 4 issues · 7 pull requests · Updated Jan 18, 2025
  • apify-docs Public

    This project is the home of Apify's documentation.

    API Blueprint · 31 stars · Apache-2.0 · 81 forks · 75 issues (2 issues need help) · 25 pull requests · Updated Jan 17, 2025
  • actors-mcp-server Public

    Model Context Protocol (MCP) Server for Apify's Actors

    TypeScript · 0 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Jan 17, 2025
  • apify-client-python Public

    Apify API client for Python

    Python · 53 stars · Apache-2.0 · 12 forks · 9 issues · 4 pull requests · Updated Jan 17, 2025
  • apify-sdk-js Public

    Apify SDK monorepo

    TypeScript · 128 stars · Apache-2.0 · 41 forks · 11 issues · 11 pull requests · Updated Jan 17, 2025
  • apify-client-js Public

    Apify API client for JavaScript / Node.js.

    TypeScript · 67 stars · Apache-2.0 · 27 forks · 17 issues · 6 pull requests · Updated Jan 17, 2025
  • actor-whitepaper-web Public

    Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.

    MDX · 1 star · MIT · 0 forks · 4 issues · 0 pull requests · Updated Jan 17, 2025
  • apify-sdk-python Public

    The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like Actor lifecycle management, local storage emulation, and Actor event handling.

    Python · 121 stars · Apache-2.0 · 10 forks · 12 issues · 1 pull request · Updated Jan 17, 2025
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    TypeScript · 16,540 stars · Apache-2.0 · 736 forks · 133 issues (1 issue needs help) · 23 pull requests · Updated Jan 17, 2025