Crawlee for Python: Güvenilir Web Scraping Araçları

Crawlee: Powerful Web Scraping and Browser Automation Library

Introduction

Crawlee is a robust web scraping and browser automation library for Python. It enables developers to build reliable crawlers quickly and efficiently.

Key Features

Python implementation with type hints
Seamless switching between HTTP and headless browser crawling
Built on Playwright for browser automation
Automatic scaling and proxy management
Support for Chrome, Firefox, and other browsers

Use Cases

Web scraping at scale
Browser automation tasks
Data extraction from JavaScript-rendered websites
Maintaining large-scale crawling projects

Teams

Crawlee is developed by experienced web scraping professionals who use it daily for large-scale data extraction projects.

Getting Started

pipx run crawlee create my-crawler
pip install 'crawlee[playwright]'
playwright install

Example Usage

import asyncio
from crawlee.playwright_crawler import PlaywrightCrawler, PlaywrightCrawlingContext

async def main():
    crawler = PlaywrightCrawler(
        max_requests_per_crawl=5,
        headless=False,
        browser_type='firefox',
    )

    @crawler.router.default_handler
    async def request_handler(context: PlaywrightCrawlingContext) -> None:
        await context.enqueue_links()
        data = {
            'url': context.request.url,
            'title': await context.page.title(),
            'content': (await context.page.content())[:100],
        }
        await context.push_data(data)

    await crawler.run(['https://crawlee.dev'])
    await crawler.export_data('results.json')

if __name__ == '__main__':
    asyncio.run(main())

Crawlee for Python Alternatifleri

No-Code Scraper

Kod yazmadan herhangi bir web sitesinden veri çıkarın.

Octoparse

Herkes İçin Kolay Web Kazıma.

Kimono Labs

Var olmayan yerlerde API oluşturun. Kimono ile hızlıca...

Saldor

Saldor, büyük dil modelleri için en iyi web verilerini çıkarır.

InstantAPI

Web sitelerini anında özelleştirilebilir API'lere dönüştürün.

AgentQL

Acısız Veri Çıkarma ve Web Otomasyonu

Nimble API

Web verilerini sorunsuz bir şekilde tarayın, ayrıştırın ve ölçeklendirin

Scraping Fish

Engellenmeden web kazıma için en basit API.

Bytebot

Yapay zekâ destekli tarayıcı otomasyonları.

MrScraper

Web scraping'i kolaylaştırın

Crawlee for PythonBuild reliable scrapers in Python

Crawlee: Powerful Web Scraping and Browser Automation Library

Introduction

Key Features

Use Cases

Teams

Getting Started

Example Usage

Crawlee for Python Alternatifleri

No-Code Scraper

Octoparse

Kimono Labs

Saldor

InstantAPI

AgentQL

Nimble API

Scraping Fish

Bytebot

MrScraper

Haftanın En İyi 10 Ürünü

Osmos

Zivy

Fibr

AnyParser API (YC S23)

Surfsite AI

AIPhone.AI

Supademo 3.0

Cracked (YC S24)

ConfettiTherapy.com

Creem