Showing content from https://crawlee.dev/python/api/class/PlaywrightCrawler below:
PlaywrightCrawler | API | Crawlee for Python ยท Fast, reliable Python web crawlers.
PlaywrightCrawler
Index Methods
- __init__(*, browser_pool, browser_type, user_data_dir, browser_launch_options, browser_new_context_options, fingerprint_generator, headless, use_incognito_pages, request_handler, statistics, configuration, event_manager, storage_client, request_manager, session_pool, proxy_configuration, http_client, max_request_retries, max_requests_per_crawl, max_session_rotations, max_crawl_depth, use_session_pool, retry_on_blocked, concurrency_settings, request_handler_timeout, abort_on_error, configure_logging, statistics_log_format, keep_alive, additional_http_error_status_codes, ignore_http_error_status_codes, respect_robots_txt_file, status_message_logging_interval, status_message_callback): None
- async add_requests(requests, *, forefront, batch_size, wait_time_between_batches, wait_for_all_requests_to_be_added, wait_for_all_requests_to_be_added_timeout): None
- error_handler(handler): ErrorHandler[TCrawlingContext]
- async export_data(path, dataset_id, dataset_name): None
- failed_request_handler(handler): FailedRequestHandler[TCrawlingContext]
- async get_dataset(*, id, name): Dataset
- pre_navigation_hook(hook): None
- Parameters
- hook: Callable[[PlaywrightPreNavCrawlingContext], Awaitable[None]]
Returns None
Properties
router: Router[TCrawlingContext]
statistics: Statistics[TStatisticsState]
RetroSearch is an open source project built by @garambo
| Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4