Demonstration

Example 1: (Right Click) Select a product title, the same type of title will be automatically matched, click the 'Select All' option -> Click the 'Collect Data' option, you can collect the title information of all products.

Example 2: (Right click) Select a large product block -> Click the 'Select All' option -> Click the 'Select Child Elements' option -> Click the 'Collect Data' option, you can collect the information of all products, and will be saved by sub-field.

动态演示
Screenshot

Software Features

Suitable for All Groups of People at Work

Design and execute crawler/browser automation test tasks visually like using Excel, regardless of coding experience.

Support for Configuring Cloud Servers

You can configure the cloud server address for unified storage of task information and unified management.

Define Any Complex Tasks

Supports infinite loop nesting and conditional judgments, can also execute custom JavaScript instructions and system instructions. You can insert the exit loop Break statement into any position of the flowchart.

Open Source, Free, No Ads

The code is open source, and all features of the software are free (except for commercial use). There are no pop-ups or external advertisements.

Cross-platform

The software can run on Windows, MacOS, and Linux.

Simple and Fast

Graphical interface for setting up and executing web scraping tasks. Typically, designing a web scraping task takes only 2-5 minutes.

Secure

No registration required. All tasks and data are saved locally and do not pass through any third-party servers.

Flexible

Can freely add browser extensions, execute JavaScript instructions, and directly manipulate the browser using Selenium statements.

Parallel Execution

Can run multiple instances of the program simultaneously to achieve parallel data collection.

Dynamic Debugging
Click/double-click the completed operation to automatically mark/trial run, convenient for locating and debugging issues, saving time on modifying tasks.
Custom Plugins
Supports external custom plugins to achieve any custom functionality.

Captcha Recognition

Supports various captcha recognition methods, such as image captchas and reCAPTCHA.

Element Screenshot and OCR

Supports element screenshot, OCR recognition, and image downloading.

Proxy IPs

Supports switching tunnel IPs and private IPs.

LAN Access

Supports usage within local area networks without internet access.

External Program Invocation

Can invoke external programs to fulfill complex requirements.

API Calls

Can execute tasks through API calls, enabling advanced automation.

Scheduled Execution

Supports scheduled task execution, becoming a helpful assistant in daily life.

Pause Anytime

Pause task execution anytime for manual debugging, page inspection, or entering captchas.

Custom Environment

Can modify the Python environment at any time, such as defining custom variables, and write the variable values into any code statement.

Mobile Device Emulation

Supports emulation of mobile devices for scraping mobile web pages.

Flexible Import

Supports reading Excel files for importing large batches of input parameters.

Free Export

Can export data as Excel/CSV/TXT files and write to MySQL databases.

Task Migration

Tasks can be copied and migrated to other machines, regardless of the operating system environment.

Page Scrolling

Can set up page scrolling to retrieve content that loads dynamically.

IFRAME Support

Supports scraping data within iframe tags, including multi-layer nested iframes.

Cookie Manipulation

Can retrieve and modify page cookies.

Command Line Execution

Execute tasks directly from the command line, seamlessly integrating with other programs.

JSON Support

Can collect JSON files.

File Download

Can download files, such as picture, PDF, etc.

Popup Handling
Supports handling browser Alert and Confirm popups.

Copyright and Support Author

This software is intended for educational and communication purposes only. It is strictly prohibited to use the software for any illegal activities or operations, such as crawling government/military websites that are not allowed to be crawled. The user bears all consequences resulting from the use of this software and the author shall not be held responsible or liable in any way. Furthermore, the software is protected by patent rights. If you intend to use it for commercial purposes or profit-making activities, such as using the software for client orders, selling the collected data, please contact author: naibowang@foxmail.com for patent authorization and payment operations: https://www.patentguru.com/cn/search?q=一种自定义提取流程的服务封装系统

For individual users, EasySpider is a completely free and ad-free open-source software. The development and maintenance of the software rely entirely on the author's voluntary contributions. Therefore, you can choose to support the author and enable them to have more passion and energy to maintain this software. If you have benefited from this software and made a profit, you are also welcome to support the author through the following methods:

1. Alipay account: naibowang@foxmail.com. You can also scan the QR code below.

2. WeChat Pay: Scan the QR code below.

3. PayPal account: naibowang. You can also scan the QR code below.