- Notifications
You must be signed in to change notification settings - Fork14
A powerful Chrome extension for web scraping
License
dream-num/univer-clipsheet
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation

Univer Clipsheet is a powerful Chrome extension for web scraping and data automation. It simplifies the process of extracting, organizing, and managing web data with powerful scraping capabilities and workflow integration.
English |简体中文
Univer Clipsheet | Univer Clipsheet | Clipsheet Releases(ZIP) |
To install Clipsheet using the compressed ZIP file, follow the steps outlined in theManual Installation Guide.
- Custom Scraper Configuration: Easily define a scraper and its scraping strategy with a user-friendly interface.
- Flexible Execution: Run scrapers to extract data with support for advanced modes such as scrolling, clicking, and navigating through URLs.
- Dynamic Extraction: Extract and structure data dynamically from complex web pages.
- AJAX Request Interception: Intercept AJAX requests to capture and extract data from responses.
- Nested Data Handling: Configure drill-down columns to capture data from nested or linked elements.
- Granular Task Execution: Run drill-down tasks to collect detailed data from specified elements efficiently.
- Comprehensive Workflow Management: Create workflows that integrate multiple scrapers for end-to-end automation.
- Event-Driven Execution: Use scheduling features to automate workflows, allowing tasks to run at specified intervals or predefined times.
- Data Operations: Filter, de-duplicate, and schedule tasks to streamline the data collection process.
- Efficient Data Storage: Save and organize scraped data in a structured format.
- Data Aggregation: Merge results from multiple scrapers and workflows for consolidated analysis.
- CSV Export: Export scraped data to CSV files for easy sharing and further processing.
- Seamless Browser Interaction: Utilize the Chrome extension to interact directly with web pages for scraper configuration.
- Background Management: Leverage background scripts to control scrapers and workflows effortlessly.
- Popup Page Management: Use the popup page to display and manage lists of scrapers, workflows, and data, making it easy to view and operate various tasks.
- Side Panel Navigation: Manage scrapers, create and edit scraper configuration directly from an intuitive side panel.
- Comprehensive Forms: Configure scraping and workflow parameters through detailed forms for precision.
The core functionality ofUniver Clipsheet is implemented in a separate repository.For more details and access to the core codebase, please visit:univer-clipsheet-core.
To useUniver Clipsheet, ensure you have the following installed:
- Node.js
- pnpm
Clone the repository:
git clone https://github.com/dream-num/univer-clipsheet.git
Navigate to the project directory and install dependencies:
cd univer-clipsheetgit submodule update --initpnpm install
Build the project:
pnpm run build
Start the development server:
pnpm run dev
To useUniver Clipsheet as a Chrome extension, follow these steps:
Build the Extension:First, ensure the extension files are properly built.
pnpm run build
Open Chrome Extensions Page:Open Chrome and navigate to the extensions management page:
- Enter
chrome://extensions/
in the address bar and hitEnter. - Alternatively, click the menu (three dots in the top-right corner), go toMore Tools > Extensions.
- Enter
Enable Developer Mode:Toggle theDeveloper Mode switch in the top-right corner of the extensions page.
Load the Extension:
- Click theLoad unpacked button.
- In the file dialog, navigate to the
univer-clipsheet/dist
folder inside the project directory. - Select the folder and clickOpen.
Verify Installation:Once loaded, the extension should appear in the list of extensions, and its icon should be visible in the Chrome toolbar.
We welcome contributions to improveUniver Clipsheet!You can submitissues or feature requests to us
Copyright © 2021-2024 DreamNum Co,Ltd. All Rights Reserved.
Licensed under theApache-2.0 license.
Have questions or feedback?Feel free to open an issue on GitHub or reach out to us via email:developer@univer.ai.
About
A powerful Chrome extension for web scraping