Highlights : About CG Enterprise

The visual point and click editor is easy to use even for non-technical users. Automatically detects and configures all commands types. Browser-like view of website data. Often no coding is required, custom code can be added at any point in the workflow.

Powerful testing and debugging features help you build reliable agents.Solid error handling and error recovery will keep the agents running in the most difficult scenarios. Easily scale with multiple sessions running in parallel and work distributed across multiple servers/clouds.

Embed the CG Enterprise runtime into your own software Call the CG Enterprise Rest API from anywhere Export directly into third-party Data Analytics / Visualization tools

Easily shift your operation from an outsourced services model to in-house without needing to start again. Scripting can be used for more precise control if you have unusual requirements or for process tuning.

For organizations that rely on web data as an input to their own data products, CG Enterprise helps ensure strict compliance to website data usage terms. Agent configurations are stored in version control with changes tracked, supporting an audit ready operation and clear control over key concerns like rate or type of requests being made, making it easy to comply with pre-defined operating guidelines. An agent can even be configured to halt all data collection if requests are not in compliance with the target website’s robots.txt file.

You can run CG Enterprise on your own infrastructure to develop agents and extract content from as many websites as you like. There are no restrictions on the number of agents, page loads or websites to extract from and there are no monthly data fees. You can also control your own data security.

Export data in numerous formats including Excel, CSV, JSON, XML, PDF, MYSQL, SQL Server, Oracle, Apache Parquet, MongoDB, Cosmos and most other databases via OleDB. Ability to deliver data to many local and cloud object stores (i.e. Amazon AWS S3, Azure, Google Drive/Cloud, Dropbox, SFTP, Email). Data de-duplication & the ability to write directly to custom data structures.
Product Details
Features
Image Extraction
Email Address Extraction
Web Data Extraction
Disparate Data Collection
Phone Number Extraction
IP Address Extraction
Document Extraction
Pricing Extraction