• Screenshot 1

Description


Norconex HTTP Collector - Fast and Efficient Website Crawling Tool



If you are looking for a reliable tool to crawl websites quickly and efficiently, look no further than Norconex HTTP Collector. This powerful application is designed to help you extract valuable information from websites and feed it directly to a search engine or save it to a local folder.



Key Features:




  • Multi-threaded operations for faster results

  • Automatic language detection

  • Support for extracting text from images and PDFs

  • Compatibility with various document formats, including HTML and Office documents

  • Processing of canonical URLs

  • Customizable crawling speed

  • Ability to treat embedded documents as distinct files

  • Filtering options based on URL or HTTP headers

  • Support for metadata information

  • Sample files for easy testing

  • Online manual and forums for assistance



Technical Specifications:



System Requirements:



  • Operating System: Windows, Mac, Linux

  • Processor: Intel Core i5 or higher

  • RAM: 4GB minimum

  • Storage: 100MB available space



Additional Information:






Tags:

User Reviews for Norconex HTTP Collector 1

  • for Norconex HTTP Collector
    Norconex HTTP Collector is an invaluable tool for web crawling. Its multi-threaded operations and support for various formats make it efficient and versatile.
    Reviewer profile placeholder Sarah Johnson