2017 © Pedro Peláez
 

library crawler

Crawl your own website with various clients for SEO and indexing purposes.

image

mediamonks/crawler

Crawl your own website with various clients for SEO and indexing purposes.

  • Monday, December 4, 2017
  • by mediamonks
  • Repository
  • 8 Watchers
  • 14 Stars
  • 308 Installations
  • PHP
  • 1 Dependents
  • 0 Suggesters
  • 4 Forks
  • 0 Open issues
  • 6 Versions
  • 3 % Grown

The README.md

Build Status Scrutinizer Code Quality Code Coverage Total Downloads Latest Stable Version Latest Unstable Version SensioLabs Insight License, (*1)

MediaMonks Crawler

This tool allows you to easily crawl a website and get a DOM object for every url that was found. We use this to crawl our own site pages regardless if it was generated with server and/or client side content by using the Prerender.io client. The resulting data can be used for creating a full site search and/or improving SEO for single-page applications., (*2)

Highlights

  • Ships with Prerender & Prerender.io clients, uses Goutte by default
  • Supports any Symfony BrowserKit client
  • Supports both whitelisting and blacklisting of urls
  • Supports url normalization which allow you to prevent duplicates based on minor url differences
  • Implements the PSR-3 Logger Interface

Documentation

Documentation and examples can be found in the /doc folder., (*3)

System Requirements

You need:, (*4)

  • PHP >= 5.5.0

To use the library., (*5)

Install

Install this package by using Composer., (*6)

$ composer require mediamonks/crawler

Security

If you discover any security related issues, please email devmonk@mediamonks.com instead of using the issue tracker., (*7)

License

The MIT License (MIT). Please see License File for more information., (*8)

The Versions

04/12 2017

dev-master

9999999-dev https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider goutte robot domcrawler prerender prerender.io

04/12 2017

2.0.0

2.0.0.0 https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider goutte robot domcrawler prerender prerender.io

17/10 2017

dev-feature-client-interface

dev-feature-client-interface https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider robot prerender prerender.io

11/08 2017

1.1.0

1.1.0.0 https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider robot prerender prerender.io

31/03 2017

v1.0.1

1.0.1.0 https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider robot prerender prerender.io

28/11 2016

v1.0.0

1.0.0.0 https://www.mediamonks.com/

Crawl your own website with various clients for SEO and indexing purposes.

  Sources   Download

MIT

The Requires

 

The Development Requires

search dom crawler seo index spider robot prerender prerender.io