2017 © Pedro Peláez
 

project auto_scrapy

Car-ads crawler

image

darkerth/auto_scrapy

Car-ads crawler

  • Saturday, May 20, 2017
  • by darkerth
  • Repository
  • 0 Watchers
  • 0 Stars
  • 12 Installations
  • Python
  • 0 Dependents
  • 0 Suggesters
  • 0 Forks
  • 0 Open issues
  • 1 Versions
  • 0 % Grown

The README.md

Car-ads crawler

Crawl todays ads from various Lithuanian car-ads websites based on some parameters., (*1)

Requirements

  • Python 2.7+

How to run

  • Navigate to the spider directory: cd auto/spiders
  • Execute the spider with python auto_spider.py {filename} {manufacturer} {model} {year_from} {year_to} {price_from} {price_to}

Example: python auto_spider.py response audi a4 2000 2006 0 3000 - this will crawl Audi A4 cars by 2000-2006 years and price 0-3000. Crawled result is saved on {filename}.json, (*2)

HTTPS support:

In case you need to crawl websites with SSL certificate * sudo apt-get install libssl-dev * pip install pyopenssl --upgrade, (*3)

To-Do:

  • Proxy support
  • In-depth ad crawling
  • Random delays

The Versions

20/05 2017

dev-master

9999999-dev

Car-ads crawler

  Sources   Download

MIT