project auto_scrapy
Car-ads crawler
darkerth/auto_scrapy
- Saturday, May 20, 2017
- by darkerth
- Repository
- Python
Car-ads crawler
Crawls today's ads from various Lithuanian car-ad websites, filtered by manufacturer, model, year range, and price range.
Requirements
How to run
- Navigate to the spider directory:
cd auto/spiders
- Execute the spider with
python auto_spider.py {filename} {manufacturer} {model} {year_from} {year_to} {price_from} {price_to}
Example: python auto_spider.py response audi a4 2000 2006 0 3000
- This crawls Audi A4 ads with model years 2000-2006 and prices from 0 to 3000. The crawled result is saved to {filename}.json (response.json in this example).
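The seven positional arguments map onto a simple search filter. The following is only a minimal sketch of how such a command line could be read and validated; it is not taken from auto_spider.py, and the names `SearchParams` and `parse_args` are hypothetical, introduced here for illustration.

```python
from typing import NamedTuple, Sequence


class SearchParams(NamedTuple):
    """Hypothetical container for the seven positional arguments."""
    filename: str
    manufacturer: str
    model: str
    year_from: int
    year_to: int
    price_from: int
    price_to: int

    @property
    def output_path(self) -> str:
        # The README states that results are saved to {filename}.json.
        return f"{self.filename}.json"


def parse_args(argv: Sequence[str]) -> SearchParams:
    """Turn the raw argument list (without the script name) into SearchParams."""
    if len(argv) != 7:
        raise SystemExit(
            "usage: python auto_spider.py {filename} {manufacturer} {model} "
            "{year_from} {year_to} {price_from} {price_to}"
        )
    filename, manufacturer, model, *numbers = argv
    # The last four arguments are numeric ranges; int() raises ValueError otherwise.
    year_from, year_to, price_from, price_to = (int(n) for n in numbers)
    if year_from > year_to or price_from > price_to:
        raise SystemExit("each range must satisfy from <= to")
    return SearchParams(filename, manufacturer, model,
                        year_from, year_to, price_from, price_to)


# Same values as the example command above.
params = parse_args(["response", "audi", "a4", "2000", "2006", "0", "3000"])
print(params.output_path)  # response.json
```

Validating the ranges up front means a swapped pair such as `2006 2000` fails immediately instead of producing an empty crawl.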
HTTPS support:
If you need to crawl websites served over HTTPS (with an SSL certificate), install the OpenSSL development headers and upgrade pyOpenSSL:
* sudo apt-get install libssl-dev
* pip install pyopenssl --upgrade
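After installing these packages, a quick check can confirm that the interpreter sees them. This is a small sketch using only the standard library; the helper name `https_support_report` is hypothetical and not part of this project.

```python
import importlib.util
import ssl


def https_support_report() -> dict:
    """Report the OpenSSL build used by Python and whether pyOpenSSL is importable."""
    return {
        # OpenSSL version string the standard-library ssl module was built against.
        "stdlib_openssl": ssl.OPENSSL_VERSION,
        # pyOpenSSL installs the top-level package named "OpenSSL".
        "pyopenssl_installed": importlib.util.find_spec("OpenSSL") is not None,
    }


print(https_support_report())
```

If `pyopenssl_installed` is `False`, the `pip install pyopenssl --upgrade` step most likely ran against a different Python environment than the one running the spider.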
To-Do:
- Proxy support
- In-depth ad crawling
- Random delays