Develope a domain and E-Mail scraper
1. You scrape https://www.similarweb.com/website/ for domains, categories, language and emails 2. You scrape the meta search engine https://www.ixquick.de/do/search or http://msxml.excite.com/search/web?qsi=41&q=*.tumblr.com&p=5&fcoid=4&fcop=bottomnav&fpid=2 or for every domain i want to search and find domains and subdomains Example: i entry *.tumblr.com and you find all tumblr.com web 20 subdomains or *.wordpress.com. i want to edit a domain and then crawle the domain and find domains and subdomains. 3. Open all domains and subdomains and serach for mailadress with @ or at
I want the results in the database with all the informations: Domain/Subdomain - category - emails - website language
Find category (box category rank) and language (box language rank) on https://www.similarweb.com/website/hatenablog.com#overview
You deliver the scaper and a example result in csv. Please give me your proposal and tell me which language do you develope the scaper. i like linux software.
Functions: I want insert IPs that are rotate
-The scraper is to be delivered as a clean valid code without any syntax errors and detected files for hack. -In order to follow every step of the development, i should get a link from your development sever as soon as you start with the project. -All 8 checking tools must show good results: http://www.sitepoint.com/8-must-have-php-quality-assurance-tools/ . -The clean code is to be delivered with english documentation in the code lines. - W3C validated and good google speed up -I should get a free, open source code. -You Install it on your linux Server and i will check the functions as soon as you ́re done. -After you finished you work i should get the complete scraper with a documentation for the installation.
or is there a way that i can change the 3 different meta search engines for scraping in a small admin: 1. https://www.ixquick.de/do/search?cmd=process_search&query=*.tumblr.com&language=deutsch&lui=&pl=&with_date=&ff=&with_language=lang_de&hmb=1&abp=-1 2. or https://www.fastbot.de/?query=+%2A.tumblr.com 3. or http://msxml.excite.com/search/web?fcoid=417&fcop=topnav&fpid=27&q=*.tumblr.com
When you dont have your own crawler, we can take this one: https://codecanyon.net/item/advanced-email-scraper-saas-pack/14851978 https://codecanyon.net/item/php-search-engine-crawler/1917466
what is your offer?
Delivery Time: 10 Days