Closed

Data Mining

The Project:

I set the budget at $3-$250 but realize the project will cost more than that.

You will be responsible for programming the backend and administering the retrieval of domain name owners' information, also known as the whois database. You will be given a list of 101 million domain names organized by extension, covering all com, net, org, biz, info and us domains. You will collect the following information for these domains and organize it into spreadsheets with these columns:

"Domain,TLD,Registrant,AdminName,AdminAddress,AdminCity,AdminState,AdminZip,AdminCountry,AdminPhone,AdminFax,AdminEmail,NameServer,Created,Updated,Expired"
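As an illustration of the column layout above, here is a minimal Python sketch that writes records in that order to a CSV file a spreadsheet can open. The helper names `make_record` and `write_rows` are hypothetical, and it assumes the "Domain" column holds the full lowercase domain name and "TLD" holds the part after the last dot; the posting does not specify either.

```python
import csv
import io

# Column order copied from the posting.
COLUMNS = (
    "Domain,TLD,Registrant,AdminName,AdminAddress,AdminCity,AdminState,"
    "AdminZip,AdminCountry,AdminPhone,AdminFax,AdminEmail,NameServer,"
    "Created,Updated,Expired"
).split(",")


def make_record(domain, **fields):
    """Build one row; Domain/TLD handling is an assumption, not from the posting."""
    domain = domain.strip().lower()
    tld = domain.rpartition(".")[2]  # text after the last dot
    record = {"Domain": domain, "TLD": tld}
    record.update(fields)
    return record


def write_rows(records, out):
    """Write a header plus one line per record; missing fields are left blank."""
    writer = csv.DictWriter(out, fieldnames=COLUMNS, restval="")
    writer.writeheader()
    for record in records:
        writer.writerow(record)


if __name__ == "__main__":
    buf = io.StringIO()
    write_rows([make_record("Example.COM", Registrant="Example, Inc.")], buf)
    print(buf.getvalue())
```

Fields that contain commas, such as addresses, are quoted automatically by the `csv` module, so the columns stay aligned.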

There are software packages specifically designed to extract whois data for domain owners; you can download these as examples:

[url removed, login to view]

[url removed, login to view]

There are about 1,000 registrars whose whois servers will need to be queried. One issue is that a whois server will temporarily block an IP (to prevent spam) after a certain number of queries. For example, the block may be triggered every 20-30 domains and last 1 minute. These are some of the variables you will need to examine for your own calculations.
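A rough way to run these calculations is sketched below in Python. The function `estimate_days` is hypothetical, and the 1 second per query is an assumed value the posting does not give. It also treats every whois server as sharing a single limit, which is likely pessimistic because each registrar's server presumably tracks blocks separately.

```python
def estimate_days(total_domains, queries_per_block, block_seconds,
                  seconds_per_query, num_ips):
    """Estimate wall-clock days to query every domain once.

    Each IP is modeled as repeating a cycle: send `queries_per_block`
    queries, then wait out a block of `block_seconds`.
    """
    cycle_seconds = queries_per_block * seconds_per_query + block_seconds
    per_ip_rate = queries_per_block / cycle_seconds  # domains per second
    total_seconds = total_domains / (per_ip_rate * num_ips)
    return total_seconds / 86400  # seconds per day


if __name__ == "__main__":
    for ips in (1, 100, 1000):
        days = estimate_days(101_000_000, 20, 60, 1.0, ips)
        print(f"{ips:>5} IPs: about {days:,.0f} days")
```

Under these assumptions a single IP would need roughly 4,676 days, so the number of IPs and the real block thresholds found in testing dominate the schedule.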

One possible approach is to program a server-based extractor that uses many non-sequential IPs and servers to collect the data. You would program the back end to rotate the IPs every 20 queries (or whatever number you determine is best). You can do your own testing to determine what is involved. This is just one idea for you to examine; you may come up with your own solution for this project.
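The rotation idea could be sketched in Python as follows. This is only an illustration: the `IpRotator` class and `whois_query` helper are hypothetical names, the addresses are documentation placeholders, and it assumes the machine actually has each source IP bound to a local interface. The query itself follows the plain whois protocol (TCP port 43, one line terminated by CRLF).

```python
import socket
from itertools import cycle


class IpRotator:
    """Hand out source IPs round-robin, switching after a fixed number of queries."""

    def __init__(self, ips, queries_per_ip=20):
        if not ips:
            raise ValueError("at least one source IP is required")
        self._pool = cycle(ips)
        self._limit = queries_per_ip
        self._current = next(self._pool)
        self._used = 0

    def next_ip(self):
        # Move to the next IP once the current one has reached its quota.
        if self._used >= self._limit:
            self._current = next(self._pool)
            self._used = 0
        self._used += 1
        return self._current


def whois_query(server, domain, source_ip, timeout=10.0):
    """Send one whois query from `source_ip` and return the raw response text."""
    with socket.create_connection((server, 43), timeout=timeout,
                                  source_address=(source_ip, 0)) as sock:
        sock.sendall(domain.encode("ascii") + b"\r\n")
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # server closes the connection when done
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


if __name__ == "__main__":
    rotator = IpRotator(["192.0.2.1", "192.0.2.2"], queries_per_ip=2)
    print([rotator.next_ip() for _ in range(5)])
```

A real extractor would call `whois_query(server, domain, rotator.next_ip())` per domain and would probably keep a separate rotator per whois server, since the block counts described above are likely tracked per server.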

Project Start and Completion Date:

Ideally, we would like to get this out by April 14th, 2008.

The completion date will be negotiated.

Deliverables:

- The entire database for com, net, org, biz, info and us, properly sorted into columns and organized into spreadsheets as described above.

- Any software used or written to collect or sort the database, with setup and user instructions. All programs should be fully commented.

All deliverables will be considered "exclusively made for the use of the buyer". Buyer will receive exclusive and complete rights to all work. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the coder's Seller Legal Agreement).

Compensation:

Calculate all the expenses involved, including your own work, and submit a figure for which you will complete the proposed task on an agreed-upon schedule. Compensation will be paid upon completion of the work, or we can arrange for it to be paid in installments as the work progresses. For example, when 25% of the database is delivered in accordance with the project, 25% of the compensation will be paid.

*The information contained in this document is legally privileged, confidential and is for use of the individual or entity to which it is addressed. Access, disclosure, copying, distribution or reliance on any or all of its contents by anyone other than the intended recipient is strictly prohibited.

Skills: Data Entry, Data Processing


About the Employer:
( 7 reviews ) Val Des Monts, Thailand

Project ID: #247560