City Data (repost)

Completed Posted Feb 13, 2010 Paid on delivery

The project entails writing a program to scrape Wikipedia for 11 fields of information on approximately 9000 US cities. The data would then be formatted in a spreadsheet for my use.

Several of the fields can be found in the summary box that Wikipedia shows on the right-hand side of each city page. The other fields involve searching the text of the Wikipedia city entry and returning a value of "1" if the text is found. For these fields (the ones that start "1 if..."), I only care whether the Wikipedia article contains the phrase, for example "Home Rule City". In such a case, a "1" should be entered in the table. Note that "Home Rule City" may also appear as Homerule city, home rule city, home-rule city, etc.
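To make the "1 if..." matching concrete, here is a minimal sketch of a case-insensitive check that tolerates the spelling variants listed above. The pattern, the `HOME_RULE_RE` name, and the `text_flag` helper are illustrative assumptions, not part of the posting, and returning 0 when the phrase is absent is also an assumption (the posting only specifies the "1" case).

```python
import re

# Assumed pattern: "home" and "rule" may be joined by nothing, hyphens, or
# whitespace ("Homerule", "home-rule", "home rule"), followed by "city".
HOME_RULE_RE = re.compile(r"\bhome[\s-]*rule[\s-]+city\b", re.IGNORECASE)


def text_flag(page_text: str, pattern: re.Pattern = HOME_RULE_RE) -> int:
    """Return 1 if the pattern occurs anywhere in the page text, else 0."""
    return 1 if pattern.search(page_text) else 0


# Usage on the variants named in the description.
for sample in ["Home Rule City", "Homerule city", "home rule city", "home-rule city"]:
    print(sample, "->", text_flag(sample))
```

Each of the other text-search fields could reuse `text_flag` with its own compiled pattern.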

I don't care about the program/script. I just care about the finished product, which would be the completed table.

## Deliverables

The list of city names, and the state in which each city is located, is in the attached Excel file. The list of fields is also in the attached Excel file. Basically, the job is to screen-scrape the Wikipedia pages for the listed cities to fill out the table.
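As a rough illustration of the finished table, the sketch below writes one row per city to CSV, which opens directly in a spreadsheet. The column names in `FIELDS` and the sample row are placeholders: the actual 11 fields are defined only in the attached Excel file, and `write_table` is a hypothetical helper.

```python
import csv
import io

# Placeholder columns; the real field list lives in the attached Excel file.
FIELDS = ["city", "state", "population", "home_rule_city"]


def write_table(rows, out):
    """Write a header plus one line per city; missing fields are left blank."""
    writer = csv.DictWriter(out, fieldnames=FIELDS, restval="")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)


# Usage with a made-up row; a real run would pass an open file instead.
buf = io.StringIO()
write_table([{"city": "Example City", "state": "XX", "home_rule_city": 1}], buf)
print(buf.getvalue())
```

Leaving unscraped fields blank rather than writing 0 keeps "not found on the page" distinguishable from "not yet processed", though that choice is a matter of preference.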

Data Entry, Engineering, MySQL, PHP, Project Management, Software Architecture, Software Testing

Project ID: #3176211

About the project

2 proposals Remote project Active Feb 13, 2010

Awarded to:

rxhector2k5

See private message.

$85 USD in 14 days
(63 Reviews)
4.8

2 freelancers are bidding on average $77 for this job

RockStone435

See private message.

$68 USD in 14 days
(352 Reviews)
7.9