[login to view URL] crawler

Cancelled Posted Jan 25, 2013 Paid on delivery
Cancelled Paid on delivery

I need a custom crawler that can accept a range of documents from [url removed, login to view]

example: [url removed, login to view] to [url removed, login to view]

and return a csv file with these fields: Could also be mongodb, open for suggestions..

Clinical Study ID

Title

Phase

Study Status

Start Date

start Enroll

End Date

Primary Comp Date

Study Completion Date

Sponsor

Indication(s)

Intervention(s)

# of Sites

Enrollment (Actual #s where available)

List of Countries

Study Design

# of Study ARMs

Can be written in python or java or can be based on an opensource crawler like Nutch, Hetrix, Bixo web mining toolkit, Mechanize for Python, Crawler4j, etc

Java NoSQL Couch & Mongo Python Web Scraping XML

Project ID: #4174668

About the project

30 proposals Remote project Active Jan 27, 2013