Convert this java program to a MapReduce job (To be run on amazon elastic mapreduce)

Completed Posted Mar 19, 2010 Paid on delivery
Completed Paid on delivery

I have html files I need to parse. I want to use MapReduce to do the work, but don't know how. I wrote the java program to loop through a local directory with the files and parse them into a CSV file. I want to upload all my files to Amazon S3 and then use a mapreduce job to parse the files into 1 or more CSV files. For this project, I would like someone to take the attached java program and convert it into a java mapreduce job that I can run on Amazon. I would also like detailed, step-by-step instructions on how to initiate the job and get the results. Also, sample html files are in the attached zip file.

## Deliverables

0a) Convert attached java program into a java mapreduce job that I can run on Amazon. 0b) Detailed, step-by-step instructions on how to initiate the job and get the results. 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

Amazon Elastic MapReduce

Amazon Web Services

Project ID: #3275896

About the project

2 proposals Remote project Active Mar 30, 2010

Awarded to:

mihalychvw

See private message.

$85 USD in 3 days
(1 Review)
2.2

2 freelancers are bidding on average $49 for this job

webspiderinc

See private message.

$12.75 USD in 3 days
(11 Reviews)
4.5