In Progress

Parse Data from HTML Files

I had a user that ran an automated script against our solution database to download a local copy of each solution. That user had access to a link to delete each solution, and, as a result, their program crawled each page, followed the delete link, and deleted each solution/article.

As a result, I have 10,479 separate HTML files that contains the information that is no longer in my database. Each of these files contains 7 pieces of information I am interested in:



Create Date

Revised Date




All other information is not necessary and can be ignored.

The problem and solution values may or may not contain HTML markup.

There are two sample files attached. Additional sample data can be provided upon request.

This project is to parse the requested data out of the 10,479 files, and:

1. Provide a sample export of the resulting data to ensure it meets requirements,

2, Complete a full export of the entire data set once we have agreed on the sample export.

The export can be provided in a number of ways - database file, excel file, delimited file, etc. Please present your proposed format with your proposal.

Skills: C Programming, C# Programming, Data Mining, Data Processing, Software Development

See more: sample project proposal format, sample program proposal, sample of program proposal, programming with data, parse programming, kbid, html programming file, full project proposal sample, format of project proposal, excel and access programming, excel access programming, the parse platform, sample data, parse html, parse an html, parse html link, processing export, html parse link, html copy excel, delimited access

About the Employer:
( 18 reviews ) Oakfield, United States

Project ID: #6348222

10 freelancers are bidding on average $139 for this job


Hello, This is a simple parsing job, I would write a one page script that parses the file and saves the data in a database or saves it as a csv file. I can supply all the data in one csv, excel and/or database file. More

$222 USD in 3 days
(3 Reviews)

Hello, I'm a computer scientist with a lot of experience in big data and data scraping. For this problem I will probably use python (even though it is not necessary, Java can also be an option, as well as C#), and the More

$100 USD in 3 days
(6 Reviews)

Hi, I am an expert for scraper scripts written in PHP, please check out my profile and ratings on scraper projects. I can write a script that would parse store all in Mysql database, and then, through phpmyadmin, More

$55 USD in 1 day
(4 Reviews)

Hello, I can do sample entries. Please check my reviews. Looking forward for the reply. Regards, Aditya

$40 USD in 2 days
(42 Reviews)

Hello, I have an experience of more than 4 years in web development and maintenance. I have in-depth knowledge of php, mysql, jquery, paypal integrations, API's, css, html, html5. Our team is experienced, creativ More

$157 USD in 8 days
(6 Reviews)

Hello. How exactly should the inner html be parsed? Should the visual style be represented the same way in the export file or do you just want to clean the text out of tags? Anyway, i can take on your project. Her More

$88 USD in 3 days
(5 Reviews)

I am very adept at parsing data and normalizing it. From the examples, the data looks straightforward and easy to parse. I can provide this as either delimited (CSV) or as an sqlite database file. My suggestion would b More

$161 USD in 2 days
(3 Reviews)

Hello sir, I can help you on this. Need to discuss on the exact information to be extracted from the HTML files, I have check the file and I can extract the data as required. I will create a desktop ( Windows) More

$166 USD in 3 days
(3 Reviews)

Hello! I am Antal, i can do this job easily and fast, if you interest message me, i can create a sample for you if you want before you award me! :) Thanks, Antal

$250 USD in 2 days
(0 Reviews)

I wont say that i know every thing, but i know to do your requirement in an perfect time. I had an knowledge of all the that your are needed. So kindly accept my Notice.

$155 USD in 3 days
(0 Reviews)