Completed

Parse Data from HTML Files

I had a user that ran an automated script against our solution database to download a local copy of each solution. That user had access to a link to delete each solution, and, as a result, their program crawled each page, followed the delete link, and deleted each solution/article.

As a result, I have 10,479 separate HTML files that contains the information that is no longer in my database. Each of these files contains 7 pieces of information I am interested in:

Platform

Version

Create Date

Revised Date

KBID

Problem

Solution

All other information is not necessary and can be ignored.

The problem and solution values may or may not contain HTML markup.

There are two sample files attached. Additional sample data can be provided upon request.

This project is to parse the requested data out of the 10,479 files, and:

1. Provide a sample export of the resulting data to ensure it meets requirements,

2, Complete a full export of the entire data set once we have agreed on the sample export.

The export can be provided in a number of ways - database file, excel file, delimited file, etc. Please present your proposed format with your proposal.

Skills: C Programming, C# Programming, Data Mining, Data Processing, Software Development

See more: sample project proposal format, sample program proposal, sample of program proposal, programming with data, parse programming, kbid, html programming file, full project proposal sample, format of project proposal, excel and access programming, excel access programming, the parse platform, sample data, parse html, parse an html, parse html link, processing export, html parse link, html copy excel, delimited access, delimited data, parse data file, parse data, crawled data, export data project access

About the Employer:
( 15 reviews ) Oakfield, United States

Project ID: #6348222

Awarded to:

lafor

Hi, I specialize in web data extraction and processing; on my reviews page you will find dozens of examples of similar tasks I've completed here on Freelancer. I took a look at your attached files and the informa More

$70 USD in 1 day
(139 Reviews)
6.5

11 freelancers are bidding on average $133 for this job

Ivan83

Hi, I am an expert for scraper scripts written in PHP, please check out my profile and ratings on scraper projects. I can write a script that would parse store all in Mysql database, and then, through phpmyadmin, More

$55 USD in 1 day
(4 Reviews)
4.6
xcodbin

please check our company freelancer profile https://www.freelancer.com/u/xcodbin.html we already developed this type of project so we can take your project. why you hire us ? have 5*** with 100% complete rate with we More

$140 USD in 6 days
(10 Reviews)
3.5
Tyrrrz

Hello. How exactly should the inner html be parsed? Should the visual style be represented the same way in the export file or do you just want to clean the text out of tags? Anyway, i can take on your project. Her More

$88 USD in 3 days
(5 Reviews)
3.3
adityasl

Hello, I can do sample entries. Please check my reviews. Looking forward for the reply. Regards, Aditya

$40 USD in 2 days
(13 Reviews)
3.3
peebes

I am very adept at parsing data and normalizing it. From the examples, the data looks straightforward and easy to parse. I can provide this as either delimited (CSV) or as an sqlite database file. My suggestion would b More

$161 USD in 2 days
(2 Reviews)
1.6
varshil8060

Hello sir, I can help you on this. Need to discuss on the exact information to be extracted from the HTML files, I have check the file and I can extract the data as required. I will create a desktop ( Windows) More

$166 USD in 3 days
(2 Reviews)
1.4
ergo1wish

Hello, I'm a computer scientist with a lot of experience in big data and data scraping. For this problem I will probably use python (even though it is not necessary, Java can also be an option, as well as C#), and the More

$100 USD in 3 days
(1 Review)
1.0
kotiantal

Hello! I am Antal, i can do this job easily and fast, if you interest message me, i can create a sample for you if you want before you award me! :) Thanks, Antal

$250 USD in 2 days
(0 Reviews)
0.0
pointlogic

Hello, I have an experience of more than 4 years in web development and maintenance. I have in-depth knowledge of php, mysql, jquery, paypal integrations, API's, css, html, html5. Our team is experienced, creativ More

$157 USD in 8 days
(0 Reviews)
0.0
vpradeep315

I wont say that i know every thing, but i know to do your requirement in an perfect time. I had an knowledge of all the that your are needed. So kindly accept my Notice.

$155 USD in 3 days
(0 Reviews)
0.0
burhanbvk

Hello, This is a simple parsing job, I would write a one page script that parses the file and saves the data in a database or saves it as a csv file. I can supply all the data in one csv, excel and/or database file. More

$222 USD in 3 days
(0 Reviews)
0.0