Find Jobs
Hire Freelancers

Comments (from message boards) Extractor

$250-750 USD

Closed
Posted over 8 years ago

$250-750 USD

Paid on delivery
Hi, I'm interested in a crawler / extractor with a very simple task. I just want to be able to give it a list of URLs to different articles (e.g. [login to view URL]) OR a URL to a site that hosts many articles (e.g. [login to view URL]) that all have comment sections at the bottom of each article page. I want the crawler/extractor to collect all of the comments and export them into a spreadsheet with the following columns: URL, article title, commenter, comment, keywords (based on simple word frequencies, minus, of course common stop words; should calculate by stem/type, not each token of a word). It's important that there be no limit on how many URLs I can input or how many comments can be scraped - it should be a COMPLETE collection of all the user comments for each URL; it should also be able to detect if there are multiple pages and grab the comments from all of the pages.
Project ID: 9322709

About the project

1 proposal
Remote project
Active 8 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
1 freelancer is bidding on average $250 USD for this job
User Avatar
hello sir , c/c++/python/autohotkey expert worked for samsung & huawei a sample can be provided before hired hope to get message from u thank you very much
$250 USD in 3 days
5.0 (2 reviews)
2.7
2.7

About the client

Flag of UNITED STATES
United States
0.0
0
Payment method verified
Member since Jan 14, 2016

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.