Find Jobs
Hire Freelancers

Big data Project - using sentence embedding(word2vec, doc2vec) and gradient boost models such as catBoost

$30-250 AUD

Closed
Posted over 2 years ago

$30-250 AUD

Paid on delivery
1. Collect and process pdf data dump from COVID-19 Open Research Dataset Challenge (CORD-19) [login to view URL] 2. Analyze the data and provide publication statistics such as the number of publications according to time, location but not limited to. Provide (any type of) visualization for the results. 3. Learn sentence embedding from the articles' abstract and main content respectively. 4. Build a tool for question answering: given a user input sentence or query, outputs the top 10 most relevant sentences from the data and the source of the data, i.e., the sentence comes from which article. The tool could be command-line based or a simple Web-based interface. Note that the dataset is large, so if you have difficulties processing all the articles provided in the dataset, you could work on part of it but no less than 5000 articles. And provide justification of why you choose the number of articles to work o
Project ID: 31906837

About the project

7 proposals
Remote project
Active 2 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
7 freelancers are bidding on average $169 AUD for this job
User Avatar
Hi, Hope you are doing well. I have over 6 years of rich experience in data science and machine learning. I have worked hands on in Python with different datasets for data wrangling, data manipulation, data analysis etc. I have worked on several kinds of ML techniques like regression, classification and clustering and NLP techniques like text classification, word2vec , tfidf etc. I understand your problem completely. Since I have worked on such problems in the past, I am sure I would be able to deliver your work. Looking forward to work with you
$250 AUD in 5 days
5.0 (14 reviews)
4.7
4.7
User Avatar
Hello, I am a data scientist with strong background in machine learning and statistics and more than 3 years in building complex ML systems in NLP, computer vision, active learning, federared learning, few-shot learning, etc, . In my past work, I have managed teams of people to work on every steps of the project, from data collection, data exploration, feature extraction, building model, evaluating using different metrics to build API to serve more than 5 millions requests/day. My tools include Python, Tensorflow/keras, Tensorflow Lite for Android, Numpy, Pandas, OpenCV, AWS (for deploying and monitoring model) as well as many visualization tools like matplotlib, seaborn, etcs. I am confident that I can help you on this. Please send me more detail about the project so that I can further assist you on this. I am looking forward to hearing back from you. Dat
$200 AUD in 7 days
5.0 (11 reviews)
4.3
4.3
User Avatar
Dear Client Thank you for your project. I've just checked your job description carefully. I'm senior developer with 9+ years of Python. By using Python, I developed AI engine, BOT, Web Scraping Tools, Web Searching Tools and so on. Python is my major so you will be satisfied. I always find my happy and pleasant within client's mind whenever the they are satisfied. So I try harder and harder to develop the projects perfectly. If someone asks me about my happy and pleasant, I always will talk that when the clients are satisfied as I develop their projects excellently and perfectly, I am very happy. I would love to work with you. I can complete your project on time and within your budget. I hope you contact me soon. Thank you. Best Regards.
$140 AUD in 7 days
5.0 (1 review)
1.9
1.9
User Avatar
Hi dear, I have read your job description carefully and I am very interested in your pdf scraping project. I will use Beautifulsoup to perform your large data scraping. Beautifulsoup is very good python scraping libray for pdf scraping on websites. I am very familiar with pdf scraping libraries such as Beautifulsoup, PDFMiner, PyPDF2 and Camelot. The most familiar library for me is Beautifulsoup so I prefer it. I have good experience in scraping data of websites especially articles and votings on sports websites. Many experience in web scraping and confidence in python give me confidence in your project :) Please ping me so I can help you very well. Thanks for your posting.
$140 AUD in 7 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of AUSTRALIA
sydney, Australia
5.0
1
Payment method verified
Member since Oct 13, 2020

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.