Search PDF Document and Autofill Properties Fields Software
$250-750 USD
Cancelled
Posted about 11 years ago
Paid on delivery
MSDS PDF Document Properties project
The purpose of this project is to create software that will read an MSDS document in PDF format and put specific data elements from the MSDS sheet into the PDF document properties box.
You must be able to search and process PDF documents in a folder or folders, including subfolders, sub-subfolders, and so on.
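A minimal sketch of the folder search, assuming Python and only its standard library. The function name `find_pdfs` is hypothetical, not part of the brief; it only collects file paths and does not open the PDFs.

```python
from pathlib import Path


def find_pdfs(root):
    """Return every PDF under root, at any folder depth, in sorted order."""
    return sorted(
        p for p in Path(root).rglob("*")
        # Compare the extension case-insensitively so "X.PDF" is also found.
        if p.is_file() and p.suffix.lower() == ".pdf"
    )
```

`rglob` descends into subfolders at every level, which covers the "sub-sub folder etc." case without any explicit recursion.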
You must sort your documents based on which fields you were able to match, and any possible combination of matches:
All 5 requirements
requirements 1,2,3,4
requirements 1,2,3
etc.
The minimum goal is to match requirements 1, 2, 3, and 4. There will not be very many documents that match #5.
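One possible way to sort documents by match combination, sketched in Python. The field names in `FIELDS` and the shape of the `results` dictionary are assumptions made for illustration; the brief does not say how results should be stored or reported.

```python
from collections import defaultdict

# Hypothetical internal names for requirements 1 to 5, in order.
FIELDS = ("company", "product", "date", "cas", "ghs")


def match_key(found):
    """Return the requirement numbers (1-5) that have a non-empty value."""
    return tuple(i for i, name in enumerate(FIELDS, start=1) if found.get(name))


def group_by_matches(results):
    """Group document names by the combination of requirements matched."""
    groups = defaultdict(list)
    for doc, found in results.items():
        groups[match_key(found)].append(doc)
    # List the most complete combinations first.
    return dict(sorted(groups.items(), key=lambda kv: (-len(kv[0]), kv[0])))
```

Each key, such as `(1, 2, 3, 4)`, could then map to an output folder or a report section.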
I will provide a file of all CAS numbers for you to match.
There are five data elements that we will be searching for in each document:
1. Company Name
2. Product Name
3. Most current date on the document
4. CAS numbers
5. GHS compliant.
For every string, ignore non-alphanumeric characters such as ; : - . etc.
Replace all commas with a space (so "ABC Company, Inc." becomes "ABC Company Inc").
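The two cleanup rules above can be sketched as a small Python helper. The name `normalize` is hypothetical, and collapsing repeated spaces is an added assumption so that the output matches the "ABC Company Inc" example. Applying it to CAS numbers would strip their hyphens, so this sketch assumes it is only used on name strings.

```python
import re


def normalize(text):
    """Apply the cleanup rules: commas become spaces, other punctuation is dropped."""
    text = text.replace(",", " ")
    # Remove everything that is not a letter, digit, or whitespace.
    text = re.sub(r"[^\w\s]|_", "", text)
    # Collapse runs of whitespace left behind by the replacements.
    return re.sub(r"\s+", " ", text).strip()
```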
1. Company Name - placed in the Author field:
Look for a string, only on the first page of the PDF document, that looks like a company name, meaning it contains one of the following (with a non-alphabetic character before and after):
company
companies
corporation
enterprises
laboratories
laboratory
labs
corp
co
llp
lp
llc
industries
ind
international
intl
s.a.
Inc
incorporated
ltd
limited
pty
supply
or, if there is not a match above, it has a string immediately following:
company
company name
company id
corporate office
manufacturer
manufacture
manufacturer information
distributor
manufacturer(s) name
supplier
distributed by
company address
Responsible Party
manufactured
supplied
supplier name and address
contact
contact details
Consignee
importer
manufacturer/supplier
manufactured for
Company Identification
or, if there is not a match above, it is part of an address sequence:
Revkem
PO Box 28104
Green Bay, WI 54324
AQUASOLVE CHEMICAL CO.
P.O. box 1952
Houston, Texas 77251
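The three company-name rules above could be sketched in Python as follows. This is an illustration under several assumptions, not a definitive implementation: the input is the already-extracted text of the first page, one candidate per line; a leading label such as "Company Name:" is stripped before the keyword test so the label itself does not count as a match; and the address rule only recognizes a line directly above a PO Box line, as in the two examples. All function and variable names are hypothetical.

```python
import re

# Keyword lists copied from the posting.
SUFFIXES = ("company companies corporation enterprises laboratories laboratory "
            "labs corp co llp lp llc industries ind international intl s.a. inc "
            "incorporated ltd limited pty supply").split()
LABELS = ("company;company name;company id;corporate office;manufacturer;"
          "manufacture;manufacturer information;distributor;manufacturer(s) name;"
          "supplier;distributed by;company address;responsible party;manufactured;"
          "supplied;supplier name and address;contact;contact details;consignee;"
          "importer;manufacturer/supplier;manufactured for;"
          "company identification").split(";")


def alternation(words):
    """Regex alternation of escaped words, longest first so long forms win."""
    return "|".join(re.escape(w) for w in sorted(words, key=len, reverse=True))


# Keyword with a non-letter (or line edge) on both sides.
SUFFIX_RE = re.compile(r"(?<![A-Za-z])(?:%s)(?![A-Za-z])" % alternation(SUFFIXES), re.I)
# Label at the start of a line, followed by the value.
LABEL_RE = re.compile(r"(?:%s)(?![A-Za-z])(.*)$" % alternation(LABELS), re.I)
PO_BOX_RE = re.compile(r"p\.?\s*o\.?\s*box\b", re.I)


def strip_label(line):
    """Return (value, True) if the line starts with a label, else (line, False)."""
    m = LABEL_RE.match(line)
    value = m.group(1).strip(" :-") if m else ""
    return (value, True) if value else (line, False)


def find_company(first_page_text):
    lines = [ln.strip() for ln in first_page_text.splitlines() if ln.strip()]
    # Rule 1: the line contains a company keyword such as "Inc" or "Co".
    for ln in lines:
        value, _ = strip_label(ln)
        if SUFFIX_RE.search(value):
            return value
    # Rule 2: the text immediately following a label such as "Manufacturer".
    for ln in lines:
        value, labelled = strip_label(ln)
        if labelled:
            return value
    # Rule 3: the line directly above a PO Box line in an address block.
    for prev, ln in zip(lines, lines[1:]):
        if PO_BOX_RE.match(ln):
            return prev
    return None
```

Short keywords such as "co" and "ind" will produce false positives on real documents, so the rule order and the keyword list would likely need tuning against the sample files.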
2. Product name - placed in the Title field
Has a string immediately following one of these strings:
product name
product id
product identity
msds name
trade name
common name
product
brand name
identity
identity (As used on label and list)
Product Identity (Name / Number)
product trade name
Product Brand Name
Trade Name and Synonyms
material name
chemical product
commercial product name
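A sketch of the product-name lookup in Python, assuming the document text has already been extracted and that each label starts a line. Taking the next non-empty line when a label stands alone is an added assumption, and the function name is hypothetical.

```python
import re

# Labels copied from the posting, lowercased; matching is case-insensitive.
LABELS = ("product name;product id;product identity;msds name;trade name;"
          "common name;product;brand name;identity;"
          "identity (as used on label and list);product identity (name / number);"
          "product trade name;product brand name;trade name and synonyms;"
          "material name;chemical product;commercial product name").split(";")

# Longest labels first so "product name" wins over plain "product".
LABEL_RE = re.compile(
    r"\s*(?:%s)(?![A-Za-z])(.*)$"
    % "|".join(re.escape(w) for w in sorted(LABELS, key=len, reverse=True)),
    re.IGNORECASE,
)


def find_product_name(text):
    """Return the string following the first product label, or None."""
    lines = text.splitlines()
    for i, line in enumerate(lines):
        m = LABEL_RE.match(line)
        if not m:
            continue
        value = m.group(1).strip(" :-\t")
        if value:
            return value
        # Label alone on its line: assume the value is the next non-empty line.
        for nxt in lines[i + 1:]:
            if nxt.strip():
                return nxt.strip()
    return None
```

The bare label "product" is broad, so a line such as "Product use: ..." would also match; the sample documents would show whether that label needs extra guarding.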
3. Most current date in the document - placed in the Subject field. Search for all of the dates found in the document using the date formatting criteria used in the date changer program, compare the dates found, and keep the most current.
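The date formatting criteria of the date changer program are not given in this posting, so the following Python sketch assumes three common formats purely for illustration (MM/DD/YYYY, YYYY-MM-DD, and "Month D, YYYY" with English month names). The real format list would replace `PATTERNS`.

```python
import re
from datetime import datetime

# ASSUMED formats; the posting does not list the actual criteria.
PATTERNS = [
    (re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"), "%m/%d/%Y"),
    (re.compile(r"\b\d{4}-\d{1,2}-\d{1,2}\b"), "%Y-%m-%d"),
    (re.compile(r"\b[A-Za-z]+ \d{1,2}, \d{4}\b"), "%B %d, %Y"),
]


def latest_date(text):
    """Return the most recent valid date found in text, or None."""
    found = []
    for pattern, fmt in PATTERNS:
        for candidate in pattern.findall(text):
            try:
                found.append(datetime.strptime(candidate, fmt).date())
            except ValueError:
                # Looks like a date but is not one (e.g. month 13); skip it.
                pass
    return max(found, default=None)
```

Parsing every candidate into a real `date` before comparing avoids comparing strings in different formats.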
4. CAS numbers - place in the Keywords field, one per line
All CAS numbers will be in a format like this: x-x-x, where each x is one or more digits (e.g. 23-1122-221 or 113-23-1223). I will provide a list of CAS numbers to compare against the numbers found. If a CAS number sequence matches a CAS number on my list, then place that CAS number in the Keywords field. There may be 0, 1, or many CAS numbers on each MSDS.
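A minimal Python sketch of the CAS matching, following the x-x-x description above. It assumes the provided list has been loaded into a set called `known`; the file format of that list is not specified here, so loading is left out. Names are hypothetical.

```python
import re

# Three digit groups joined by hyphens, not embedded in a longer digit/hyphen run.
CAS_RE = re.compile(r"(?<![\d-])\d+-\d+-\d+(?![\d-])")


def find_cas_numbers(text, known):
    """Return CAS numbers from text that are on the known list, in order, without repeats."""
    matches = []
    for candidate in CAS_RE.findall(text):
        if candidate in known and candidate not in matches:
            matches.append(candidate)
    return matches
```

Checking against the provided list is what filters out look-alikes such as phone numbers, which also fit the x-x-x shape.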
5. GHS version MSDS - Place the word “GHS” in the Keywords field. You will search the document for one or more strings of text, and if those strings match exactly, you will put the word GHS in the Keywords field.
PLEASE LOOK AT THE ATTACHED FILE FOR SAMPLE DOCUMENTS THAT YOUR SOFTWARE WILL BE SEARCHING THROUGH.
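The GHS check and the Keywords value could be sketched in Python like this. The posting does not list the strings to search for, so the entries in `GHS_PHRASES` are placeholders only. Treating any single exact (case-sensitive) match as sufficient is also an assumption. Function names are hypothetical.

```python
# PLACEHOLDER phrases; the real search strings are not given in the posting.
GHS_PHRASES = ["Globally Harmonized System", "GHS Classification"]


def is_ghs(text, phrases=GHS_PHRASES):
    """True if any phrase appears in text exactly (case-sensitive)."""
    return any(phrase in text for phrase in phrases)


def build_keywords(cas_numbers, ghs):
    """Build the Keywords value: one CAS number per line, then GHS if matched."""
    lines = list(cas_numbers)
    if ghs:
        lines.append("GHS")
    return "\n".join(lines)
```

For example, `build_keywords(["67-64-1"], True)` yields the CAS number on one line and `GHS` on the next.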
We will only consider bids under $500 USD.
Funds will be placed in escrow for the project.
Hi compliance11!
I have 20+ years experience in coding and 10+ years experience in coding and designing data manipulation apps in .net. I have recently created a web scraping app that downloads and parses PDF files, so I have a good understanding of what you need (the app parsed over 400,000 pdfs successfully).
Please check PM for details!
Thanks!
Hi, this is Piyush Patel.
I have 8 years of experience in VBA, VB6, VB.NET, SQL, and MS Access.
I have developed many applications in VB6, VBA, and VB.NET.
Please check my PM.
Hi.....,
I am a senior ASP.NET programmer using C# and VB, and I have a team with more than 5 years of experience in developing web and mobile applications. My team of expert programmers writes robust, bug-free code at the very first instance. After seeing the quality of our work and the cooperation we offer even after the work has been delivered, you'll come to us again and again to avail our services.
I read through the job details extremely carefully and I am absolutely sure that I can do the project very well. I can achieve the results that you are asking for within your timeframe and budget. I am confident I can exceed your expectations and I would love to work on your project.
Thanks,
Murtaza
Hello,
I have experience examining content from PDFs. It is not always easy to extract text from a PDF, because a PDF can be built up in many different ways, but it is possible. I will provide a short demo before the 6 days are over that extracts company info from PDF documents (with the aid of your attachment).
It's an easy task for us. We have gone through your requirements and we are ready to start work on your project immediately. We will send you the complete list of the company's projects and portfolio once you reply to us. Also, please check the PMB.
Thanks.
Thanks for the opportunity to bid on your project. I'm a new freelancer and want to prove my skills, so I will give my life to finish this project in a good manner.