Program an engine in VB/C++/C/C# which analyses an English document and does the following:

1) Removes all misspelled words and numbers, leaving only words found in the English dictionary.
2) *This is the tricky part.* It must then analyze the remainder of the document, taking in each word, and create a code that identifies the document.

For example, a document could read "Welcome to Boxer High School". After analyzing the words, a code is created to identify this text, e.g. "77fdd8853fhje". The smaller the code is relative to the amount of text in the document THE BETTER. This is important so that a large number of codes can be searched quickly.

Basically it's a way of summarizing a document full of English words and producing an ORIGINAL code to identify that document. By the same rule, you could type 77fdd8853fhje into the engine, and when it reverse-processes the code it will read "Welcome to Boxer High School".

If you think you are ready to push the limits and are 100% sure you are capable of making such an engine (one which can handle larger documents), make a bid and it will be considered. I AM OPEN TO ANY DIFFERENT METHODS YOU MIGHT HAVE TO ACCOMPLISH THE SAME OBJECTIVE: "GET A WAY OF IDENTIFYING DOCUMENTS WITH A CODE". Proof of being able to do this will make it easier to win the bid.
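To make the two steps described above concrete, here is one possible approach, sketched in Python for brevity (the posting itself asks for VB/C++/C/C#). It is a sketch under assumptions, not a finished engine. The tiny `KNOWN_WORDS` set stands in for a real dictionary file, and `keep_known_words`, `DocumentIndex`, and `code_length` are hypothetical names invented for this illustration. One caveat: a fixed-length code cannot by itself contain an arbitrarily long text, so the "reverse processing" here is a lookup in a stored table rather than a decoding of the code.

```python
import hashlib
import string

# Hypothetical stand-in for a real English dictionary (e.g. a word-list file).
KNOWN_WORDS = {"welcome", "to", "boxer", "high", "school"}


def keep_known_words(text, dictionary=KNOWN_WORDS):
    """Step 1: drop numbers and any word not found in the dictionary."""
    kept = []
    for raw in text.split():
        word = raw.strip(string.punctuation)  # "School." -> "School"
        if word.isalpha() and word.lower() in dictionary:
            kept.append(word)
    return kept


class DocumentIndex:
    """Step 2: give each cleaned document a short code and remember it."""

    def __init__(self, code_length=13):
        self.code_length = code_length  # assumed length, matches the example code
        self._by_code = {}              # code -> cleaned document text

    def add(self, text):
        cleaned = " ".join(keep_known_words(text))
        digest = hashlib.sha256(cleaned.encode("utf-8")).hexdigest()
        code = digest[: self.code_length]
        existing = self._by_code.get(code)
        if existing is not None and existing != cleaned:
            # Two different documents produced the same truncated hash.
            raise ValueError("code collision; use a larger code_length")
        self._by_code[code] = cleaned
        return code

    def lookup(self, code):
        """'Reverse process' a code by looking up the stored text."""
        return self._by_code[code]


if __name__ == "__main__":
    index = DocumentIndex()
    code = index.add("Welcome to Boxer High Scho0l 2024 School")
    print(code, "->", index.lookup(code))
```

The code stays the same length however large the document is, which keeps searching through many codes fast. The trade-off is that the table of stored documents has to be kept alongside the codes.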
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request.
3) Exclusive and complete copyrights to all work purchased. (No GPL, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site).
## Platform
Windows (ALL)