Thank you for providing a clear description of your project. I have reviewed the architecture document and have a few questions.
1. The example on page four refers to "w1" and "j1". j1, or "milk", is assigned a distance value of "5". However, given the sample ontology this should be "4". Is this correct?
2. Do you require languages other than English?
3. Have you considered using Topic Modeling using Gensim for this project?
4. How will the grouped sentences be consumed?
5. Are you open to alternate ways to represent the lexical database format? (Like::selfsame@1,alike@1,identical@...)
6. The project specifies Python or Java. Do you have a preference?
My bid includes the following services.
1. Work with you to create sample data and test cases.
2. Implement the sentence boundary detection.
3. Implement the sentence similarity measurement logic.
4. Implement the lexical database.
5. Implement the sentence grouping functionality.
6. Review the deliverables with you to ensure you are satisfied with the result.
I am located in Alberta, Canada and have worked with many clients across the globe. I am an expert in Semantic technologies and Natural Language concepts. As a native English speaking freelancer I can bring creativity, dedication, and a wealth of experience in the computing industry to your project.
Thank you for your consideration. Feel free to message me if you have any questions.
Best regards,
Jay