Python project repair - improvement

In Progress Posted Jun 20, 2015 Paid on delivery

I need an existing program improved. Its documentation is attached. At the moment the program does not work correctly.

Phase 1 - works perfectly :)

Phase 2 - does not work correctly, because it uses the results of Phase 1, which it should not. It should find similar articles within one language, as it does now, but on the RAW data rather than on the results of Phase 1. The EN portion of the data should be downloaded file by file and each file processed separately, because the whole set does not fit on the HDD - just as in Phase 1. So the modification is to the files it works on.
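A minimal sketch of the file-by-file handling described for Phase 2, assuming each EN file can be fetched, processed, and deleted before the next one is touched. The names `process_files_one_at_a_time`, `fetch`, `process`, and `fetch_http` are hypothetical and do not come from the attached code; the actual file names on the data server are not assumed here.

```python
import os
import tempfile
import urllib.request
from typing import Callable, Iterable, List


def process_files_one_at_a_time(
    names: Iterable[str],
    fetch: Callable[[str, str], None],
    process: Callable[[str], object],
) -> List[object]:
    """Fetch each file to a temporary path, process it, then delete it."""
    results = []
    for name in names:
        fd, path = tempfile.mkstemp(suffix="_" + os.path.basename(name))
        os.close(fd)
        try:
            fetch(name, path)              # hypothetical downloader: writes the file to `path`
            results.append(process(path))  # hypothetical per-file Phase 2 step
        finally:
            os.remove(path)                # free disk space before the next file
    return results


def fetch_http(base_url: str) -> Callable[[str, str], None]:
    """Build a `fetch` callable that downloads base_url + name over HTTP."""
    def fetch(name: str, path: str) -> None:
        urllib.request.urlretrieve(base_url + name, path)
    return fetch
```

With this shape, only one EN file is on disk at any time; `fetch_http("http://data.statmt.org/ngrams/docs_en/")` could serve as the `fetch` argument once the real file names are known.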

Phase 3 - currently it needs Phase 2 to finish first. Instead, it should be launched after each EN file is processed. Cross-lingual sets of articles should be saved in separate files, but only the most similar ones, about 5% - not more.
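One way the "about 5% - not more" cut for Phase 3 could look, as a sketch only. It assumes each cross-lingual candidate is a tuple of two article IDs and a similarity score where higher means more similar; the `Pair` layout and the `top_fraction` name are hypothetical, and the attached code may score and store pairs differently.

```python
import math
from typing import List, Tuple

# Assumed layout: (article_id_a, article_id_b, similarity)
Pair = Tuple[str, str, float]


def top_fraction(pairs: List[Pair], fraction: float = 0.05) -> List[Pair]:
    """Return the highest-scoring pairs, never more than `fraction` of the input."""
    keep = math.floor(len(pairs) * fraction)  # round down so the cap is not exceeded
    ranked = sorted(pairs, key=lambda p: p[2], reverse=True)
    return ranked[:keep]
```

Because the count is rounded down, an input with fewer than 20 pairs yields nothing at 5%; whether a minimum of one pair should be kept in that case is a decision for the project owner.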

Phases 2 and 3 should run automatically and be repeated 99 times, for each EN file.

The code is here: [login to view URL]!AhMECSbJ!hFKXXjib_3Y20VnLlma2L-aTSYa28W1cIsi_ZCdYAsU

Correct EN data is here: http://data.statmt.org/ngrams/docs_en/

C Programming, Data Mining, Data Processing, Perl, Python

Project ID: #7898427

About the project

5 proposals Remote project Active Jun 23, 2015

5 freelancers are bidding on average $289 for this job

flashsaiful

Hello Dear, I can do this for you. Please send a message in the PMB for details. Best Regards, flashsaiful

$444 USD in 5 days
(118 Reviews)
6.6
anuyadav1

A proposal has not yet been provided

$250 USD in 5 days
(63 Reviews)
6.0
SharjeelSohail

Hi. I can do this work for you.

$250 USD in 7 days
(2 Reviews)
2.6
anuraggupta131

A proposal has not yet been provided

$200 USD in 10 days
(2 Reviews)
2.8