In Progress

Sort & combine text by topic (keywords)

I have a script that finds similar texts by keyword.

-> attachment "[url removed, login to view]"

In its result, some entries appear twice: texts 13 and 10.

-> How can I get a result like this:

"obama & clinton"

3 bla sdf asd fb la fg dfg blb ala bla bla clinton

4 b lad fg bl obama ba la dfg clinton dsf bla bla

6 b la bl obama bd fg sdf a la bla bla

7 db la dbl obama bd dfg sdf ad la bla bla

11 s ons ti ges obama sdf as df

12 s clinton ons ti ges sdf as df

"twitter & web20-web20 & facebook-mircosoft & facebook"

2 b la bl twitter ba la bla bl dfg a

8 bla df gd sfg blb ala bla bla twitter

10 twitter s ons web20 ti ges sdf as df

5 bla blb dfg dfg ala bla bla ds fg mircosoft

13 s mircosoft ons facebook ti ges sdf as df

1 bla web20 blb ala bla bla facebook dfg

"Non-keyword items"

9 s ons ti ges sdf about as df

-> The task is: "If a keyword in one group is also a keyword in another group, merge the texts of both groups into one group."

+

I also need something like an id-array output from which I can read which texts belong together, something like:

$keywordid[topic][textid]

(-> var_dump($keywordid); should show all texts)

so that I can keep working with the results.
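One possible shape for such an array is sketched below. It is only an illustration: the variable names `$texts` and `$topics` are hypothetical, the sample lines are copied from this post, and storing the text itself as the value is an assumption based on the remark that var_dump should show all texts.

```php
<?php
// Hypothetical input: texts keyed by their text id (sample lines from the post).
$texts = [
    3 => 'bla sdf asd fb la fg dfg blb ala bla bla clinton',
    4 => 'b lad fg bl obama ba la dfg clinton dsf bla bla',
    9 => 's ons ti ges sdf about as df',
];

// Hypothetical grouping result: topic label => ids of the texts in that topic.
$topics = [
    'obama & clinton'   => [3, 4],
    'Non-keyword items' => [9],
];

// Build $keywordid[topic][textid] = text.
$keywordid = [];
foreach ($topics as $topic => $ids) {
    foreach ($ids as $id) {
        $keywordid[$topic][$id] = $texts[$id];
    }
}

// Shows every topic with all of its texts.
var_dump($keywordid);

// Reading the structure back to work with the results.
foreach ($keywordid as $topic => $entries) {
    echo '"', $topic, '"', PHP_EOL;
    foreach ($entries as $id => $text) {
        echo $id, ' ', $text, PHP_EOL;
    }
}
```

With this layout, array_keys($keywordid[$topic]) gives the ids of all texts that belong to one topic.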

To make it clear, the current output is:

mircosoft & facebook
5 bla blb dfg dfg ala bla bla ds fg mircosoft
13 s mircosoft ons facebook ti ges sdf as df

obama & clinton
3 bla sdf asd fb la fg dfg blb ala bla bla clinton
4 b lad fg bl obama ba la dfg clinton dsf bla bla
6 b la bl obama bd fg sdf a la bla bla
7 db la dbl obama bd dfg sdf ad la bla bla
11 s ons ti ges obama sdf as df
12 s clinton ons ti ges sdf as df

twitter & web20
2 b la bl twitter ba la bla bl dfg a
8 bla df gd sfg blb ala bla bla twitter
10 twitter s ons web20 ti ges sdf as df

web20 & facebook
1 bla web20 blb ala bla bla facebook dfg
10 twitter s ons web20 ti ges sdf as df
13 s mircosoft ons facebook ti ges sdf as df

Non-keyword items

9 s ons ti ges sdf about as df

-> Now I want to merge all texts that share a topic: all texts under "mircosoft & facebook" should be merged with "web20 & facebook" because both contain "facebook", and all texts under "twitter & web20" should also be merged with "web20 & facebook" because both contain "web20".
-> The result should look something like this:

"obama & clinton"
3 bla sdf asd fb la fg dfg blb ala bla bla clinton
4 b lad fg bl obama ba la dfg clinton dsf bla bla
6 b la bl obama bd fg sdf a la bla bla
7 db la dbl obama bd dfg sdf ad la bla bla
11 s ons ti ges obama sdf as df
12 s clinton ons ti ges sdf as df

"twitter & web20-web20 & facebook-mircosoft & facebook"
2 b la bl twitter ba la bla bl dfg a
8 bla df gd sfg blb ala bla bla twitter
10 twitter s ons web20 ti ges sdf as df
5 bla blb dfg dfg ala bla bla ds fg mircosoft
13 s mircosoft ons facebook ti ges sdf as df
1 bla web20 blb ala bla bla facebook dfg

"Non-keyword items"
9 s ons ti ges sdf about as df
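
The merge described above could be sketched roughly as follows. This is not the attached script: the `$groups` input format and the function name `mergeGroupsByKeyword` are assumptions made for illustration, and the group data is typed in by hand from the current output shown in this post. The idea is to absorb every earlier group that shares a keyword with the group being processed, and to drop duplicate text ids along the way.

```php
<?php
// Hypothetical input: the four keyword groups from the current output,
// each with its keywords and the ids of the texts listed under it.
$groups = [
    ['keywords' => ['mircosoft', 'facebook'], 'ids' => [5, 13]],
    ['keywords' => ['obama', 'clinton'],      'ids' => [3, 4, 6, 7, 11, 12]],
    ['keywords' => ['twitter', 'web20'],      'ids' => [2, 8, 10]],
    ['keywords' => ['web20', 'facebook'],     'ids' => [1, 10, 13]],
];

/**
 * Merge groups that share at least one keyword (also indirectly, via a
 * third group) and remove duplicate text ids. Returns the merged groups.
 */
function mergeGroupsByKeyword(array $groups): array
{
    $merged = [];
    foreach ($groups as $group) {
        $current = [
            'labels'   => [implode(' & ', $group['keywords'])],
            'keywords' => $group['keywords'],
            'ids'      => $group['ids'],
        ];
        // Absorb every already-merged group that shares a keyword with $current.
        // foreach works on a copy of $merged, so unsetting entries here is safe.
        foreach ($merged as $i => $existing) {
            if (array_intersect($current['keywords'], $existing['keywords'])) {
                $current['labels']   = array_merge($existing['labels'], $current['labels']);
                $current['keywords'] = array_values(array_unique(
                    array_merge($existing['keywords'], $current['keywords'])
                ));
                $current['ids']      = array_values(array_unique(
                    array_merge($existing['ids'], $current['ids'])
                ));
                unset($merged[$i]);
            }
        }
        $merged[] = $current;
    }
    return array_values($merged);
}

$result = mergeGroupsByKeyword($groups);

// Build $keywordid[topic][textid]. The value is just true here; a real
// script would presumably store the text itself.
$keywordid = [];
foreach ($result as $group) {
    $topic = implode('-', $group['labels']);
    foreach ($group['ids'] as $id) {
        $keywordid[$topic][$id] = true;
    }
}
print_r($keywordid);
```

Because the merged groups never share a keyword with each other, one pass over the input is enough even when groups are only connected through a third group. The parts of a combined topic label may come out in a different order than in the example above, and the "Non-keyword items" group would have to be appended separately.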

Skills: PHP


About the Employer:
(22 reviews) Munich, Germany

Project ID: #8291007

Awarded to:

mondersky

I tried to understand the description, but I couldn't follow you. Can you please try to explain it differently?

€30 EUR in 1 day
(72 Reviews)
6.0

2 freelancers are bidding on average €25 for this job

emransl

I can do it. If you like, you can ask me for test output before hiring me. Waiting for your reply. Update: I have already solved it. Here is the output: Array ( [clinton obama ] => Array ( ...

€19 EUR in 1 day
(8 Reviews)
2.3