We need to develop a wrapper of Apache spark for an existing application which works on a custom distributed framework. The work involves the following tasks
1) Create a job execution code that will read a JSON file which will have the list of the tasks to be executed in order.
2) Task will have a push method which will generate RDDs from task,
10 freelancers are bidding on average ₹29694 for this job
I have working in hadoop and spark. I have knowledge of RDD, DataFrame and Dataset. I'll create this application is very optimized manner and distributed in cluster.