You will be given a group of files. These files represent the customer reviews of a group of products. These products are: Canon G3 camera, Dvd player, Jukebox, Nikon Coolpix, and Nokia [login to view URL] review files are semi-structured in a format very specific to the website that generated this data. Usually this format limits the gains that can be attained from this data. To overcome this, you need to change this format into a popular format which is the JSON format.
i have 7 year of work experience on ETL with Informatica and talend. Worked with different format. Also converted from JSON to CSV and CSV to JSON file.
I have good experience in ETL tools like pig and hive, and hence I can work on your file and change the format accordingly. Can you tell the size of the files.