• Worked with multiple file formats, including Parquet, Avro, ORC, JSON, XML, and CSV
• Converting data between file formats
• Processing data in Spark with Scala
• AWS services: S3, EMR, EC2, Lambda, Kinesis Firehose, Athena, Glue, Redshift
• Apache Airflow, AWS Data Pipeline
• Hands-on experience in Scala, Python, Java, R
• IDEs: IntelliJ, Eclipse, Jupyter Notebook
• Build tools: sbt, Maven
• SCM: Bitbucket, GitHub