I have 3 years of experience in the field of Big Data Hadoop and Spark. I have worked on the below tools and technologies like Hive, pig, Oozie, Sqoop, HDFS, Hue, data warehouse, data lake, Spark streaming, kafka.
I have worked on projects in healthcare and finance domain that includes migration of the existing data warehouse from Oracle to open source Hadoop. In that, I have prepared Hive scripts, Sqoop scripts for ingesting the data, Oozie for scheduling.
In one other project I have mainly worked on the creation of spark scripts, spark SQL, data frames, datasets and running the script over the cluster by creating the spark application.