•
Proficient in Core JAVA
(Multithreading and Collections framework)
•
Proficient in cleaning,
transforming, and analyzing vast amounts of raw data from various systems using
Spark to provide ready-to-use data to our feature developers and business
analysts
•
Proficient in designing
efficient data processing pipelines by writing suitable Spark batch and
streaming Jobs involving various transformations and aggregations
•
Proficient in Spark data
structures and APIs based on RDDs, Data frame and Datasets
•
Proficient in Spark query
tuning and performance optimization