Skills
Python, Hadoop, Hive, Pig Scripts, Neo4j, Sqoop Jobs, SQL Server, NoSQL, Talend
Company
Calance
• Explored different implementations in the Hadoop environment for data extraction and summarization using tools such as Hive and Pig.
• Imported data into and exported data from HDFS using Sqoop.
• Worked on Python scripts to load data from CSV, JSON, MySQL, and Hive sources into the Neo4j graph database.
• Created nodes and relationships in the Neo4j graph database.
• Implemented Cypher queries to manipulate data in the Neo4j database.
• Involved in the design, analysis, implementation, testing, and support of ETL processes for Stage, ODS, and Mart.
• Used Talend as a data cleansing tool to correct data before loading it into the staging area.
• Collected and linked metadata from diverse sources, including relational databases and flat files.
• Extracted data from flat files, MS Excel, and Hive, transformed it based on user requirements using Talend, and loaded it into the target through scheduled sessions.
• Supported daily loads and worked with business users to handle rejected data.
• Implemented data cleansing for files using Talend.
• Created a data model in Postgres using a dimensional model.
• Loaded data into Postgres using a snowflake schema.
• Performed unit testing and performance tuning.
• Created reusable transformations and multiple mappings.