Projects

Otsuka

Skills
Python, Hadoop, Hive, Pig Scripts, Neo4j, Sqoop Jobs, SQL Server, NoSQL, Talend
Company
Calance

• Explored different implementations in the Hadoop environment for data extraction and summarization using Hive and Pig.
• Imported and exported data into and out of HDFS using Sqoop.
• Worked on Python scripts to load data from CSV, JSON, MySQL, and Hive sources into a Neo4j graph database.
• Created Neo4j graph database nodes and relationships.
• Implemented Cypher queries to manipulate data in the Neo4j database.
• Involved in the design, analysis, implementation, testing, and support of ETL processes for Stage, ODS, and Mart.
• Used Talend as a data cleansing tool to correct data before loading it into the staging area.
• Collected and linked metadata from diverse sources, including relational databases and flat files.
• Extracted data from flat files, MS Excel, and Hive, transformed it based on user requirements using Talend, and loaded it into the target by scheduling sessions.
• Supported daily loads and worked with business users to handle rejected data.
• Implemented data cleansing for files using Talend.
• Created a data model in Postgres using a dimensional model.
• Loaded data into Postgres using a snowflake schema.
• Performed unit testing and tuned for better performance.
• Created reusable transformations and multiple mappings.