Data Processing Platform with AWS Data Lake

Advanced Data Processing Platform with AWS Data Lake for a Multinational Media & Entertainment Conglomerate.

Business Problems

• Need for a data warehousing environment for management of big data.
• Absence of a platform for active data processing.
• Requirement of ETL components for data extraction.
• Requirement to use Tableau for data visualization, as the client already held the required licenses.
• Need to master data management across external and internal integrations and multiple data sources, including unstructured data.
• Need to process raw data on AWS for comprehensive visualization.

Solution Details

• Complete data warehousing environment with cutting-edge technologies.
• Development with Informatica, Teradata, and Unix to create the data model and ETL architecture for data processing.
• Managed data loads from the database to Hadoop using Sqoop.
• Analysis, design, coding, and development of various Java/J2EE/Hadoop-based ETL components to extract and load data. Built Hadoop-based data processing software to perform analytics on data sets.
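
As an illustration of the Sqoop-based load mentioned above, here is a minimal sketch that assembles a `sqoop import` command line. It is not the project's actual job: the helper name `build_sqoop_import`, the JDBC URL, the table name, and all paths are hypothetical placeholders, and a real run would also need the matching JDBC driver or connector installed on the cluster.

```python
import shlex


def build_sqoop_import(jdbc_url, table, target_dir, username,
                       password_file, num_mappers=4):
    """Return the argument list for a `sqoop import` of one table into HDFS.

    Hypothetical helper for illustration; only standard Sqoop import
    options are used.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,             # JDBC URL of the source database
        "--username", username,
        "--password-file", password_file,  # keeps the password off the command line
        "--table", table,                  # source table to copy
        "--target-dir", target_dir,        # HDFS directory that receives the data
        "--num-mappers", str(num_mappers), # number of parallel map tasks
    ]


# All values below are placeholders, not details from the project.
cmd = build_sqoop_import(
    jdbc_url="jdbc:teradata://td-host.example.com/DATABASE=media_dw",
    table="viewership_facts",
    target_dir="/data/raw/viewership_facts",
    username="etl_user",
    password_file="/user/etl_user/.sqoop_password",
)

# Print the command as a single shell-safe string for review or scheduling.
print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps each option separate, so it can be passed to a scheduler or `subprocess.run` without shell quoting issues.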


• Effective storage environment and creation of big-data-based analytics.
• Complete data warehousing of huge datasets.
• Channeled the data pipeline and consolidated reports from different sources.
• Built ETL components to extract and migrate data from legacy systems to an Amazon S3 data lake.
• Achieved a one-stop visualization interface with Tableau that gives a clearer picture of the analysis.
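
To make the legacy-to-S3 migration more concrete, the sketch below shows one possible way to name objects in the data lake so that extracts stay grouped by source system, table, and load date. The key layout, the function name `s3_key_for_extract`, and the sample values are assumptions for illustration only; the case study does not describe the layout that was actually used.

```python
from datetime import date


def s3_key_for_extract(source_system, table, extract_date, part=0):
    """Build a date-partitioned object key for one extract file.

    Hypothetical layout: raw/<source>/<table>/dt=<YYYY-MM-DD>/part-<n>.csv
    """
    return (
        f"raw/{source_system}/{table}/"
        f"dt={extract_date.isoformat()}/part-{part:05d}.csv"
    )


# Placeholder values, not taken from the project.
key = s3_key_for_extract("legacy_crm", "subscribers", date(2020, 1, 15))
print(key)

# The upload itself is omitted here; with boto3 it could be done through
# the S3 client's upload_file(local_path, bucket, key) method.
```

A `dt=` prefix of this kind is a common convention because query engines that read from S3 can use it to limit a scan to the dates they need.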