Data Lake on Cloud

By August 9, 2019 Case Study

Customer: One of India’s largest media companies

Problem Statement

One of India’s largest media companies uses various SaaS platforms to run their media streaming application. Hence all of the customers’ data was residing in these SaaS applications. The customer wanted to build a Data Lake to bring all their customers’ and operations’ data at one place to understand their business better

Proposed Solution

Powerup built real-time and batch ETL jobs to bring the data from varied data sources to S3. The raw data was stored in S3. The data was then populated in Redshift for further reporting while advanced analytics was run using Hadoop based ML engines on EMR. Reporting was done using QuickSight.

Cloud platform

AWS.

Technologies used

S3, DynamoDB, AWS ElasticSearch, Kibana, EMR Clusters, RedShift, QuickSight,
Lambda, Cognito, API gateway, Athena, MongoDB, Kinesis.

Leave a Reply