LiveLogAnalytics

Real-Time Web Server Log Processing
CSC 591 Data Intensive Computing

Team: Group 6

  • Daxkumar Amin (dkamin)
  • Khantil Choksi (khchoksi)
  • Riken Shah (rshah9)

Architecture


Steps to set up the Kinesis pipeline with DynamoDB

  • Make sure you are in the us-west-2 (US West, Oregon) AWS Region.
  • Create the EC2 instance using the CloudFormation template.
  • Create a DynamoDB table for the consumers to write their output to for analysis.
  • Producers read the raw dataset and replay it to simulate a high-velocity stream.
  • Configure conf.py for the input and output stream rates you need and for how you want to visualize the results.
  • Run the scripts on an EC2 instance with Python 3 installed.
  • Run setup.sh to install pip3 and the dependencies.
  • Producer: python ./datagenerator/producer.py
  • Consumer: python ./consumer/consumer.py <shard_id>
  • Also follow the steps in this guide: https://aws.amazon.com/blogs/big-data/perform-near-real-time-analytics-on-streaming-data-with-amazon-kinesis-and-amazon-elasticsearch-service/
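
The producer step above replays a static dataset as if it were a live stream. The snippet below is a minimal sketch of that idea, not the code in producer.py: the function name `replay`, its parameters, and the sample log lines are all hypothetical, and the `send` callable stands in for whatever actually writes to the stream (in a real pipeline it would likely wrap a boto3 Kinesis `put_record` call with a stream name, data, and partition key).

```python
import time
from typing import Callable, Iterable


def replay(lines: Iterable[str],
           send: Callable[[str], None],
           records_per_second: float,
           sleep: Callable[[float], None] = time.sleep,
           clock: Callable[[], float] = time.monotonic) -> int:
    """Pass each non-blank line to `send`, paced to at most
    `records_per_second`. Returns the number of records sent.

    `sleep` and `clock` are injectable so the pacing can be tested
    without waiting in real time.
    """
    if records_per_second <= 0:
        raise ValueError("records_per_second must be positive")

    interval = 1.0 / records_per_second
    next_due = clock()  # time at which the next record may be sent
    sent = 0

    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines in the dataset

        delay = next_due - clock()
        if delay > 0:
            sleep(delay)  # wait until the next send slot

        send(line)
        sent += 1
        next_due += interval  # schedule the following record

    return sent


if __name__ == "__main__":
    # Hypothetical sample records; the real dataset format may differ.
    sample = [
        '10.0.0.1 - - [01/Jan/2018:00:00:01 +0000] "GET / HTTP/1.1" 200 512\n',
        '10.0.0.2 - - [01/Jan/2018:00:00:01 +0000] "GET /a HTTP/1.1" 404 0\n',
        '10.0.0.3 - - [01/Jan/2018:00:00:02 +0000] "POST /b HTTP/1.1" 500 17\n',
    ]
    received = []
    count = replay(sample, received.append, records_per_second=100)
    print(count, "records sent")
```

Scheduling against `next_due` rather than sleeping a fixed interval after every send keeps the average rate close to the target even when `send` itself takes time, which is one reasonable way to honor a configured input rate.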

Screenshots:


Milestone Project Proposal

Click Here to access Project Proposal


Milestone Project Overview (Quad Chart)

Click Here to access Project Overview


Milestone Project Status Report (Quad Chart)

Click Here to access Project Status Report


Final Report & Presentation


References: