
This repo contains Big Data Project, its about "Real Time Twitter Sentiment Analysis via Kafka, Spark Streaming, MongoDB and Django Dashboard".
The Big Data Project focusing on real-time Twitter sentiment analysis is an impressive initiative that combines several cutting-edge technologies to provide valuable insights into public sentiment. This project utilizes Apache Kafka for real-time data ingestion, Apache Spark for processing, and MongoDB for storage, all while utilizing Django to create an interactive dashboard. The ability to classify tweets into various sentiment categories—positive, negative, and neutral—via natural language processing adds significant depth to the analysis.
This repository not only highlights the architecture of the system but also serves as a detailed guide for anyone interested in setting up a similar project. The inclusion of both training and validation datasets allows for comprehensive testing and model building, making it an excellent resource for data enthusiasts and developers alike.

Django is a high-level Python web framework that encourages rapid development and clean, pragmatic design. It follows the model-view-controller (MVC) architectural pattern, providing an extensive set of built-in tools and conventions to streamline the creation of robust and scalable web applications.
A website that uses Docker for containerization to streamline development, testing, and deployment workflows. This includes features such as containerization of dependencies, automated builds and deployments, and container orchestration to ensure scalability and availability.