Big Data is a term for an industry that encompasses an ever-evolving set of software for analyzing data sets. Not only is Big Data revolutionizing marketing and business, but it’s also helping us gain a better understanding of our social world.
Big Data with Apache Spark and Python
• Use Data Frames and Structured Streaming in Spark 3
• Use the MLLib machine learning library to answer common data mining questions
• Understand how Spark Streaming lets your process continuous streams of data in real time
The Hadoop Ecosystem
• Process Big Data using batch
• Process Big Data using real time data
• Be familiar with the technologies in the Hadoop Stack