Description
Upon completing this book, you will have the knowledge and skills to seamlessly implement large-scale batch and streaming workloads to analyze real-time data streams with Apache Spark.
What You Will Learn
- Master the concepts of Spark clusters and batch data processing
- Understand data ingestion, transformation, and data storage
- Gain insight into essential stream processing concepts and different streaming architectures
- Implement streaming jobs and applications with Spark Streaming
Who This Book Is ForData engineers, data analysts, machine learning engineers, Python and R programmers
Author: Alfonso AntolĂnez GarcĂa
Publisher: Apress
Published: 06/06/2023
Pages: 403
Binding Type: Paperback
Weight: 1.60lbs
Size: 10.00h x 7.00w x 0.86d
ISBN13: 9781484293799
ISBN10: 1484293797
BISAC Categories:
- Computers | Information Theory
- Computers | Artificial Intelligence | General
- Computers | Languages | Python
About the Author
Alfonso AntolĂnez GarcĂa is a senior IT manager with a long professional career serving in several multinational companies such as Bertelsmann SE, Lafarge, and TUI AG. He has been working in the media industry, the building materials industry, and the leisure industry. Alfonso also works as a university professor, teaching artificial intelligence, machine learning, and data science. In his spare time, he writes research papers on artificial intelligence, mathematics, physics, and the applications of information theory to other sciences.