Description
Chapter 2: Data Exploration and PreparationChapter Goal: The basis for building a good Machine Learning model is to have a clear understanding and well preparedness of data. This chapter will explain ways to explore the data for understanding and how to deal with the inconsistencies present in the data. No of pages: 50Sub - Topics1. Various Data Formats2. Summary Statistics3. Missing Values4. Data Imputation5. Transforming Unstructured Data
Chapter 3: Sampling and Resampling TechniquesChapter Goal: In many real-world dataset, the biggest challenge is the sheer volume of the data. This volume makes the computational limitations more evident for building the Machine Learning Models. In order to reduce the need for computational power and at the same time not compromising the efficacy of the model, this chapter explains some sampling techniques for selecting a smaller dataset from the bigger dataset. We will also explore the idea of resampling which increases the accuracy of many Machine Learning Models.No of pages: 50Sub - Topics: 1. Simple Random Sampling2. Systematic Sampling3. Stratified Sampling4. Cluster Sampling5. Bootstrap sampling
Chapter 4: Visualization of DataChapter Goal: Visualization is a powerful tool to see through things in our data which might not be very evident when a manual exploration is carried out. This chapter will explain some of the commonly used plots and diagrams to see visually appealing insights coming out from our data. No of pages: 50Sub - Topics: 1. Scatterplot, Histogram and Box Plot2. Heat maps and Waterfall Charts3. Dendrogram for Clustering4. Bubble Chart and Word Cloud5. Sankey Diagrams6. Time Series Graphs7. Cohort Diagram
Chapter 5: Feature Engineering Chapter Goal: One more challenge in the real world dataset is the number of features it contains. There might be hundreds of feature in a dataset but not all of it is useful for building our model. So, in order to select the features which explain our dataset more than the other features, and hence give a more accurate result, we have certain well proven technique derived from statistics. The feature engineering has now become an unavoidable step in our Machine Learning Model building process.No of pages: 40Sub - Topics: 1. Feature Ranking2. Variable Subset Selection 3. Dimensionality Reduction
Chapter 6: Machine Learning Models: Theory and PracticeChapter Goal: This chapter is the core of this book. After we had the fair understanding of our data and performed the feature engineering, it's now time to build some really powerful Machine Learning Models. This chapter lists all the ML algorithms under one header. A clear demarcation will be drawn for explaining how each of these ML algorithms are different from each other and which algorithm suits the given use-cases.
Author: Karthik Ramasubramanian, Abhishek Singh
Publisher: Apress
Published: 12/13/2018
Pages: 700
Binding Type: Paperback
Weight: 2.73lbs
Size: 10.00h x 7.00w x 1.45d
ISBN13: 9781484242148
ISBN10: 1484242149
BISAC Categories:
- Computers | Computer Science
- Computers | Programming | Open Source
- Computers | Languages | General
About the Author
Karthik Ramasubramanian has over seven years' experience leading data science and business analytics in retail, FMCG, e-commerce, information technology and hospitality for multi-national companies and unicorn startups. A researcher and problem solver with a diverse set of experience in the data science life cycle, starting from a data problem discovery to creating data science PoCs and products for various industry use cases. In his leadership roles, he has been instrumental in solving many ROI-driven business problems through data science solutions. He has mentored and trained hundreds of professionals and students around the world through various online platforms and university engagement programs in data science.
He has designed, developed and spearheaded many A/B experiment frameworks for improving product features, conceptualized funnel analysis for understanding user interactions and identifying the friction points within a product, and designed statistically robust metrics. On the predictive side, he has developed intelligent chatbots based on deep learning models which understands human-like interactions, customer segmentation models, recommendation systems and many natural language processing models.His current areas of interest include ROI-driven data product development, advanced machine learning algorithms, data product frameworks, Internet of Things (IoT), scalable data platforms, and model deployment frameworks.
Karthik completed his M.Sc. (Theoretical Computer Science) from PSG College of Technology, Coimbatore (Affiliated to Anna University, Chennai), where he pioneered the application of machine learning, data mining and fuzzy logic in his research work on computer and network security.
Abhishek Singh is on a mission to profess the de facto language of this millennium, the numbers. He is on a journey to bring machine closer to human, for a better and beautiful world around us by generating opportunities with artificial intelligence and machine learning. He leads team of data science professionals who are solving pressing problems in food security, cyber security, natural disaster, healthcare and many more areas, all with help of data and technology. Abhishek is in the process of bringing smart IoT devices to smaller cities in India for people to leverage technology to improve their lives.He has worked with colleagues from many parts of the USA, Europe and Asia, and strives to work with more people from various backgrounds. In a span of six years at big corporates, he has stress tested the assets of US banks, solved insurance pricing models, and made the telecom experience easier for customers, and is now creating data science opportunities with his team of young minds.
He actively participates in analytics-related thought leadership, writing, public speaking, meet-ups and training in data science. He is staunch supporter of responsible use of AI to remove biases and fair use for a better society.
Abhishek completed his MBA from IIM Bangalore, B.Tech. (Mathematics and Computing) from IIT Guwahati, and PG Diploma (Cyber Law) from NALSAR University, Hyderabad.