Menu

Machine Learning

Interpolation in Python

Interpolation in Python – How to interpolate missing data, formula and approaches

Interpolation can be used to impute missing data. Let’s see the formula and how to implement in Python. But, you need to be careful with this technique and try to really understand whether or not this is a valid choice for your data. Often, interpolation is applicable when the data is in a sequence or …

Interpolation in Python – How to interpolate missing data, formula and approaches Read More »

Missing Data Imputation Approaches

Missing Data Imputation Approaches | How to handle missing values in Python

Machine Learning works on the idea of garbage in – garbage out. If you put in useless junk data to the machine learning algorithm, the results will also be, well, ‘junk’. The quality and consistency of results depend on the data provided. Missing values in data degrade the quality. Why clean the data before training …

Missing Data Imputation Approaches | How to handle missing values in Python Read More »

EDA

Exploratory Data Analysis (EDA) – How to do EDA for Machine Learning Problems using Python

Exploratory Data Analysis, simply referred to as EDA, is the step where you understand the data in detail. You understand each variable individually by calculating frequency counts, visualizing the distributions, etc. Also the relationships between the various combinations of the predictor and response variables by creating scatterplots, correlations, etc. EDA is typically part of every …

Exploratory Data Analysis (EDA) – How to do EDA for Machine Learning Problems using Python Read More »

ML Modeling - Problem statement and Data description

ML Modeling – Problem statement and Data description

ML modeling is the step where machine learning is used to find patterns in data and use that learned knowledge to predict an outcome. The type of ML modeling we are going to solve in this problem is called ‘Churn Modeling’. Let’s first understand the Churn modeling problem statement and then go over the data …

ML Modeling – Problem statement and Data description Read More »

An Introduction to AdaBoost

AdaBoost – An Introduction to AdaBoost

Adaboost is one of the earliest implementations of the boosting algorithm. It forms the base of other boosting algorithms, like gradient boosting and XGBoost. This tutorial will take you through the math behind implementing this algorithm and also a practical example of using the scikit-learn Adaboost API. Contents: What is boosting? What is Adaboost? Algorithm …

AdaBoost – An Introduction to AdaBoost Read More »

How to formulate machine learning problem

Let’s understand how to define and formulate the machine learning problem (for predictive modeling) from a business problem. This structured approach should help you apply the process to most other types of predictive modeling problems at work. Introduction Often in ML teams, you will hear from the business/company departments about the problems and issues they …

How to formulate machine learning problem Read More »

Train Test Split – How to split data into train and test for validating machine learning models?

The train-test split technique is a way of evaluating the performance of machine learning models. Whenever you build machine learning models, you will be training the model on a specific dataset (X and y). Once trained, you want to ensure the trained model is capable of performing well on the unseen test data as well. …

Train Test Split – How to split data into train and test for validating machine learning models? Read More »

Data Science Roadmap – How to become a Data Scientist? (6 month self study plan)

Today, I discuss the Data Science Roadmap, the missing guide to self study machine learning. I’ll discuss what exactly you need to know and do in order to self study Data science / ML / AI / Stats. I will provide you with some of the best resources for each topic, why you need to …

Data Science Roadmap – How to become a Data Scientist? (6 month self study plan) Read More »

Why learn the math behind Machine Learning and AI?

Why learn the math behind machine learning algorithms when you can readily implement it using the python libraries like scikit-learn, h2o, statsmodels etc? This is a fair question especially coming from beginners when it is easy to implement ML with few lines of code and get the results fast. Now, you must understand that learning …

Why learn the math behind Machine Learning and AI? Read More »

Machine Learning Use Cases – The Big List of Real World Applications by Vertical and Industry

The use cases of machine learning to real world problems keeps growing as ML/AI sees increased adoption across industries. However, there are certain core use cases that add lot of value for organizations and you’ll often find them being implemented in banks, healthcare, manufacturing, product companies or by consulting organizations as well. Let’s tour of …

Machine Learning Use Cases – The Big List of Real World Applications by Vertical and Industry Read More »

Nebullvm Workflow

Nebullvm – Tutorials and benchmarks on Nebullvm, the open-source deep learning inference accelerator

Nebullvm is an open-source library that takes a deep learning model as input and outputs an optimized version that runs 5-20 times faster on your machine. Nebullvm tests multiple deep learning compilers to identify the best possible way to execute your model on your specific hardware, without impacting the accuracy of your model (GitHub link). …

Nebullvm – Tutorials and benchmarks on Nebullvm, the open-source deep learning inference accelerator Read More »

ARIMA Model – Complete Guide to Time Series Forecasting in Python

Using ARIMA model, you can forecast a time series using the series past values. In this post, we build an optimal ARIMA model from scratch and extend it to Seasonal ARIMA (SARIMA) and SARIMAX models. You will also see how to build autoarima models in python ARIMA Model – Time Series Forecasting. Photo by Cerquiera …

ARIMA Model – Complete Guide to Time Series Forecasting in Python Read More »

Text Summarization Approaches

Text Summarization Approaches for NLP – Practical Guide with Generative Examples

Text summarization in NLP is the process of summarizing the information in large texts for quicker consumption. In this article, I will walk you through the traditional extractive as well as the advanced generative methods to implement Text Summarization in Python. Contents 1. Introduction 2. Types of Text Summarization 3. Text Summarization using Gensim 4. …

Text Summarization Approaches for NLP – Practical Guide with Generative Examples Read More »

Bias Variance Tradeoff Cover Image

Bias Variance Tradeoff – Clearly Explained

Bias Variance Tradeoff is a design consideration when training the machine learning model. Certain algorithms inherently have a high bias and low variance and vice-versa. In this one, the concept of bias-variance tradeoff is clearly explained so you make an informed decision when training your ML models Bias Variance Tradeoff – Clearly Explained. Photo by …

Bias Variance Tradeoff – Clearly Explained Read More »

Julian Programming Language

Logistic Regression in Julia – Practical Guide with Examples

Julia is a powerful programming language for Machine Learning and Logistic regression is one of the most popular predictive modeling algorithms, used for binary classification. In this one, you will see the full work flow of how to implement churn modeling using Logistic regression in Julia. Logistic Regression with Julia. Photo by Sergio. This is …

Logistic Regression in Julia – Practical Guide with Examples Read More »

Course Preview

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos:

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science