Selva Prabhakaran

Selva is the Chief Author and Editor of Machine Learning Plus, with 4 Million+ readership. He has authored courses and books with100K+ students, and is the Principal Data Scientist of a global firm.

Matplotlib histogram feature

Matplotlib Histogram – How to Visualize Distributions in Python

Matplotlib histogram is used to visualize the frequency distribution of numeric array by splitting it to small equal-sized bins. In this article, we explore practical techniques that are extremely useful in your initial data analysis and plotting. Content What is a histogram? How to plot a basic histogram in python? Histogram grouped by categories in …

Matplotlib Histogram – How to Visualize Distributions in Python Read More »

Time Series Analysis

Time Series Analysis in Python – A Comprehensive Guide with Examples

Time series is a sequence of observations recorded at regular time intervals. This guide walks you through the process of analyzing the characteristics of a given time series in python. Time Series Analysis in Python – A Comprehensive Guide. Photo by Daniel Ferrandiz. Contents What is a Time Series? How to import Time Series in …

Time Series Analysis in Python – A Comprehensive Guide with Examples Read More »

Matplotlib Tutorial

Matplotlib Tutorial – A Complete Guide to Python Plot with Examples

Until next time Until next time This tutorial explains matplotlib’s way of making plots in simplified parts so you gain the knowledge and a clear understanding of how to build and modify full featured matplotlib plots. 1. Introduction Matplotlib is the most popular plotting library in python. Using matplotlib, you can create pretty much any …

Matplotlib Tutorial – A Complete Guide to Python Plot with Examples Read More »

Topic modeling visualization – How to present the results of LDA models?

In this post, we discuss techniques to visualize the output and results from topic model (LDA) based on the gensim package. Topic modeling visualization – How to present the results of LDA models? Contents Introduction Import NewsGroups Dataset Tokenize Sentences and Clean Build the Bigram, Trigram Models and Lemmatize Build the Topic Model Presenting the …

Topic modeling visualization – How to present the results of LDA models? Read More »

Top 50 matplotlib Visualizations – The Master Plots (with full python code)

A compilation of the Top 50 matplotlib plots most useful in data analysis and visualization. This list lets you choose what visualization to show for what situation using python’s matplotlib and seaborn library. Introduction The charts are grouped based on the 7 different purposes of your visualization objective. For example, if you want to picturize …

Top 50 matplotlib Visualizations – The Master Plots (with full python code) Read More »

Python @Property Explained – How to Use and When? (Full Examples)

A python @property decorator lets a method to be accessed as an attribute instead of as a method with a ‘()’. Today, you will gain an understanding of when it is really needed, in what situations you can use it and how to actually use it. Contents 1. Introduction2. What does @property do?3. When to …

Python @Property Explained – How to Use and When? (Full Examples) Read More »

Naive Bayes Feature

How Naive Bayes Algorithm Works? (with example and full code)

Naive Bayes is a probabilistic machine learning algorithm based on the Bayes Theorem, used in a wide variety of classification tasks. In this post, you will gain a clear and complete understanding of the Naive Bayes algorithm and all necessary concepts so that there is no room for doubts or gap in understanding. Contents 1. …

How Naive Bayes Algorithm Works? (with example and full code) Read More »

parallel processing python

Parallel Processing in Python – A Practical Guide with Examples

Parallel processing is a mode of operation where the task is executed simultaneously in multiple processors in the same computer. It is meant to reduce the overall processing time. In this tutorial, you’ll understand the procedure to parallelize any typical logic using python’s multiprocessing module. 1. Introduction Parallel processing is a mode of operation where …

Parallel Processing in Python – A Practical Guide with Examples Read More »

Cosine Similarity – Understanding the math and how it works (with python codes)

Cosine similarity is a metric used to measure how similar the documents are irrespective of their size. Mathematically, it measures the cosine of the angle between two vectors projected in a multi-dimensional space. The cosine similarity is advantageous because even if the two similar documents are far apart by the Euclidean distance (due to the …

Cosine Similarity – Understanding the math and how it works (with python codes) Read More »

Gensim Tutorial

Gensim Tutorial – A Complete Beginners Guide

Gensim is billed as a Natural Language Processing package that does ‘Topic Modeling for Humans’. But it is practically much more than that. It is a leading and a state-of-the-art package for processing texts, working with word vector models (such as Word2Vec, FastText etc) and for building topic models. Gensim Tutorial – A Complete Beginners …

Gensim Tutorial – A Complete Beginners Guide Read More »

Lemmatization Approaches with Examples in Python

Lemmatization is the process of converting a word to its base form. The difference between stemming and lemmatization is, lemmatization considers the context and converts the word to its meaningful base form, whereas stemming just removes the last few characters, often leading to incorrect meanings and spelling errors. Comparing Lemmatization Approaches in Python. Photo by …

Lemmatization Approaches with Examples in Python Read More »

Feature Selection – Ten Effective Techniques with Examples

In machine learning, Feature selection is the process of choosing variables that are useful in predicting the response (Y). It is considered a good practice to identify which features are important when building predictive models. In this post, you will see how to implement 10 powerful feature selection approaches in R. Introduction 1. Boruta 2. …

Feature Selection – Ten Effective Techniques with Examples Read More »

LDA in Python – How to grid search best topic models?

Python’s Scikit Learn provides a convenient interface for topic modeling using algorithms like Latent Dirichlet allocation(LDA), LSI and Non-Negative Matrix Factorization. In this tutorial, you will learn how to build the best possible LDA topic model and explore how to showcase the outputs as meaningful results. Contents 1. Introduction 2. Load the packages 3. Import …

LDA in Python – How to grid search best topic models? Read More »

Caret Package

Caret Package – A Practical Guide to Machine Learning in R

Caret Package is a comprehensive framework for building machine learning models in R. In this tutorial, I explain nearly all the core features of the caret package and walk you through the step-by-step process of building predictive models. Be it a decision tree or xgboost, caret helps to find the optimal model in the shortest …

Caret Package – A Practical Guide to Machine Learning in R Read More »

Course Preview

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos:

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science