An Introduction to Gradient Boosting Decision Trees

Gradient Boosting is a machine learning algorithm, used for both classification and regression problems. It works on the principle that many weak learners(eg: shallow trees) can together make a more accurate predictor. How does Gradient Boosting Work? Gradient boosting works by building simpler (weak) prediction models sequentially where each model tries to predict the error …

An Introduction to Gradient Boosting Decision Trees Read More »

Pandas iloc – How to select rows using index in DataFrames?

Pandas iloc is a method for integer-based indexing, which is used for selecting specific rows and subsetting pandas DataFrames and Series. The command to use this method is pandas.DataFrame.iloc() The iloc method accepts only integer-value arguments. However, these arguments can be passed in different ways. In this article, you will understand different methods of subsetting …

Pandas iloc – How to select rows using index in DataFrames? Read More »

Pandas drop columns using dataframe.drop and all other methods

To drop a single column or multiple columns from pandas dataframe in Python, you can use `df.drop` and other different methods. Columns from a DataFrame are dropped if they are not relevant to your analysis or problem you are trying to solve. When building a machine learning models, they are removed if it is redundant …

Pandas drop columns using dataframe.drop and all other methods Read More »

Pandas reset index – How to reset the index and convert the index to a column?

pandas.reset_index in pandas is used to reset index of the dataframe object to default indexing (0 to number of rows minus 1) or to reset multi level index. By doing so, the original index gets converted to a column. By the end of this article, you will know the different features of reset_index function, the …

Pandas reset index – How to reset the index and convert the index to a column? Read More »

Approaches to create a Pandas Dataframe in Python

In Pandas, DataFrame is the primary data structures to hold tabular data. You can create it using the DataFrame constructor pandas.DataFrame()or by importing data directly from various data sources. Tabular datasets which are located in large external databases or are present in files of different formats such as .csv files or excel files can be …

Approaches to create a Pandas Dataframe in Python Read More »

matplotlib feature image

Matplotlib Plotting Tutorial – Complete overview of Matplotlib library

Matplotlib is the most popular Python library to plot beautiful graphs. This tutorial guides you to grasp fundamental plotting through reproducible examples. Useful Posts: 1. Matplotlib Beginners Tutorial 2. Top 50 Matplotlib Plots for Data Analysis Overview This tutorial takes you through the following well-rounded concepts: 1. Plotting your first graph 2. Line style and …

Matplotlib Plotting Tutorial – Complete overview of Matplotlib library Read More »

linear regression with tensorflow

How to implement Linear Regression in TensorFlow

Linear Regression is one of the fundamental machine learning algorithms used to predict a continuous variable using one or more explanatory variables (features). In this tutorial, you will learn how to implement a simple linear regression in Tensorflow 2.0 using the Gradient Tape API. Overview In this tutorial, you will understand: Fundamentals of Linear Regression …

How to implement Linear Regression in TensorFlow Read More »

Brier Score – How to measure accuracy of probablistic predictions

Brier score is an evaluation metric that is used to check the goodness of a predicted probability score. This is very similar to the mean squared error, but only applied for prediction probability scores, whose values range between 0 and 1. Overview In this tutorial, you will understand: What is Brier score? How is Brier …

Brier Score – How to measure accuracy of probablistic predictions Read More »

Julia Programming Language for Pythonistas – A Practical Tutorial

Julia programming language tutorial is an introduction to Julia for Python programmers. It will go through the most important Python features (such as functions, basic types, list comprehensions, exceptions, generators, modules, packages, and so on) and show you how to code them in Julia IDE. By the end of this Julia tutorial, you will have …

Julia Programming Language for Pythonistas – A Practical Tutorial Read More »

dask parallel computing in python

Dask – How to handle large dataframes in python using parallel computing

Dask provides efficient parallelization for data analytics in python. Dask Dataframes allows you to work with large datasets for both data manipulation and building ML models with only minimal code changes. It is open source and works well with python libraries like NumPy, scikit-learn, etc. Let’s understand how to use Dask with hands-on examples. Dask …

Dask – How to handle large dataframes in python using parallel computing Read More »

Text Summarization Approaches

Text Summarization Approaches for NLP – Practical Guide with Generative Examples

Text summarization in NLP is the process of summarizing the information in large texts for quicker consumption. In this article, I will walk you through the traditional extractive as well as the advanced generative methods to implement Text Summarization in Python. Contents 1. Introduction 2. Types of Text Summarization 3. Text Summarization using Gensim 4. …

Text Summarization Approaches for NLP – Practical Guide with Generative Examples Read More »

Bias Variance Tradeoff Cover Image

Bias Variance Tradeoff – Clearly Explained

Bias Variance Tradeoff is a design consideration when training the machine learning model. Certain algorithms inherently have a high bias and low variance and vice-versa. In this one, the concept of bias-variance tradeoff is clearly explained so you make an informed decision when training your ML models Bias Variance Tradeoff – Clearly Explained. Photo by …

Bias Variance Tradeoff – Clearly Explained Read More »

Gradient Boosting – A Concise Introduction from Scratch

Gradient Boosting is a machine learning algorithm, used for both classification and regression problems. It works on the principle that many weak learners (eg: shallow trees) can together make a more accurate predictor. A Concise Introduction to Gradient Boosting. Photo by Zibik How does Gradient Boosting Works? Gradient boosting works by building simpler (weak) prediction …

Gradient Boosting – A Concise Introduction from Scratch Read More »

Complete Guide to Natural Language Processing (NLP) – with Practical Examples

Natural language processing (NLP) is the technique by which computers understand the human language. NLP allows you to perform a wide range of tasks such as classification, summarization, text-generation, translation and more. NLP has advanced so much in recent times that AI can write its own movie scripts, create poetry, summarize text and answer questions …

Complete Guide to Natural Language Processing (NLP) – with Practical Examples Read More »

Course Preview

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos:

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science