Jun 19, 2020 | Data Science, Machine Learning, Python Programming

In this blog, we will see how to impute a categorical variable using the KNN technique in Python. Pre-read: K Nearest Neighbour Machine Learning Algorithm. Missing Value Imputation of Categorical...
Jun 16, 2020 | Data Science, Machine Learning, Python Programming, R Programming

In statistics, imputation is the process of substituting the missing values in the data with some appropriate values. Why impute the missing value? Because statistical packages discard the...
Jun 15, 2020 | Data Science, Python Programming, Statistics

Bar Plot and Box Plot: When we analyze two variables, one categorical and the other continuous, the objective is often to see the sum or mean of the continuous variable by category and compare...
Jun 10, 2020 | Data Science, Machine Learning, Python Programming, R Programming

Outlier: Outliers are the extreme values in the data. If the value of a variable is too large or too small, i.e., if the value is beyond a certain acceptable range, then we consider that value to be an...
Jun 10, 2020 | Artificial Intelligence, Data Science, Machine Learning, Python Programming

In this blog, we will learn to build a single-variable logistic regression using Python and also interpret the model summary output. Business Objective Overview: MyBank wishes to develop a...
Jun 9, 2020 | Data Science, Python Programming, Statistics

Analysis of Two Categorical Variables: A crosstab is the preferred way of analyzing two categorical variables. It helps you identify the affinity between the two categories. Graphically a...