## Data Science Solution Using Titanic Dataset

Overview The sinking of the Titanic is one of the most infamous shipwrecks in history. On 15 April 1912, Titanic made its first voyage. The ship sank after it collided with an iceberg. About 1500 people were killed during this incident. One…

## How Do I Begin A Career in Data Science

“Data is the new oil. We need to find it, extract it, refine it, distribute it and monetize it” – David Buckingham Table of Contents Overview Introduction to Data Science Careers in Data Science Skillsets and their roles Where do I start?…

## Naive Bayes Algorithm In Python

In spite of the greatest advancement in machine learning in last few years, Naive Bayes classifier has proved out to be one of the most simple, accurate and reliable algorithms which are widely used in industrial applications. It works…

## Comparison Between One Way and Two Way ANOVA

When it comes to research, in the field of business, economics, psychology, sociology, biology, etc. the Analysis of Variance, shortly known as ANOVA is an extremely important tool for analysis of data (both One Way and Two Way ANOVA is…

## Support Vector Machine

Table of Contents Introduction How does Support Vector Machine work (SVM) ? Kernel trick Implementing SVM in Python Advantages and Disadvantages Applications Introduction Support Vector Machine (SVM) is a popular supervised machine learning…

## K – Means Clustering Algorithm

“What gets measured, gets managed ” – Peter Drucker Table of Contents Introduction Types of clustering When to use it? Pseudocode How does it work? Distance measures Important hyper-parameters Choosing the optimal k value Picking the…

## Random Forest Algorithm

“Data will talk to you if you are willing to listen to it” – Jim Bergeson Table of Contents Real-time analogy Properties How does it work? Pseudocode Feature importance Important Hyperparameters Implementing Random Forest in Python…

## Decision Tree In Machine Learning

You will be amazed if I tell you that a decision tree has many analogies in real life and has an influence on a wide area of machine learning. There are a bazillion gazillion applications which include detection of Fraudulent financial…

## K Fold Cross Validation

“If you torture the data long enough, it will confess” – Ronald Coase For any machine learning model you design, what is the most common and the important thing you expect from it. Yes, the expected accuracy rate. Conventionally, you used…