Category Archives: Data Science

Data Science MOOC (Will do someday)

Data Science: Discrete vs Continuous

Data Mining Deep Study – Confusion Matrix

A confusion matrix shows the number of correct and incorrect predictions made by the classification models compared by actual outcomes (target value) in the data.


Found a good lecture regarding confusion matrix with easy explanation for HIV AIDS. Video is found below and my own drawing regarding this is also given below:

Data Science Track

1. Watch all the videos in youtube regarding data science including algorithms.
2. For data mining complete a series example Rushdi Shams with WEKA
3. For basic theory:
Watch and complete UDACITY, Andrew NG  machine  learning course step by step.
Udacity course I have found much interesting than Andrew NG but I will finish both In Sha Allah
4. There is a UDEMY paid course hands on data science with python
5. For python learn from codeacademy
UBUNTU is good rather windows for python
6. Subeen vaia’s book is also good for python

To be continued

Top 5 popular data science algorithms

Top 5 popular data science algorithms:

Decision Tree
Random Fores
Association Rule Mining
Linear Regression
K-means Clustering

Data science is nothing but extracting and actionable knowledge from data:

Data Scienctist must know data architecture , machine learning, data analytics.

Machine Learning Algorithms(sample)

Unsupervised Supervised
Clustering Regression
Kmeans Linear
SVD Polynomial
PCA Decision Trees
Radom Forests

Association Analysis Classification
Apriori KNN
FP Growth Trees
Hidden Markov Model Logistic Regression
Naive Bayes

Supervised Learning: The categories of the data is already known
Unsupervised Learning: The learning process attempts to find appropriate category for the data.