Machine Learning Lab (BCM601)

Machine Learning Lab (BCM601) Lab Manual - Click here

1

Histograms and Distribution Analysis

Develop a program to create histograms for all numerical features and analyze the distribution of each feature. Generate box plots for all numerical features and identify any outliers. Use California Housing dataset.

2

Correlation Matrix Analysis

Develop a program to Compute the correlation matrix to understand the relationships between pairs of features. Visualize the correlation matrix using a heatmap to know which variables have strong positive/negative correlations. Create a pair plot to visualize pairwise relationships between features. Use California Housing dataset.

3

Principal Component Analysis (PCA)

Develop a program to implement Principal Component Analysis (PCA) for reducing the dimensionality of the Iris dataset from 4 features to 2.

4

Find-S Algorithm

For a given set of training data examples stored in a .CSV file, implement and demonstrate the Find-S algorithm to output a description of the set of all hypotheses consistent with the training examples.

5

k-Nearest Neighbor Algorithm

Develop a program to implement k-Nearest Neighbor algorithm to classify the randomly generated 100 values of x in the range of [0,1]. Perform classification for k=1, 2, 3, 4, 5, 20, 30.

6

Locally Weighted Regression

Implement the non-parametric Locally Weighted Regression algorithm in order to fit data points. Select appropriate data set for your experiment and draw graphs.

7

Linear & Polynomial Regression

Develop a program to demonstrate the working of Linear Regression and Polynomial Regression. Use Boston Housing Dataset for Linear Regression and Auto MPG Dataset for Polynomial Regression.

8

Decision Tree Algorithm

Develop a program to demonstrate the working of the decision tree algorithm. Use Breast Cancer Data set for building the decision tree and apply this knowledge to classify a new sample.

9

Naive Bayesian Classifier

Develop a program to implement the Naive Bayesian classifier considering Olivetti Face Data set for training. Compute the accuracy of the classifier, considering a few test data sets.

10

K-means Clustering

Develop a program to implement k-means clustering using Wisconsin Breast Cancer data set and visualize the clustering result.