Machine Learning Lab Resources

1

Histograms and Distribution Analysis

Develop a program to create histograms for all numerical features and analyze the distribution of each feature. Generate box plots for all numerical features and identify any outliers. Use California Housing dataset.

Download

2

Correlation Matrix Analysis

Develop a program to Compute the correlation matrix to understand the relationships between pairs of features. Visualize the correlation matrix using a heatmap to know which variables have strong positive/negative correlations. Create a pair plot to visualize pairwise relationships between features. Use California Housing dataset.

Download

3

Principal Component Analysis (PCA)

Develop a program to implement Principal Component Analysis (PCA) for reducing the dimensionality of the Iris dataset from 4 features to 2.

Download

4

Find-S Algorithm

For a given set of training data examples stored in a .CSV file, implement and demonstrate the Find-S algorithm to output a description of the set of all hypotheses consistent with the training examples.

Download Dataset

5

k-Nearest Neighbor Algorithm

Develop a program to implement k-Nearest Neighbor algorithm to classify the randomly generated 100 values of x in the range of [0,1]. Perform classification for k=1, 2, 3, 4, 5, 20, 30.

Download

6

Locally Weighted Regression

Implement the non-parametric Locally Weighted Regression algorithm in order to fit data points. Select appropriate data set for your experiment and draw graphs.

Download

7

Linear & Polynomial Regression

Develop a program to demonstrate the working of Linear Regression and Polynomial Regression. Use Boston Housing Dataset for Linear Regression and Auto MPG Dataset for Polynomial Regression.

Download Dataset 1 Dataset 2

8

Decision Tree Algorithm

Develop a program to demonstrate the working of the decision tree algorithm. Use Breast Cancer Data set for building the decision tree and apply this knowledge to classify a new sample.

Download

9

Naive Bayesian Classifier

Develop a program to implement the Naive Bayesian classifier considering Olivetti Face Data set for training. Compute the accuracy of the classifier, considering a few test data sets.

Download

10

K-means Clustering

Develop a program to implement k-means clustering using Wisconsin Breast Cancer data set and visualize the clustering result.

Download

Machine Learning Lab (BCM601)

Histograms and Distribution Analysis

Correlation Matrix Analysis

Principal Component Analysis (PCA)

Find-S Algorithm

k-Nearest Neighbor Algorithm

Locally Weighted Regression

Linear & Polynomial Regression

Decision Tree Algorithm

Naive Bayesian Classifier

K-means Clustering

Home

UI/UX Tutorials

Generative AI

Machine Learning Lab