AL-405 - Machine Learning (ML) Syllabus

Unit I : Introduction to machine learning

Scope and limitations, machine learning models, Supervised Learning, Unsupervised Learning, hypothesis space and inductive bias, evaluation, cross-validation, Dimensionality Reduction: Subset Selection, Shrinkage Methods, Principle Components Analysis, Partial Least Squares.

Previous Years questions appears in RGPV exam.

Q.1) Define Machine Learning. Explain how Machine Learning differs from traditional programming paradigms. Illustrate your answer with examples. (June-2025)

Q.2) Define unsupervised learning. Explain its significance and challenges. Discuss different approaches to clustering and dimensionality reduction, citing examples of common algorithms used. (June-2025)

Q.3) Explain the concept of Principal Components Analysis (PCA). Derive the mathematical formulation of PCA. How does PCA help in dimensionality reduction and visualization? (June-2025)

Q.4) How does supervised learning differ from unsupervised learning? Discuss. (Dec-2024)

Q.5) Briefly explain the need of Inductive Bias in decision Tree Learning. (Dec-2024)

Q.6) What is Principal Component Analysis (PCA) in machine learning? How does PCA help in reducing the dimensionality of data? (Dec-2024)

Q.7) Write a short note on: Shrinkage Methods (Dec-2024)

Q.8) Explain the difference between classification and regression models. (June-2024)

Q.9) What a hypothesis space represents in machine learning? List some basic characteristics of a hypothesis space? (June-2024)

Q.10) Discuss some common limitations of machine learning. (June-2024)

Q.11) Write a short note on: Partial Least Squares (June-2024)

Q.12) Explain Principle component analysis with Examples. (Nov-2023)

Q.13) Define Machine Learning and explain different issues of Machine Learning. (Nov-2023)

Q.14) Explain different shrinkage methods in machine learning. (Nov-2023)

Q.15) Write short notes on: Subset selection (Nov-2023)

Q.16) Write short notes on: Inductive bias (Nov-2023)

Unit II : Neural Networks

From Biology to Simulation, Neural network representation, Neural Networks as a paradigm for parallel processingPerceptron Learning, Training a perceptron, Multilayer perceptron, back propagation Algorithm, Training & Validation,Activation functions, Vanishing and Exploding Gradients.

Previous Years questions appears in RGPV exam.

Q.1) What is a neural network? Explain how a neural network is represented mathematically. Describe the role of weights, biases, and activation functions in the working of a neural network. (June-2025)

Q.2) What is a Multilayer Perceptron (MLP)? Describe its architecture and how it differs from a single-layer perceptron. Explain how hidden layers contribute to solving complex problems. (June-2025)

Q.3) Describe the gradient descent algorithm. Compare its different variants like Stochastic Gradient Descent (SGD), Mini-batch Gradient Descent, Momentum, RMSProp, and Adam optimizers. (June-2025)

Q.4) Evaluate the effectiveness of biologically inspired neural network architectures in solving complex machine learning tasks. (Dec-2024)

Q.5) What is the back propagation algorithm in neural networks? How does it compute gradients to update the weights of a neural network? (Dec-2024)

Q.6) Discuss how the Multilayer Perceptron algorithm compute outputs from inputs through its interconnected layers? (Dec-2024)

Q.7) Write a short note on: Vanishing and Exploding Gradients (Dec-2024)

Q.8) How do neural networks leverage parallel processing to perform computations efficiently? (June-2024)

Q.9) What is the Perceptron learning algorithm? Discuss how the Perceptron learning algorithm adjust the weights of connections between neurons to learn from data? (June-2024)

Q.10) Describe how neural network architectures are represented graphically or conceptually in machine learning? (June-2024)

Q.11) Write a short note on: Activation functions (June-2024)

Q.12) Define Back propagation and write an algorithm for Back Propagation with examples. (Nov-2023)

Q.13) What is a Perceptron? Explain the working of a perceptron with a neat diagram. (Nov-2023)

Q.14) Explain Multi layer perception model with neat diagram. (Nov-2023)

Q.15) Explain parallel processing perception learning in neural networks with neat diagram. (Nov-2023)

Q.16) Define Neural Networks and Explain different functions used in Neural Networks. (Nov-2023)

Q.17) Write short notes on: Training and validation (Nov-2023)

Q.18) Write short notes on: Simulation (Nov-2023)

Unit III : Supervised Learning Techniques

Decision Trees, Naive Bayes, Classification, Support vector machines for classification problems, Random forest for classification and regression problems, Linear regression for regression problems, Ordinary Least Squares Regression, Logistic Regression.

Previous Years questions appears in RGPV exam.

Q.1) Explain the concept of Support Vector Machines (SVM) for classification. What is the role of margin maximization? Discuss the importance of kernel tricks and explain different types of kernels used in SVM. (June-2025)

Q.2) What is Logistic Regression? Explain how logistic regression is used for binary classification problems. Derive the logistic function and explain how the cost function is optimized during training. (June-2025)

Q.3) Explain the bias-variance tradeoff in supervised learning. How do different supervised learning techniques like decision trees and random forests handle bias and variance? (June-2025)

Q.4) What is Naive Bayes classification in machine learning? Describe the fundamental principle behind Naive Bayes and how it calculates probabilities. (Dec-2024)

Q.5) Discuss about linear regression and derive the Individual error and Minimization functions. (Dec-2024)

Q.6) Describe the basic concept of a Random Forest and how it combines multiple decision trees for classification. (Dec-2024)

Q.7) Write a short note on: Issues in decision trees (Dec-2024)

Q.8) Explain the concept of entropy in the context of Decision Trees. How does entropy affect the construction and splitting of nodes in a Decision Tree? (June-2024)

Q.9) Explain the difference between linear and logistics regression with example. (June-2024)

Q.10) List the advantages of Support Vector Machine (SVM) and how optimal Hyper plane differ from Hyper plane. (June-2024)

Q.11) Explain how Support Vector Machine can be used for classification of linearly separable data. (Nov-2023)

Q.12) Define Decision tree. Explain Decision tree algorithm with example. (Nov-2023)

Q.13) Explain Random Forest algorithm and explain different regression problems. (Nov-2023)

Q.14) Define Decision tree. Explain Decision tree learning algorithm with example. (Nov-2023)

Q.15) Explain Logistic regression with examples. (Nov-2023)

Q.16) What is Linear Regression? Explain in detail with example and list all the assumptions to be met before starting with Linear Regression. (Nov-2023)

Q.17) Differentiate between regression and classifications. (Nov-2023)

Unit IV : Unsupervised Learning

Clustering: k-means, adaptive hierarchical clustering, Gaussian mixture model, Optimization Using Evolutionary Techniques, Number of Clusters, Advanced discussion on clustering, Expectation Maximization.

Previous Years questions appears in RGPV exam.

Q.1) Explain the k-means clustering algorithm in detail. Describe the steps involved, the objective function it minimizes, and the limitations of k-means. How does the choice of k affect the outcome? (June-2025)

Q.2) Explain the difference between hard clustering and soft clustering. Which clustering algorithms produce hard clusters and which produce soft clusters? Provide examples and use cases for each. (June-2025)

Q.3) How can Genetic Algorithms be applied to optimize clustering solutions? Discuss chromosome representation, fitness function design, and genetic operations in the context of clustering. (June-2025)

Q.4) List the applications of clustering and identify advantages and disadvantages of clustering algorithm. (Dec-2024)

Q.5) Explain the Expectation-Maximization (EM) algorithm used to train Gaussian Mixture Models. (Dec-2024)

Q.6) What are evolutionary optimization techniques in machine learning? Provide an overview of common evolutionary algorithms used for optimization tasks. (Dec-2024)

Q.7) Describe the basic concept of K-means clustering and how it partitions data into K clusters based on similarity. (June-2024)

Q.8) How does adaptive hierarchical clustering adaptively adjust the number of clusters based on the data? Discuss. (June-2024)

Q.9) Explain the difference between the E-step and M-step in the EM algorithm. How does EM handle missing or incomplete data in probabilistic models? (June-2024)

Q.10) Write a short note on: Goals of clustering (June-2024)

Q.11) Use K Means clustering to cluster the following data in to two groups. Assume cluster centroid are $m1=2$ and $m2=4$. The distance function used is Euclidean distance {2, 4, 10, 12, 3, 20, 30, 11, 25}. (Nov-2023)

Q.12) Explain Expectation Maximization algorithm with example. And also explain why we need it? (Nov-2023)

Q.13) Explain adaptive hierarchical clustering. (Nov-2023)

Q.14) What is Gaussian Mixture density estimation with example. (Nov-2023)

Q.15) Write short notes on: Cluster (Nov-2023)

Unit V : Design and Analysis of Machine Learning Experiments

Factors, response and strategy of experimentation, Guidelines for machine learning experiments, cross-validation and resampling methods, Measuring classifier performance, Hypothesis testing, comparing multiple algorithms, comparison over multiple datasets.

Previous Years questions appears in RGPV exam.

Q.1) What are the key guidelines to follow when designing machine learning experiments? Discuss the importance of reproducibility, dataset partitioning, and the control of confounding variables. How do these guidelines ensure the validity and generalizability of experimental results? (June-2025)

Q.2) Why is it important to compare machine learning models over multiple datasets? Discuss the benefits and challenges of cross-dataset validation. How can the results of comparisons over multiple datasets provide more robust insights into model performance? (June-2025)

Q.3) Discuss how to evaluate the stability of a machine learning model. What metrics or methods can be used to assess how consistent a model's performance is across different datasets, environments, or perturbations? How does model stability impact its generalization ability? (June-2025)

Q.4) Discuss the trade-off between bias and variance when designing machine learning experiments. How can experimental design mitigate the effects of high bias or high variance? Explain how this trade-off is related to overfitting and underfitting in model evaluation. (June-2025)

Q.5) Describe key elements of experimental design such as hypothesis formulation, variable manipulation, and control. (Dec-2024)

Q.6) What is the significance of comparing machine learning models over multiple datasets? (Dec-2024)

Q.7) Write a short note on: Hypothesis testing (Dec-2024)

Q.8) List the fundamental steps involved in designing and conducting a machine learning experiment. (June-2024)

Q.9) What is cross-validation in machine learning? Describe its purpose and how it helps in evaluating the performance of machine learning models. (June-2024)

Q.10) Write a short note on: Measuring classifier performance (June-2024)

Q.11) Explain hypothesis testing with examples. (Nov-2023)

Q.12) Explain resample methods of machine learning. (Nov-2023)

Q.13) What is cross validation and Explain resampling methods in machine learning? (Nov-2023)

Q.14) Write short notes on: Cross validation (Nov-2023)

Q.15) Write short notes on: Factors. (Nov-2023)

AL-405 – Machine Learning (ML)

Syllabus Content & Previous Year Questions