B.Tech./B.Tech. (Working Professional) IV Semester
Examination, June 2025
Grading System (GS) / Working Professional
Max Marks: 70 | Time: 3 Hours
Note:
i) Answer any five questions.
ii) All questions carry equal marks.
a) Define Machine Learning. Explain how Machine Learning differs from traditional programming paradigms. Illustrate your answer with examples. (Unit 1)
b) Define unsupervised learning. Explain its significance and challenges. Discuss different approaches to clustering and dimensionality reduction, citing examples of common algorithms used. (Unit 1)
a) What is a neural network? Explain how a neural network is represented mathematically. Describe the role of weights, biases, and activation functions in the working of a neural network. (Unit 2)
b) What is a Multilayer Perceptron (MLP)? Describe its architecture and how it differs from a single-layer perceptron. Explain how hidden layers contribute to solving complex problems. (Unit 2)
a) Explain the concept of Support Vector Machines (SVM) for classification. What is the role of margin maximization? Discuss the importance of kernel tricks and explain different types of kernels used in SVM. (Unit 3)
b) What is Logistic Regression? Explain how logistic regression is used for binary classification problems. Derive the logistic function and explain how the cost function is optimized during training. (Unit 3)
a) Explain the k-means clustering algorithm in detail. Describe the steps involved, the objective function it minimizes, and the limitations of k-means. How does the choice of k affect the outcome? (Unit 4)
b) Explain the difference between hard clustering and soft clustering. Which clustering algorithms produce hard clusters and which produce soft clusters? Provide examples and use cases for each. (Unit 4)
a) What are the key guidelines to follow when designing machine learning experiments? Discuss the importance of reproducibility, dataset partitioning, and the control of confounding variables. How do these guidelines ensure the validity and generalizability of experimental results? (Unit 5)
b) Why is it important to compare machine learning models over multiple datasets? Discuss the benefits and challenges of cross-dataset validation. How can the results of comparisons over multiple datasets provide more robust insights into model performance? (Unit 5)
a) Explain the concept of Principal Components Analysis (PCA). Derive the mathematical formulation of PCA. How does PCA help in dimensionality reduction and visualization? (Unit 1)
b) Describe the gradient descent algorithm. Compare its different variants like Stochastic Gradient Descent (SGD), Mini-batch Gradient Descent, Momentum, RMSProp, and Adam optimizers. (Unit 2)
a) Explain the bias-variance tradeoff in supervised learning. How do different supervised learning techniques like decision trees and random forests handle bias and variance? (Unit 3)
b) How can Genetic Algorithms be applied to optimize clustering solutions? Discuss chromosome representation, fitness function design, and genetic operations in the context of clustering. (Unit 4)
a) Discuss how to evaluate the stability of a machine learning model. What metrics or methods can be used to assess how consistent a model's performance is across different datasets, environments, or perturbations? How does model stability impact its generalization ability? (Unit 5)
b) Discuss the trade-off between bias and variance when designing machine learning experiments. How can experimental design mitigate the effects of high bias or high variance? Explain how this trade-off is related to overfitting and underfitting in model evaluation. (Unit 5)