Dissertations - M Tech (CS)

Permanent URI for this collectionhttps://dspace.isical.ac.in/handle/10263/2147

These Dissertations were submitted in partial fulfilment of the requirements for the award of M TECH (Computer Science) Degree of Indian Statistical Institute

Browse

Search Results

Now showing 1 - 10 of 578
  • Thumbnail Image
    Item
    Tkbe : Two Key Broadcast Encryption For The Iot
    (2023-07) Parikh, Rachit
    The growing usage of the Internet of Things (IoT) has made it necessary to ensure the security of these interconnected devices. Key management becomes particularly challenging when devices are not always online due to resource constraints or business decisions. Moreover, the IoT infrastructure typically relies on the publish-subscribe model for communication, which raises additional security considerations since the message broker becomes a central point of attack. Existing solutions with end-toend encryption from publisher to subscriber are either computationally expensive for resource constrained devices or compromise on the decoupling in publish/subscribe systems. This thesis tackles the problem of efficient key management in IoT systems by employing techniques from broadcast encryption and proposes a lightweight framework - TKBE (Two Key Broadcast Encryption) that reduces trust in the broker and enhances security in IoT communications while providing efficient immediate revocation with decoupling and offline key updates.
  • Thumbnail Image
    Item
    Sentiment Analysis on Hindi-English Code-mix Data
    (Indian Statistical Institute, 2023-06-23) Singh Bisht, Balwant
    Social media has emerged as a prominent platform for expressing opinions, leading to the development of a unique language known as code-mix text. This form of language incorporates words from multiple languages, such as Hindi and English in India. While sentiment analysis techniques have achieved moderate success in handling English texts, the same level of effectiveness has not been attained when dealing with code-mix text. In this study, we propose deep learning techniques to address the challenges of sentiment analysis in code-mix Hindi-English text data. Leveraging a pre-trained cross-lingual large language model called XLM-RoBERTa, we employ a transfer learning approach. Four distinct approaches are employed to train the model for sentiment analysis on a Hinglish dataset. The first approach involves training the model using the Hinglish dataset exclusively. The other three approaches utilise mixed datasets, where one includes the augmentation of Spanish-English and Marathi-English datasets with the Hinglish dataset, the second approach solely relies on the mixed dataset without Hinglish data, while the final approach exclude the Spanish-English data. The trained models are evaluated on the same Hinglish dataset, and their performance is compared. The results indicate that the approach of increasing the training data by arbitrarily combining different kinds of mixed datasets does not yield improvements over previous findings. But combining the data of languages with similar linguistic characteristics can result in better performance. This highlights that the problem associated with scarcity of data for code-mixed languages can be effectively solved by using data of similar languages. In conclusion, our study emphasises the ongoing challenge of limited data for code-mixed languages. We demonstrate that augmenting the training data with various mixed datasets does not lead to enhanced performance but the data of similar languages can be combined to produce better outcomes. These findings provide valuable insights for future research in sentiment analysis of code-mix text.
  • Item
    Glacier velocity estimation using Adaptive Search Window and Patch size
    (Indian Statistical Institute, Kolkata, 2025-06) Patil, Pratik
    Synthetic Aperture Radar (SAR) technology offers a robust solution for monitoring glacier surface motion, particularly in regions with challenging environmental conditions, since it do not dependence on time of day and weather. This paper presents an enhanced glacier motion monitoring approach based on a Deep Matching Network (DMN), which learns patch-pair correspondences in an end-to-end manner. Unlike traditional shallow feature tracking methods, DMN utilize deep feature similarity through a Siamese network architecture with dense connection blocks to maximize feature reuse and improve training efficiency. To further improve precision and reduce computational cost, the proposed method uses a variable search window and adaptive patch sizing, enabling efficient and accurate motion estimation across diverse glacier terrains. Experimental results demonstrate the effectiveness of the proposed approach in achieving high accuracy and efficiency in glacier motion tracking on SAR data.
  • Item
    Complexity Results in Some Clustering Algorithms
    (Indian Statistical Institute, Kolkata, 2025-06) Das, Rajdeep
    Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is a prevalent Clustering method without supervision renowned for its capability to recognize arbitrarily shaped clusters and detect noise in spatial data. Unlike partitioning methods such as k-means, DBSCAN operates without inputting a predefined number of clusters and is particularly effective in handling datasets with varying densities. In this dissertation, we have undertaken an in-depth exploration of the DBSCAN algorithm. We reviewed and analyzed several research papers that build upon or revise the original DBSCAN framework, with the goal of understanding their motivations, design choices, and computational implications. In addition to studying the foundational principles, we examined traditional spatial data structures that are commonly employed to accelerate DBSCAN, such as R-trees and KD-trees. This background enabled us to identify key computational bottlenecks in both neighbor search and density estimation. Building on these insights, we proposed two novel algorithms. The first is an approximate algorithm that efficiently replicates standard DBSCAN behavior, and the second is a modified version termed Box-based DBSCAN, which operates under a slightly altered definition of neighborhood using axis-aligned bounding boxes. The box-based approach improves clustering performance for geometrically structured data and introduces new ways to identify core regions without relying on exhaustive point-wise comparisons.
  • Item
    Reducing Attention Complexity in Graph Transformers through Subgraph Partitioning
    (Indian Statistical Institute, Kolkata, 2025-06) Choubey, Ranjan Kumar
    This dissertation addresses the challenge of scaling Graph Transformers by proposing a subgraph-based strategy to reduce attention complexity. The proposed framework preserves representational power while making attention computation tractable for largescale graphs. The method begins by partitioning the input graph into K subgraphs using the METIS algorithm. Each subgraph is encoded using a combination of local structural features from a Graph Convolutional Network (GCN) and global positional cues from Laplacian Positional Embeddings (LPEs). These embeddings are fused via a trainable projection function to form subgraph tokens. A supergraph is constructed to model interactions among subgraphs, allowing attention to be applied over a K × K matrix instead of the full n × n space, thereby reducing complexity from O(n2) to O(K2). Finally, a component-aware prediction strategy maps subgraph-level predictions to individual nodes using learned weights and regularization. Empirical evaluations demonstrate that the framework delivers higher accuracy, improved convergence, and scalability across diverse benchmark datasets.
  • Item
    Efficient Blending of Large Language Models
    (Indian Statistical Institute, Kolkata, 2025-06) Chatterjee, Sandeep
    Due tothelimitedcapabilitiesofsingleLargeLanguageModels(LLMs),multipleLLMscanbe employedintandemforbetterreliabilityofanswers.Blendingreferstocombiningthestrengths of variousLLMstomakeuseoftheircomplementarycapabilitiesforgeneratinghigh-quality responses.Itisanon-trivialproblem,andthetaskbecomesevenmoredifficultwhenaiming for minimallatencyandsupervisingtheblendingcomponents.Thestandardframework,LLM- Blender, approachesthisinthreestages:responsegeneration,candidateselectionviaranking, and responsefusionthroughsummarization.However,thispipelinefacestwocriticallimita- tions—high latencyduetorepeatedrankingsteps,andheavyrelianceonexternal,supervised componentsincludingalearnedencoderforrankingandaseparatesequence-to-sequencesum- marizer forfusion. In thisthesis,weproposenovel,efficientalternativestoovercomethesechallenges.Thisthesis comprises twoworks.First,weshowthatreducingthefrequencyofrankingwithinmulti- turn conversationssignificantlyimproveslatencywithminimaldegradationinoutputquality. Second, weintroduceapeer-review-basedresponsefusionmechanism,whereLLMscollectively evaluateandreviseeachother’sresponses,removingtheneedforanyexternallytrainedrankers or summarizers.Thiscollaborativemethodenablesfullyself-containedLLMblendingwithout additional trainingorsupervision. WeassessourproposedmethodsonthetaskofConversationalQuestionAnsweringacrossfive multi-turnconversationalbenchmarks—ConvQuestions,Atlas-Converse,CoQA,QuAC,and DoQA—using tendiverse,publiclyavailableopen-weightLLMs.Experimentalresultsdemon- strate thatourpeer-review-drivenframeworkwithreducedrankingachievesqualityonparwith existing approacheswhilebeingsubstantiallymoreefficient.Ourworkpresentsasteptoward scalable, modularLLMensemblingforreal-worldopen-domaindialoguesystems.
  • Item
    Explanation and Judgement of IR Ranking using LLM
    (Indian Statistical Institute, Kolkata, 2024-06) Mondal, Santanu
    Pretrained transformer models such as BERT and T5 have significantly advanced the performance of information retrieval (IR) systems when fine-tuned with large-scale labeled datasets. However, their effectiveness diminishes notably in low-resource scenarios where annotated query-passage pairs are limited. This thesis explores an alternative supervision strategy by leveraging natural language explanations to enhance training signals during fine-tuning. We propose a novel methodology that augments traditional relevance labels with textual explanations generated by a large language model (LLM) using few-shot prompting. To achieve this, we generate explanations for 30,000 query-passage-label triples from the MS MARCO dataset using the open-source model google/gemma-2b, allowing for cost-free and scalable inference. These augmented samples are then used to fine-tune a T5-base sequence-to-sequence model, with the objective of producing both the relevance label and an accompanying explanation. During inference, the model predicts the label token, and the probability of that token is used as a soft relevance score, enabling efficient ranking. Empirical results demonstrate that our explanation-augmented retriever outperforms strong baselines, including BM25, a BERT reranker, and a T5 model trained with labels only. We further analyze the effectiveness of explanation order, training data size, and the quality of generated rationales. Our findings suggest that natural language explanations offer a powerful form of supervision, particularly valuable in data-scarce IR settings, and present a compelling direction for improving neural retrievers with minimal annotation overhead.
  • Item
    Modeling and Verification of Sigma Delta Neural Networks
    (Indian Statistical Institute, Kolkata, 2025-06) Das, Sirshendu
    In the context of modern day embedded safety-critical systems and low-resource edge devices in particular, Sigma-Delta Neural Networks (SDNNs) offer a promising alternative to traditional Artificial Neural Networks (ANNs) by leveraging eventdriven, sparse computations inspired by biological neural processing. This energyefficient paradigm makes SDNNs well-suited for neuromorphic hardware and realtime applications, particularly in scenarios with temporal redundancy, such as video processing. However, as neural networks become integral to safety-critical systems, ensuring their robustness against adversarial perturbations is an absolute necessity. In this work, we propose an end-to-end framework for formal modeling and verification of SDNNs using Satisfiability Modulo Theory (SMT). Unlike empirical robustness evaluations, SMT-based verification provides formal guarantees by encoding SDNN behavior and adversarial robustness properties as mathematical constraints. We introduce an SMT-based formulation for encoding SDNNs with SMT constraints and define a robustness property motivated by video stream processing. Our approach systematically examines how well SDNNs can handle adversarial attacks, ensuring they work correctly in safety-critical applications. We validate our framework through experiments on temporal version of the MNIST dataset. To the best of our knowledge, this is the first formal verification framework for SDNNs, bridging the gap between neuromorphic computing and rigorous verification. We also focus on applying the proposed SDNN verification methodology to a real-world deep learning system– PilotNet, an end-to-end model for steering angle prediction in autonomous vehicles.
  • Item
    Geometry Based UAV Trajectory Planning for Mixed User Traffic in mmWave Communication
    (Indian Statistical Institute, Kolkata, 2025-06) Hasan, Sk Abid
    Unmanned aerial vehicle (UAV) assisted communication is a revolutionary technology that has been recently presented as a potential candidate for beyond fifth-generation millimeter wave (mmWave) communications. Although mmWaves can o↵er a notably high data rate, their high penetration and propagation losses mean that line of sight (LoS) is necessary for e↵ective communication. Due to the presence of obstacles and user mobility, UAV trajectory planning plays a crucial role in improving system performance. In this work, we propose a novel computational geometry-based trajectory planning scheme by considering the user mobility, the priority of the delay sensitive ultra-reliable low-latency communications (URLLC) and the high throughput requirements of the enhanced mobile broadband (eMBB) traffic. Specifically, we use some geometric tools like Apollonius circle and minimum enclosing ball of balls to find the optimal position of the UAV that supports uninterrupted connections to the URLLC users and maximizes the aggregate throughput of the eMBB users. Finally, the numerical results demonstrate the benefits of the suggested approach over an existing state of the art benchmark scheme in terms of sum throughput obtained by URLLC and eMBB users.
  • Item
    Understanding Batch-Normalization in Deep Neural Networks
    (Indian Statistical Institute, Kolkata, 2025-06) Srujan, Pendyala Sai
    Batch Normalization (BN) is a commonly used technique in various deep learning architectures for tasks such as image classification and object detection. It stabilizes and accelerates training by normalizing the activations of intermediate layers using mean and variance of the batch, allowing the use of higher learning rates and often improving generalization through implicit regularization. During inference, BN uses running estimates of batch statistics accumulated during training. However, if individual batches are not representative of the overall data distribution, these accumulated statistics may not accurately approximate the population statistics. This discrepancy can lead to a phenomenon known as **estimation shift**, which impairs the model’s generalization performance. In this project, we study the behavior of estimation shift in deep learning models using BN and explore techniques to mitigate its effects. Specifically, we introduce **dynamicity** in the momentum parameter of BN layer (DMBN) while computing exponential moving averages and evaluate its impact under various architectural configurations. We use MNIST, FashionMNIST, and CIFAR-10/100 datasets to train and test both simple Deep Neural Networks (DNNs) as well as deeper Convolutional Neural Networks (CNNs) such as ResNet-50. Our experiments are conducted in two phases: first, by varying the static momentum parameter across different values, and second, by introducing layer-wise dynamic momentum where each layer is assigned the momentum (or equivalently, β) that minimizes estimation shift. The performance of the proposed method, DMBN, is evaluated using various performance metrics such as sensitivity, specificity, accuracy, and F-score. The DMBN is compared with existing BN-BFN method and is observed to be performing better in most of cases. For example, for fashionMNIST data, the accuracy values achieved by DMBN and BN-BFN are 0.889 and 0.853, respectively.