Multi Trajectory Guided Model Inversion Attacks

Nandi, Indrajit

Multi Trajectory Guided Model Inversion Attacks

dc.contributor.author	Nandi, Indrajit
dc.date.accessioned	2025-07-15T09:10:37Z
dc.date.available	2025-07-15T09:10:37Z
dc.date.issued	2025-06
dc.description	Dissertation under the supervision of Dr. Sarbani Palit	en_US
dc.description.abstract	Recent advancements in deep learning have brought significant concerns regarding the privacy of training data due to overfitting and memorization, as well as a lack of defense mechanisms. In model inversion attacks, where Attackers aim to reconstruct original private training samples (i.e images) just by using the DNN model’s last layer output. Traditional black-box MI attacks face three key challenges: (1) Non-convex optimization landscapes often trap reconstructions in poor local minima, degrading output quality. (2) Black-box scenarios require excessive queries to approximate gradients, raising detection risks. (3) Unconstrained pixel-space optimization generates unrealistic artifacts since the inverse mapping lacks natural image priors. These issues collectively yield low-fidelity reconstructions that fail to capture meaningful private data, especially for complex inputs like faces or medical images. Here, Generative Adversarial Networks (GANs) come in picture which provide a lowdimensional latent space for efficient optimization while ensuring high-dimensional realism. Some recent methods have explored reinforcement learning, where the search in the latent space (learned from a GAN trained on public data) is formulated as a Markov Decision Process (MDP). In our method, we introduce: (1) a dynamics network to model state transitions in the latent space, (2) a reward network that learns directly from the model’s confidence scores, and (3) policy-value networks for efficient exploration inspired by efficient-zero V2. This learned framework automatically adapts autonomously to diverse target models, and leverages the GAN’s generative prior to ensure realistic reconstructions.	en_US
dc.identifier.citation	32p.	en_US
dc.identifier.uri	http://hdl.handle.net/10263/7559
dc.language.iso	en	en_US
dc.publisher	Indian Statistical Institute, Kolkata	en_US
dc.relation.ispartofseries	MTech(CS) Dissertation;23-07
dc.subject	Generative Adversarial Networks (GANs)	en_US
dc.subject	Markov Decision Process (MDP)	en_US
dc.title	Multi Trajectory Guided Model Inversion Attacks	en_US
dc.type	Other	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Indrajit_nandi_CS2307.pdf
Size:: 14.46 MB
Format:: Adobe Portable Document Format
Description:: Dissertations - M Tech (CS)

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Dissertations - M Tech (CS)