On coreset construction for K-means clustering of flats and hyperplanes

dc.contributor.authorMukherjee, Abhisek
dc.date.accessioned2022-01-21T08:09:54Z
dc.date.available2022-01-21T08:09:54Z
dc.date.issued2020-06
dc.descriptionDissertation under the supervision of Dr. Arijit Ghosh & Dr. Arijit Bishnuen_US
dc.description.abstractCoreset is an important tool to effectively extract information from large amount of data by sampling only a few elements from it, without any substantial loss of the actual information. An -coreset is defined as a weighted set C obtained from an universe X, so that for any solution set Q for a problem (referred to as a query in coreset literature), jCost(X;Q) 􀀀 Cost(C;Q)j Cost(X;Q). Our work is an attempt to generalize the solution provided in the paper “k- Means Clustering of Lines for Big Data” (Marom and Feldman, NIPS, 2019), and explore if it is possible to extend to k-flats in Rd as well. Following the approach used in the paper mentioned, we will attempt at building a deterministic algorithm to compute an -coreset whose size is near logarithmic of the input size for a j-dimensional affine subspace in Rd.en_US
dc.identifier.citation37p.en_US
dc.identifier.urihttp://hdl.handle.net/10263/7246
dc.language.isoenen_US
dc.publisherIndian Statistical Institute, Kolkataen_US
dc.relation.ispartofseriesDissertation;;2020;32
dc.subjectThe minimum enclosing ball (MEB)en_US
dc.subjectD2 samplingen_US
dc.titleOn coreset construction for K-means clustering of flats and hyperplanesen_US
dc.typeOtheren_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Abhishek-Mukherjee-2018-20-Dissertation.pdf
Size:
534.13 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: