• No results found

Scientific publications

N/A
N/A
Protected

Academic year: 2023

Share "Scientific publications"

Copied!
2
0
0

Loading.... (view fulltext now)

Full text

(1)

Scientific publications

1. Bose, Mausumi. and Stufken, John : Optimal crossover designs when carryover effects are proportional to Direct Effects, Journal of Statistical Planning and Inference, Vol. 137, no. 2, 3291-3302, 2007.

(Journal Article)

Keywords : Direct treatment effect, Proportional carryover effects, Universal optimality

2. Bandyopadhyay, Sanghamitra; Mukhopadhyay, Anirban and Maulik, Ujjwal : An improved algorithm for clustering gene expression data, Bioinformatics, Vol.23, no.8, 2859-2865, 2007. (Journal Article) Keywords : Algorithm , Clustering gene

3. Khan, Aparajita : Integrative Clustering of Multi-View Data: Subspace Clustering, Graph Approximation to Manifold Learning. This thesis was submitted to Indian Statistical Institute, Kolkata Under the supervision of Prof. Pradipta Maji,275p, Nov 2021. (Thesis).

Series : ISI Ph. D Thesis; No. TH516

Keywords: Multi-View Clustering, Manifold Learning, Graph Approximation, Spectral Clustering

Abstract:

Multi-view data clustering explores the consistency and complementary properties of different views to uncover the natural groups present in a data set. While multiple views are expected to provide more information for an improved learning performance, they pose their own set of unique challenges. The most important problems of multi-view clustering are the high-dimensional heterogeneous nature of different views, selection of relevant and complementary views while discarding noisy and redundant ones, preventing the propagation of noise from individual views during data integration, and capturing the lower dimensional non-linear geometry of each view.

In this regard, the thesis addresses the problem of multi-view data clustering, in the presence of high-dimensional, noisy, and redundant views. In order to select the appropriate views for data clustering, some new quantitative measures are introduced to evaluate the quality of each view. While the relevance measures evaluate the compactness and separability of the clusters within each view, the redundancy measures compute the amount of information shared between two views. These measures are used to select a set of relevant and non-redundant views during multi-view data integration.

The “high-dimension low-sample size” nature of different views makes the feature space geometrically sparse and the clustering computationally expensive. The thesis addresses these challenges by performing the clustering in the low-rank joint subspaces, extracted by feature- space, graph, and manifold based approaches. In feature-space based approach, the problem of incremental update of relevant eigen spaces is addressed for multi-view data sets. This

(2)

formulation makes the extraction of joint subspace computationally less expensive compared to the principal component analysis. The graph based approaches, on the other hand, inherently take care of the data heterogeneity of different views, by modeling each view using a separate similarity graph. In order to filter out the background noise embedded in each view, a novel concept of approximate graph Laplacian is introduced, which captures the de-noised relevant information using the most informative eigen pairs of the graph Laplacian.

In order to utilize the underlying non-linear geometry of different views, the graph-based approach is judiciously integrated with the manifold optimization techniques. The optimization over Stiefel and k-means manifolds is able to capture the non-linearity and orthogonality of the cluster indicator subspaces. Finally, the problem of simultaneous optimization of the graph connectivity and clustering subspaces is addressed by exploiting the geometry and structure preserving properties of Grassmannian and symmetric positive definite manifolds.

References

Related documents

Percentage of countries with DRR integrated in climate change adaptation frameworks, mechanisms and processes Disaster risk reduction is an integral objective of

Assessing adaptation progress is critical for understanding whether and how vulnerability is changing over time and across scales and dimensions, and how adaptation interventions (or

SaLt MaRSheS The latest data indicates salt marshes may be unable to keep pace with sea-level rise and drown, transforming the coastal landscape and depriv- ing us of a

3 Collective bargaining is defined in the ILO’s Collective Bargaining Convention, 1981 (No. 154), as “all negotiations which take place between an employer, a group of employers

It examines biodiversity governance at local, national and international level – notably: policy and institutional support for community- based conservation; mainstreaming

2 The use of annual variation in temperature and precipitation to estimate the impact of climate change was pioneered by Deschenes and Greenstone (2007), who use annual

Angola Benin Burkina Faso Burundi Central African Republic Chad Comoros Democratic Republic of the Congo Djibouti Eritrea Ethiopia Gambia Guinea Guinea-Bissau Haiti Lesotho

1 For the Jurisdiction of Commissioner of Central Excise and Service Tax, Ahmedabad South.. Commissioner of Central Excise and Service Tax, Ahmedabad South Commissioner of