Department of Computer Science Publications

Multi‑view Clustering for Multi‑omics Data Using Unifed Embedding

Mohammed Hasanuzzaman, ADAPT Centre, Cork Institute of Technology, Cork, Ireland.Follow
Sayantan Mitra, Department of Computer Science, Indian Institute of Technology Patna, Bihta, Bihar 801103, India.
Sriparna Saha, Department of Computer Science, Indian Institute of Technology Patna, Bihta, Bihar 801103, India.

ORCID

0000-0003-1838-0091

Document Type

Article

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Disciplines

Computer Sciences | Databases and Information Systems | Data Science

Publication Details

Scientific Reports

Abstract

In real world applications, data sets are often comprised of multiple views, which provide consensus and complementary information to each other. Embedding learning is an effective strategy for nearest neighbour search and dimensionality reduction in large data sets. This paper attempts to learn a unified probability distribution of the points across different views and generates a unified embedding in a low-dimensional space to optimally preserve neighbourhood identity. Probability distributions generated for each point for each view are combined by conflation method to create a single unified distribution. The goal is to approximate this unified distribution as much as possible when a similar operation is performed on the embedded space. As a cost function, the sum of Kullback-Leibler divergence over the samples is used, which leads to a simple gradient adjusting the position of the samples in the embedded space. The proposed methodology can generate embedding from both complete and incomplete multi-view data sets. Finally, a multi-objective clustering technique (AMOSA) is applied to group the samples in the embedded space. The proposed methodology, Multi-view Neighbourhood Embedding (MvNE), shows an improvement of approximately 2−3% over state-of-the-art models when evaluated on 10 omics data sets.

Recommended Citation

Mitra, S., Saha, S. & Hasanuzzaman, M. Multi-view clustering for multi-omics data using unified embedding. Sci Rep 10, 13654 (2020). https://doi.org/10.1038/s41598-020-70229-1

Download

Find in your library

Included in

Databases and Information Systems Commons, Data Science Commons

COinS

DOI

https://doi.org/10.1038/s41598-020-70229-1

Department of Computer Science Publications

Multi‑view Clustering for Multi‑omics Data Using Unifed Embedding

ORCID

Document Type

Creative Commons License

Disciplines

Publication Details

Abstract

Recommended Citation

Included in

DOI

Browse

Search

Author Corner

Department of Computer Science Publications

Multi‑view Clustering for Multi‑omics Data Using Unifed Embedding

Authors

ORCID

Document Type

Creative Commons License

Disciplines

Publication Details

Abstract

Recommended Citation

Included in

Share

DOI

Browse

Search

Author Corner