Generalized bridge pipeline for multimodal gene regulatory network discovery
| dc.contributor.advisor | Alam, Md. Golam Rabiul | |
| dc.contributor.advisor | Shatabda, Swakkhar | |
| dc.contributor.author | Alif, Abrar Sami Khan | |
| dc.contributor.author | Fahmid, Riyadus Salehin | |
| dc.contributor.author | Raihan, Mohammad Omar | |
| dc.contributor.author | Chowdhury, Omor Bin Amjad | |
| dc.contributor.department | Department of Computer Science and Engineering | |
| dc.date.accessioned | 2026-01-08T04:38:19Z | |
| dc.date.available | 2026-01-08T04:38:19Z | |
| dc.date.copyright | 2025 | |
| dc.date.issued | 2025-10 | |
| dc.description | Cataloged from PDF version of thesis. | |
| dc.description | Includes bibliographical references (pages 47-50). | |
| dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2025. | en_US |
| dc.description.abstract | In this thesis, a computationally generalized pipeline where multimodal gene regulatory networks (GRNs) are built by combining transcriptomic data in RNA sequencing and functional dependency data in CRISPR knockout screens. Traditional forms of GRN rely on expression data as the only tool which can not detect causal or functional significant interactions. We attempted to solve this by constructing a contrastive bridge model, where both datasets are put in the same 128-dimensional latent space. We used Maximum Mean Discrepancy (MMD) loss and a diversitypreserving loss such that patterns of modality are aligned, and meaningful biological variation is not distorted. Using these embeddings, we built multimodal GRNs, combining evidence as provided by various outlets. In order to identify statistical and functional relationships, we demonstrated Spearman co-expression correlations, GENIE3 random forest importance scores, CRISPR dependency support, and cosine similarity of embedding vectors into a single edge-weight expression. The bridgefused networks have been steady in structure and introduced new cross-modal interactions (Bridge-fused vs GENIE3) when used in both hematopoietic and lung cell data. Top hub genes in these networks scored negative on the mean CRISPR dependency score which is an indication of important functional roles and Gene Ontology enrichment analysis scored significant representation of the immune activation and metabolic processes. The implications of these findings are that the bridge pipeline offers biologically meaningful, consistent and interpretable GRNs. Overall, this framework is a generalizable and data-driven framework to integrate heterogeneous genomic datasets, which can be applied in the process of identifying significant regulators and potential therapeutic targets in a broad variety of biological settings. | en_US |
| dc.description.degree | Bachelor of Science in Computer Science and Engineering | |
| dc.description.statementofresponsibility | Abrar Sami Khan Alif | |
| dc.description.statementofresponsibility | Riyadus Salehin Fahmid | |
| dc.description.statementofresponsibility | Mohammad Omar Raihan | |
| dc.description.statementofresponsibility | Omor Bin Amjad Chowdhury | |
| dc.format.extent | 59 pages | |
| dc.identifier.other | ID 24141153 | |
| dc.identifier.other | ID 21201205 | |
| dc.identifier.other | ID 21141058 | |
| dc.identifier.other | ID 23241085 | |
| dc.identifier.uri | http://hdl.handle.net/10361/27408 | |
| dc.language.iso | en | en_US |
| dc.publisher | BRAC University | en_US |
| dc.rights | BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
| dc.subject | Multimodal integration | en_US |
| dc.subject | Contrastive learning | en_US |
| dc.subject | Network inference | en_US |
| dc.subject | RNA sequencing | en_US |
| dc.subject | CRISPR | en_US |
| dc.subject | Gene prioritization | en_US |
| dc.subject | Functional genomics | en_US |
| dc.subject | GRNs | en_US |
| dc.subject.lcsh | Gene regulatory networks. | |
| dc.subject.lcsh | Data mining. | |
| dc.subject.lcsh | Genetic regulation--Computer simulation. | |
| dc.subject.lcsh | Nucleotide sequence. | |
| dc.subject.lcsh | RNA--Analysis. | |
| dc.subject.lcsh | Gene expression. | |
| dc.title | Generalized bridge pipeline for multimodal gene regulatory network discovery | en_US |
| dc.type | Thesis | en_US |