annotate cobraxy-9688ad27287b/COBRAxy/GSOC project submission.html @ 90:a48b2e06ebe7 draft

Uploaded
author luca_milaz
date Sun, 13 Oct 2024 11:35:56 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
90
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
1 <!DOCTYPE html>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
2 <html lang="en">
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
3 <head>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
4 <meta charset="UTF-8">
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
5 <meta name="viewport" content="width=device-width, initial-scale=1.0">
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
6 <title>Google Summer of Code 2024 - COBRAxy: COBRA and MaREA4Galaxy</title>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
7 <link href="https://cdn.jsdelivr.net/npm/bootstrap@5.3.1/dist/css/bootstrap.min.css" rel="stylesheet">
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
8 </head>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
9 <body>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
10 <div class="container my-5">
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
11 <h1 class="text-center mb-4">Google Summer of Code 2024</h1>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
12 <h2 class="text-center mb-4">COBRAxy: COBRA and MaREA4Galaxy</h2>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
13 <p><strong>National Resource for Network Biology (NRNB)</strong></p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
14 <p><strong>Mentors:</strong></p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
15 <ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
16 <li>Alex Graudenzi, alex.graudenzi@unimib.it</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
17 <li>Chiara Damiani, chiara.damiani@unimib.it</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
18 <li>Marco Antoniotti, marco.antoniotti@unimib.it</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
19 </ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
20 <p><strong>Contributor:</strong></p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
21 <ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
22 <li>Luca Milazzo (University of Milano-Bicocca) – lucmil2000@gmail.com, luca.milazzo@epfl.ch</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
23 </ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
24
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
25 <h3 class="mt-4">Project Description</h3>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
26 <p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
27 The project focused on developing an advanced Galaxy tool that enhances the data mapping capabilities of MaREA4Galaxy. The extension of this framework includes the analysis of fluxomics data, starting from a metabolic model and progressing to the representation of up-regulated fluxes on a metabolic map. This tool enables users to perform constraint-based enrichment analysis of metabolic pathways.
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
28 </p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
29 <p>The primary goals of the project were:</p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
30 <ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
31 <li>Create a flux sampling and analysis interface to allow users to work with constraint-based metabolic models (e.g., sampling algorithms, FBA, pFBA, and FVA).</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
32 <li>Adapt the existing clustering module to clusterize fluxomics data and implement additional clustering algorithms (e.g., Leiden and Louvain).</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
33 <li>Build upon the existing module for visualizing enriched reactions based on RAS to create a new module for enrichment analysis of metabolic pathways based on simulated fluxomics data, and visualize the results on the metabolic map.</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
34 </ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
35
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
36 <h3 class="mt-4">What I Did</h3>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
37 <ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
38 <li>Updated all existing modules of MaREA4Galaxy to use recent versions of Python libraries, ensuring greater future compatibility.</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
39 <li>Modified the "Custom Data Generator" tool to extract rules, reactions, bounds, and medium information from a COBRA model.</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
40 <li>Developed the "RAS to Bound" tool, which generates metabolic reaction bounds based on the RAS matrix and a growth medium (either custom or one of 26 pre-defined settings), enabling the creation of cell-specific bounds from a generic metabolic model (e.g., ENGRO2 or a custom model).</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
41 <li>Developed the "Flux Simulation" tool, allowing users to sample multiple metabolic models using cell-specific bounds, employing the CBS and OPTGP algorithms. This tool also supports flux analysis using FBA, pFBA, FVA, and biomass sensitivity analysis.</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
42 <li>Developed the "Metabolic Flux Enrichment Analysis" tool, which visualizes up-regulated fluxes identified by the "Flux Simulation" tool, compares different sub-classes identified by the clustering tool over fluxomics data, and visualizes all results on the metabolic map.</li>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
43 </ul>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
44
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
45 <h3 class="mt-4">Current State and Future Extensions</h3>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
46 <p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
47 Currently, the updated MaREA4Galaxy tool allows users to perform constraint-based enrichment analysis of metabolic pathways using RNA-seq profiles by simulating fluxomics. Additionally, users can compare different sub-populations identified by the clustering tool. The architecture minimizes computational costs by handling cell-specific models through a set of bounds, without storing complete COBRA models, which would contain a large amount of redundant information.
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
48 </p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
49 <p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
50 The implementation of the "Metabolic Flux Enrichment Analysis" tool did not leave enough time to extend the clustering module to new algorithms such as HDBSCAN, Leiden, and Louvain. This is a potential future extension to consider. Moreover, implementing a more advanced clustering grid search could further optimize clustering results.
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
51 </p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
52
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
53 <h3 class="mt-4">About the Code</h3>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
54 <p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
55 I worked on the Mercurial repository of MaREA4Galaxy, where this document is stored. I committed all my changes, as shown by the repository history, though without using any Git-like merge operations due to the limitations of the Mercurial interface.
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
56 </p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
57
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
58 <h3 class="mt-4">Conclusions</h3>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
59 <p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
60 Over the past years, I have focused on biology-related subjects, particularly metabolic fluxes and other omics data such as gene expression datasets. Through this project, I was able to apply the knowledge I have gained in constraint-based modeling, flux sampling, and omics enrichment analysis by expanding the MaREA4Galaxy tool. This experience not only enhanced my programming skills but also deepened my understanding of the real needs of biologists when working with such omics data.
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
61 </p>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
62 </div>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
63 </body>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
64 </html>
a48b2e06ebe7 Uploaded
luca_milaz
parents:
diff changeset
65