annotate COBRAxy/GSOC project submission.html @ 24:c5dbdbb64cef draft

Uploaded
author luca_milaz
date Thu, 19 Sep 2024 08:03:37 +0000
parents 41f35c2f0c7b
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
1 <!DOCTYPE html>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
2 <html lang="en">
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
3 <head>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
4 <meta charset="UTF-8">
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
5 <meta name="viewport" content="width=device-width, initial-scale=1.0">
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
6 <title>Google Summer of Code 2024 - COBRAxy: COBRA and MaREA4Galaxy</title>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
7 <link href="https://cdn.jsdelivr.net/npm/bootstrap@5.3.1/dist/css/bootstrap.min.css" rel="stylesheet">
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
8 </head>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
9 <body>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
10 <div class="container my-5">
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
11 <h1 class="text-center mb-4">Google Summer of Code 2024</h1>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
12 <h2 class="text-center mb-4">COBRAxy: COBRA and MaREA4Galaxy</h2>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
13 <p><strong>National Resource for Network Biology (NRNB)</strong></p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
14 <p><strong>Mentors:</strong></p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
15 <ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
16 <li>Alex Graudenzi, alex.graudenzi@unimib.it</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
17 <li>Chiara Damiani, chiara.damiani@unimib.it</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
18 <li>Marco Antoniotti, marco.antoniotti@unimib.it</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
19 </ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
20 <p><strong>Contributor:</strong></p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
21 <ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
22 <li>Luca Milazzo (University of Milano-Bicocca) – lucmil2000@gmail.com, luca.milazzo@epfl.ch</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
23 </ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
24
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
25 <h3 class="mt-4">Project Description</h3>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
26 <p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
27 The project focused on developing an advanced Galaxy tool that enhances the data mapping capabilities of MaREA4Galaxy. The extension of this framework includes the analysis of fluxomics data, starting from a metabolic model and progressing to the representation of up-regulated fluxes on a metabolic map. This tool enables users to perform constraint-based enrichment analysis of metabolic pathways.
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
28 </p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
29 <p>The primary goals of the project were:</p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
30 <ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
31 <li>Create a flux sampling and analysis interface to allow users to work with constraint-based metabolic models (e.g., sampling algorithms, FBA, pFBA, and FVA).</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
32 <li>Adapt the existing clustering module to clusterize fluxomics data and implement additional clustering algorithms (e.g., Leiden and Louvain).</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
33 <li>Build upon the existing module for visualizing enriched reactions based on RAS to create a new module for enrichment analysis of metabolic pathways based on simulated fluxomics data, and visualize the results on the metabolic map.</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
34 </ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
35
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
36 <h3 class="mt-4">What I Did</h3>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
37 <ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
38 <li>Updated all existing modules of MaREA4Galaxy to use recent versions of Python libraries, ensuring greater future compatibility.</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
39 <li>Modified the "Custom Data Generator" tool to extract rules, reactions, bounds, and medium information from a COBRA model.</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
40 <li>Developed the "RAS to Bound" tool, which generates metabolic reaction bounds based on the RAS matrix and a growth medium (either custom or one of 26 pre-defined settings), enabling the creation of cell-specific bounds from a generic metabolic model (e.g., ENGRO2 or a custom model).</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
41 <li>Developed the "Flux Simulation" tool, allowing users to sample multiple metabolic models using cell-specific bounds, employing the CBS and OPTGP algorithms. This tool also supports flux analysis using FBA, pFBA, FVA, and biomass sensitivity analysis.</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
42 <li>Developed the "Metabolic Flux Enrichment Analysis" tool, which visualizes up-regulated fluxes identified by the "Flux Simulation" tool, compares different sub-classes identified by the clustering tool over fluxomics data, and visualizes all results on the metabolic map.</li>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
43 </ul>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
44
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
45 <h3 class="mt-4">Current State and Future Extensions</h3>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
46 <p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
47 Currently, the updated MaREA4Galaxy tool allows users to perform constraint-based enrichment analysis of metabolic pathways using RNA-seq profiles by simulating fluxomics. Additionally, users can compare different sub-populations identified by the clustering tool. The architecture minimizes computational costs by handling cell-specific models through a set of bounds, without storing complete COBRA models, which would contain a large amount of redundant information.
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
48 </p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
49 <p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
50 The implementation of the "Metabolic Flux Enrichment Analysis" tool did not leave enough time to extend the clustering module to new algorithms such as HDBSCAN, Leiden, and Louvain. This is a potential future extension to consider. Moreover, implementing a more advanced clustering grid search could further optimize clustering results.
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
51 </p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
52
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
53 <h3 class="mt-4">About the Code</h3>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
54 <p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
55 I worked on the Mercurial repository of MaREA4Galaxy, where this document is stored. I committed all my changes, as shown by the repository history, though without using any Git-like merge operations due to the limitations of the Mercurial interface.
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
56 </p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
57
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
58 <h3 class="mt-4">Conclusions</h3>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
59 <p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
60 Over the past years, I have focused on biology-related subjects, particularly metabolic fluxes and other omics data such as gene expression datasets. Through this project, I was able to apply the knowledge I have gained in constraint-based modeling, flux sampling, and omics enrichment analysis by expanding the MaREA4Galaxy tool. This experience not only enhanced my programming skills but also deepened my understanding of the real needs of biologists when working with such omics data.
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
61 </p>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
62 </div>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
63 </body>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
64 </html>
41f35c2f0c7b Uploaded
luca_milaz
parents:
diff changeset
65