immuneml_tools: immuneml_train_repert.xml comparison

comparison immuneml_train_repert.xml @ 6:2d3dd9ff7e84 draft

"planemo upload commit 74f2bd15d2b7723c8e5a22d743913706dc7d8333-dirty"

author	immuneml
date	Tue, 27 Jul 2021 09:30:50 +0000
parents	ed3932e6d616
children	45ca02982e1f

comparison

equal deleted inserted replaced

-:48569213d91c
+:2d3dd9ff7e84
 a disease status. One or more ML models are trained to classify repertoires based on the information within the sets of CDR3 sequences. Finally, the performance
 of the different methods is compared.
 Alternatively, if you want to predict a property per immune receptor, such as antigen specificity, check out the
 `Train immune receptor classifiers (simplified interface) <https://galaxy.immuneml.uio.no/root?tool_id=immuneml_train_classifiers>`_ tool instead.
-The full documentation can be found `here <https://docs.immuneml.uio.no/galaxy/galaxy_simple_repertoires.html>`_.
+The full documentation can be found `here <https://docs.immuneml.uio.no/latest/galaxy/galaxy_simple_repertoires.html>`_.
 **Basic terminology**
 In the context of ML, the characteristics to predict per repertoire are called **labels** and the values that these labels can take on are **classes**.
 One could thus have a label named ‘CMV_status’ with possible classes ‘positive’ and ‘negative’. The labels and classes must be present in the metadata
 file, in columns where the header and values correspond to the label and classes respectively.
-.. image:: https://docs.immuneml.uio.no/_images/metadata_repertoire_classification.png
+.. image:: https://docs.immuneml.uio.no/latest/_images/metadata_repertoire_classification.png
 :height: 150
 |
 When training an ML model, the goal is for the model to learn **signals** within the data which discriminate between the different classes. An ML model
 groups of similar receptors or short CDR3 subsequences in an immune repertoire. Our assumptions about what makes up a ‘signal’ determines how we
 should represent our data to the ML model. This representation is called **encoding**. In this tool, the encoding is automatically chosen based on
 the user's assumptions about the dataset.
-.. image:: https://docs.immuneml.uio.no/_images/repertoire_classification_overview.png
+.. image:: https://docs.immuneml.uio.no/latest/_images/repertoire_classification_overview.png
 :height: 500
 |
 |
 encoding, the CDR3 regions are divided into overlapping subsequences and the (disease) signal may be characterized by the presence or absence of
 certain sequence motifs in the CDR3 regions. Here, two similar CDR3 sequences are no longer independent, because they contain many identical subsequences.
 A graphical representation of how a CDR3 sequence can be divided into k-mers, and how these k-mers can relate to specific positions in a 3D immune receptor
 (here: antibody) is shown in this figure:
-.. image:: https://docs.immuneml.uio.no/_images/3mer_to_3d.png
+.. image:: https://docs.immuneml.uio.no/latest/_images/3mer_to_3d.png
 :height: 250
 |
 The subsequences may be position-dependent or invariant. Position invariant means that if a subsequence, e.g., ‘EDNA’ occurs in different positions
 - Archive: repertoire classification: a .zip file containing the complete output folder as it was produced by immuneML. This folder
 contains the output of the TrainMLModel instruction including all trained models and their predictions, and report results.
 Furthermore, the folder contains the complete YAML specification file for the immuneML run, the HTML output and a log file.
 - optimal_ml_settings.zip: a .zip file containing the raw files for the optimal trained ML settings (ML model, encoding).
-This .zip file can subsequently be used as an input when `applying previously trained ML models to a new AIRR dataset in Galaxy <https://docs.immuneml.uio.no/galaxy/galaxy_apply_ml_models.html>`_.
+This .zip file can subsequently be used as an input when `applying previously trained ML models to a new AIRR dataset in Galaxy <https://docs.immuneml.uio.no/latest/galaxy/galaxy_apply_ml_models.html>`_.
 - repertoire_classification.yaml: the YAML specification file that was used by immuneML internally to run the analysis. This file can be
 downloaded, altered, and run again by immuneML using the  `Train machine learning models <https://galaxy.immuneml.uio.no/root?tool_id=immuneml_train_ml_model>`_ Galaxy tool.
 **More analysis options**

Mercurial > repos > immuneml > immuneml_tools

comparison immuneml_train_repert.xml @ 6:2d3dd9ff7e84 draft