Table showing the filtering steps in the order they are applied, with the number and percentage of sequences remaining after each step.

Input: The number of sequences in the original IMGT file. This is always 100% of the sequences.

After "no results" filter: IMGT classifies each sequence as "productive", "unproductive", "unknown", or "no results". This row reports the number and percentage of sequences that are not classified as "no results".

After functionality filter: The number and percentage of sequences that passed the functionality filter. Which sequences are removed depends on the settings of the functionality filter. Details on the functionality filter can be found on the start page of the SHM&CSR pipeline.

After removal sequences that are missing a gene region: In this step, all sequences that are missing a gene region (FR1, CDR1, FR2, CDR2, FR3) that should be present are removed from the analysis. Which regions should be present depends on the setting of the "sequence starts at" filter. The number and percentage of sequences that pass this filter step are reported.
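As a rough illustration of this step, the sketch below keeps only records in which every required region is present and non-empty. The record layout (a dict mapping region name to sequence string), the contents of REQUIRED_REGIONS, and the function name are assumptions made for this example; they are not taken from the pipeline's code.

```python
# Hypothetical mapping from the "sequence starts at" setting to the regions
# that must be present. The keys are illustrative, not the pipeline's options.
REQUIRED_REGIONS = {
    "FR1": ["FR1", "CDR1", "FR2", "CDR2", "FR3"],
    "CDR1": ["CDR1", "FR2", "CDR2", "FR3"],
    "FR2": ["FR2", "CDR2", "FR3"],
    "CDR2": ["CDR2", "FR3"],
}


def has_all_regions(record, starts_at):
    """Return True if every required region is present and non-empty."""
    return all(record.get(region) for region in REQUIRED_REGIONS[starts_at])


# Two made-up records; seq2 has an empty FR1.
records = [
    {"id": "seq1", "FR1": "gaggtg", "CDR1": "ggattc", "FR2": "atgagc",
     "CDR2": "attagt", "FR3": "tactac"},
    {"id": "seq2", "FR1": "", "CDR1": "ggattc", "FR2": "atgagc",
     "CDR2": "attagt", "FR3": "tactac"},
]

kept = [r for r in records if has_all_regions(r, "FR1")]
print([r["id"] for r in kept])
```

With the setting "FR1" only seq1 is kept. With the setting "CDR1" both records would pass, because FR1 is then no longer required.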

After N filter: In this step, all sequences that contain an ambiguous base (n) in the analysed region or in the CDR3 are removed from the analysis. The analysed region is determined by the setting of the "sequence starts at" filter. The number and percentage of sequences that pass this filter step are reported.
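The N filter can be pictured with the following minimal sketch. It assumes each record is a dict of region name to nucleotide string and that the analysed regions are passed in as a list; both the layout and the function name are hypothetical, and the real pipeline may apply the check differently.

```python
def passes_n_filter(record, analysed_regions):
    """Return True if neither the analysed regions nor the CDR3 contain 'n'."""
    regions = list(analysed_regions) + ["CDR3"]
    # Lower-case each region so that 'N' and 'n' are treated the same.
    return not any("n" in record.get(region, "").lower() for region in regions)


# Made-up records: seq_b has an ambiguous base in CDR3, seq_c in FR1.
example = [
    {"id": "seq_a", "FR1": "gaggtg", "CDR1": "ggattc", "CDR3": "gcgaga"},
    {"id": "seq_b", "FR1": "gaggtg", "CDR1": "ggattc", "CDR3": "gcnaga"},
    {"id": "seq_c", "FR1": "gangtg", "CDR1": "ggattc", "CDR3": "gcgaga"},
]

# Here the analysed region is assumed to start at CDR1, so FR1 is ignored.
passed = [r["id"] for r in example if passes_n_filter(r, ["CDR1"])]
print(passed)
```

In this example seq_c passes because its ambiguous base lies in FR1, outside the analysed region, while seq_b is removed because of the n in its CDR3.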

After filter unique sequences: The number and percentage of sequences that pass the "filter unique sequences" filter. Details on this filter can be found on the start page of the SHM&CSR pipeline.

After remove duplicate based on filter: The number and percentage of sequences that passed the "remove duplicate based on" filter. Details on this filter can be found on the start page of the SHM&CSR pipeline.

Number of matched sequences: The number and percentage of sequences that passed all the filters described above and have a (sub)class assigned.

Number of unmatched sequences: The number and percentage of sequences that passed all the filters described above and do not have a (sub)class assigned.
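To make the layout of the table concrete, the sketch below turns a list of per-step counts into rows of count and percentage. It assumes that every percentage is expressed relative to the input count (consistent with the input row always being 100%); the function name and the counts are invented for illustration and are not output from the pipeline.

```python
def summarise(steps):
    """Return (label, count, percentage of input) for each filtering step."""
    input_count = steps[0][1]
    rows = []
    for label, count in steps:
        # Guard against an empty input file to avoid dividing by zero.
        pct = round(100.0 * count / input_count, 2) if input_count else 0.0
        rows.append((label, count, pct))
    return rows


# Illustrative counts only.
steps = [
    ("Input", 1000),
    ('After "no results" filter', 950),
    ("After functionality filter", 800),
]

rows = summarise(steps)
for label, count, pct in rows:
    print(f"{label}: {count} ({pct}%)")
```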