Fibronectin Report

Input file:
fibronectin_datatest.fasta

Number of sequences in input file:
36

Pattern of the sequence bank:
AAAGSSVSSVPTKLEVVAATPTSLLISWDA:4-6:V:1:YYRITYGETGGNSPVQEFTVPG:3:TATISGLSPGVDYTITVYA:11-12:PISINYRTGTGGSGGSHHHHHHHHGGSYPYDVPDYAPRVG*GRVGSKDIRAETVESCLAKSHTENSFTNVWKDDKTLDRYANYE

5' restriction site:
GCGGCCGC

3' restriction site:
GGTACC

Caption:

Absence of restriction sitesIncorrect number of nucleotides between the restriction sitesStop codon inside the area of interestMutation in the conserved regionsValid sequencesAmber codon in the sequence (inside the area of interest)
1 sequence(s) (2.78%)6 sequence(s) (16.67%)25 sequence(s) (69.44%)0 sequence(s) (0.00%)4 sequence(s) (11.11%)0 sequence(s)
XL2-3_PSEXSEQ-REV_34XL1_22_PSEXSEQ-REV_24
XL1_24_PSEXSEQ-REV_26
XL1_30_PSEXSEQ-REV_30
XL1_4_PSEXSEQ-REV_8
XL2-2_PSEXSEQ-REV_33
XL3-4_PSEXSEQ-REV_40
XL1_10_PSEXSEQ-REV_13
XL1_11_PSEXSEQ-REV_14
XL1_13_PSEXSEQ-REV_16
XL1_14_PSEXSEQ-REV_17
XL1_15_PSEXSEQ-REV_18
XL1_16_PSEXSEQ-REV_19
XL1_17_PSEXSEQ-REV_20
XL1_1_PSEXSEQ-REV_5
XL1_23_PSEXSEQ-REV_25
XL1_25_PSEXSEQ-REV_27
XL1_26_PSEXSEQ-REV_28
XL1_29_PSEXSEQ-REV_29
XL1_33_PSEXSEQ-REV_23
XL1_3_PSEXSEQ-REV_7
XL1_5_PSEXSEQ-REV_9
XL1_6_PSEXSEQ-REV_10
XL1_8_PSEXSEQ-REV_11
XL1_9_PSEXSEQ-REV_12
XL2-1_PSEXSEQ-REV_32
XL2-4_PSEXSEQ-REV_35
XL2-5_PSEXSEQ-REV_36
XL3-1_PSEXSEQ-REV_37
XL3-2_PSEXSEQ-REV_38
XL3-3_PSEXSEQ-REV_39
XL3-5_PSEXSEQ-REV_41
XL1_12_PSEXSEQ-REV_15
XL1_18_PSEXSEQ-REV_21
XL1_19_PSEXSEQ-REV_22
XL1_2_PSEXSEQ-REV_6

The following group of sequences are identical clones on the variable regions:

No clone was found.

Here's the distribution of the repeated sequences in variable regions:

Variable regionRepeated sequenceNumber of occurrences (percentage of valid sequences)
3YSY2 (50.00%)

The following clusters were generated by MCL:

4 sequences (100.00% of valid sequences)
XL1_12_PSEXSEQ-REV_15
XL1_18_PSEXSEQ-REV_21
XL1_19_PSEXSEQ-REV_22
XL1_2_PSEXSEQ-REV_6

Here's some statistics about the valid sequences:

Mean for the pairwise alignement scores: 25.92
Standard deviation: 3.89

Distribution of the pairwise alignment score
Pairwise Alignment ScoreNumber of occurrences
19.001
23.801
26.301
27.302
31.801

Valid protein sequences in FASTA format:

Multiple sequence alignment of the valid sequences generated by Clustal Omega:

Protein sequences with an incorrect number of nucleotides between the restriction sites in FASTA format:

Protein sequences with a stop codon in FASTA format: