diff tool-data/blastdb_p.loc.sample @ 5:22a767177ac9 draft
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/ncbi_blast_plus commit 8cb8939dadaad8e804e35128cfb7b2560eb4d9b4
author | galaxyp |
---|---|
date | Fri, 20 Jan 2017 16:00:56 -0500 |
parents | a7f1634cd624 |
children |
--- a/tool-data/blastdb_p.loc.sample Mon May 04 09:58:57 2015 -0500
+++ b/tool-data/blastdb_p.loc.sample Fri Jan 20 16:00:56 2017 -0500
@@ -1,32 +1,45 @@
 #NOTE: This file comes from the tool galaxyp/blast_plus_remote_blastp
+# This is a sample file distributed with Galaxy that is used to define a
+# list of protein BLAST databases, using three columns tab separated:
 #
-#This is a sample file distributed with Galaxy that is used to define a
-#list of protein BLAST databases, using three columns tab separated
-#(longer whitespace are TAB characters):
+# <unique_id>{tab}<database_caption>{tab}<base_name_path>
+#
+# The captions typically contain spaces and might end with the build date.
+# It is important that the actual database name does not have a space in
+# it, and that there are only two tabs on each line.
 #
-#<unique_id> <database_caption> <base_name_path>
+# You can download the NCBI provided protein databases like NR from here:
+# ftp://ftp.ncbi.nlm.nih.gov/blast/db/
 #
-#The captions typically contain spaces and might end with the build date.
-#It is important that the actual database name does not have a space in
-#it, and that there are only two tabs on each line.
-#
-#So, for example, if your database is NR and the path to your base name
-#is /data/blastdb/nr, then the blastdb_p.loc entry would look like this:
+# For simplicity, many Galaxy servers are configured to offer just a live
+# version of each NCBI BLAST database (updated with the NCBI provided
+# Perl scripts or similar). In this case, we recommend using the case
+# sensistive base-name of the NCBI BLAST databases as the unique id.
+# Consistent naming is important for sharing workflows between Galaxy
+# servers.
 #
-#nr{tab}NCBI NR (non redundant){tab}/data/blastdb/nr
+# For example, consider the NCBI "non-redundant" protein BLAST database
+# where you have downloaded and decompressed the files under /data/blastdb/
+# meaning at the command line BLAST+ would be run with something like
+# which would look at the files /data/blastdb/nr.p*:
 #
-#and your /data/blastdb directory would contain all of the files associated
-#with the database, /data/blastdb/nr.*.
+# $ blastp -db /data/blastdb/nr -query ...
 #
-#Your blastdb_p.loc file should include an entry per line for each "base name"
-#you have stored. For example:
+# In this case use nr (lower case to match the NCBI file naming) as the
+# unique id in the first column of blastdb_p.loc, giving an entry like
+# this:
+#
+# nr{tab}NCBI non-redundant (nr){tab}/data/blastdb/nr
 #
-#nr_05Jun2010 NCBI NR (non redundant) 05 Jun 2010 /data/blastdb/05Jun2010/nr
-#nr_15Aug2010 NCBI NR (non redundant) 15 Aug 2010 /data/blastdb/15Aug2010/nr
-#...etc...
+# Alternatively, rather than a "live" mirror of the NCBI databases which
+# are updated automatically, for full reproducibility the Galaxy Team
+# recommend saving date-stamped copies of the databases. In this case
+# your blastdb_p.loc file should include an entry per line for each
+# version you have stored. For example:
 #
-#You can download the NCBI provided protein databases like NR from here:
-#ftp://ftp.ncbi.nlm.nih.gov/blast/db/
+# nr_05Jun2010{tab}NCBI NR (non redundant) 05 Jun 2010{tab}/data/blastdb/05Jun2010/nr
+# nr_15Aug2010{tab}NCBI NR (non redundant) 15 Aug 2010{tab}/data/blastdb/15Aug2010/nr
+# ...etc...
 #
-#See also blastdb.loc which is for any nucleotide BLAST database, and
-#blastdb_d.loc which is for any protein domains databases (like CDD).
+# See also blastdb.loc which is for any nucleotide BLAST database, and
+# blastdb_d.loc which is for any protein domains databases (like CDD).
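
The rules in the sample file's comments (three tab-separated columns, exactly two tabs per line, no space in the database name) can be checked mechanically before a blastdb_p.loc file is deployed. The following is a minimal sketch and not part of this changeset or of Galaxy itself: the function name `parse_loc` is hypothetical, it assumes that "database name" refers to the third column (the base name path), and it additionally assumes that a repeated first column should be rejected because that column is described as a unique id.

```python
def parse_loc(text):
    """Parse blastdb_p.loc-style text into (unique_id, caption, base_path) tuples.

    Hypothetical helper for illustration; Galaxy reads .loc files itself.
    """
    entries = []
    seen_ids = set()
    for lineno, line in enumerate(text.splitlines(), start=1):
        # Skip blank lines and comment lines such as those in the sample file.
        if not line.strip() or line.startswith("#"):
            continue
        # The sample file asks for exactly two tabs on each entry line.
        if line.count("\t") != 2:
            raise ValueError(f"line {lineno}: expected exactly two tabs")
        unique_id, caption, base_path = line.split("\t")
        # Assumption: the "database name" that must not contain a space
        # is the base name path in the third column.
        if " " in base_path:
            raise ValueError(f"line {lineno}: space in base name path")
        # Assumption: ids in the first column must not repeat.
        if unique_id in seen_ids:
            raise ValueError(f"line {lineno}: duplicate id {unique_id!r}")
        seen_ids.add(unique_id)
        entries.append((unique_id, caption, base_path))
    return entries


if __name__ == "__main__":
    sample = (
        "# comment lines are ignored\n"
        "nr\tNCBI non-redundant (nr)\t/data/blastdb/nr\n"
    )
    for entry in parse_loc(sample):
        print(entry)
```

Captions are left unchecked because the sample file says they typically contain spaces.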