changeset 2:2f9619adab91 draft

planemo upload commit 76744a97a6f837ee60104321867e6451fec78d74-dirty
author iuc
date Sat, 05 Nov 2016 10:37:44 -0400
parents b573ecb2ca66
children 83b998fa34ea
files tool-data/blastdb_d.loc.sample tool-data/blastdb_p.loc.sample
diffstat 2 files changed, 44 insertions(+), 57 deletions(-) [+]
line wrap: on
line diff
--- a/tool-data/blastdb_d.loc.sample	Sat Nov 05 10:31:30 2016 -0400
+++ /dev/null	Thu Jan 01 00:00:00 1970 +0000
@@ -1,57 +0,0 @@
-# This is a sample file distributed with Galaxy that is used to define a
-# list of protein domain databases, using three columns tab separated
-# (longer whitespace are TAB characters):
-#
-# <unique_id>{tab}<database_caption>{tab}<base_name_path>
-#
-# The captions typically contain spaces and might end with the build date.
-# It is important that the actual database name does not have a space in
-# it, and that there are only two tabs on each line.
-#
-# You can download the NCBI provided databases as tar-balls from here:
-# ftp://ftp.ncbi.nih.gov/pub/mmdb/cdd/little_endian/
-#
-# For simplicity, many Galaxy servers are configured to offer just a live
-# version of each NCBI BLAST database (updated with the NCBI provided
-# Perl scripts or similar). In this case, we recommend using the case
-# sensistive base-name of the NCBI BLAST databases as the unique id.
-# Consistent naming is important for sharing workflows between Galaxy
-# servers.
-#
-# For example, consider the NCBI Conserved Domains Database (CDD), where
-# you have downloaded and decompressed the files under the directory
-# /data/blastdb/domains/ meaning at the command line BLAST+ would be
-# run as follows any would look at the files /data/blastdb/domains/Cdd.*:
-#
-# $ rpsblast -db /data/blastdb/domains/Cdd -query ...
-#
-# In this case use Cdd (title case to match the NCBI file naming) as the
-# unique id in the first column of blastdb_d.loc, giving an entry like
-# this:
-#
-# Cdd{tab}NCBI Conserved Domains Database (CDD){tab}/data/blastdb/domains/Cdd
-#
-# Your blastdb_d.loc file should include an entry per line for each "base name"
-# you have stored. For example:
-#
-# Cdd{tab}NCBI CDD{tab}/data/blastdb/domains/Cdd
-# Kog{tab}KOG (eukaryotes){tab}/data/blastdb/domains/Kog
-# Cog{tab}COG (prokaryotes){tab}/data/blastdb/domains/Cog
-# Pfam{tab}Pfam-A{tab}/data/blastdb/domains/Pfam
-# Smart{tab}SMART{tab}/data/blastdb/domains/Smart
-# Tigr{tab}TIGR	/data/blastdb/domains/Tigr
-# Prk{tab}Protein Clusters database{tab}/data/blastdb/domains/Prk
-# ...etc...
-#
-# Alternatively, rather than a "live" mirror of the NCBI databases which
-# are updated automatically, for full reproducibility the Galaxy Team
-# recommend saving date-stamped copies of the databases. In this case
-# your blastdb_d.loc file should include an entry per line for each
-# version you have stored. For example:
-#
-# Cdd_05Jun2010{tab}NCBI CDD 05 Jun 2010{tab}/data/blastdb/domains/05Jun2010/Cdd
-# Cdd_15Aug2010{tab}NCBI CDD 15 Aug 2010{tab}/data/blastdb/domains/15Aug2010/Cdd
-# ...etc...
-#
-# See also blastdb.loc which is for any nucleotide BLAST database, and
-# blastdb_p.loc which is for any protein BLAST databases.
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/tool-data/blastdb_p.loc.sample	Sat Nov 05 10:37:44 2016 -0400
@@ -0,0 +1,44 @@
+# This is a sample file distributed with Galaxy that is used to define a
+# list of protein BLAST databases, using three columns tab separated:
+#
+# <unique_id>{tab}<database_caption>{tab}<base_name_path>
+#
+# The captions typically contain spaces and might end with the build date.
+# It is important that the actual database name does not have a space in
+# it, and that there are only two tabs on each line.
+#
+# You can download the NCBI provided protein databases like NR from here:
+# ftp://ftp.ncbi.nlm.nih.gov/blast/db/
+#
+# For simplicity, many Galaxy servers are configured to offer just a live
+# version of each NCBI BLAST database (updated with the NCBI provided
+# Perl scripts or similar). In this case, we recommend using the case
+# sensistive base-name of the NCBI BLAST databases as the unique id.
+# Consistent naming is important for sharing workflows between Galaxy
+# servers.
+#
+# For example, consider the NCBI "non-redundant" protein BLAST database
+# where you have downloaded and decompressed the files under /data/blastdb/
+# meaning at the command line BLAST+ would be run with something like
+# which would look at the files /data/blastdb/nr.p*:
+#
+# $ blastp -db /data/blastdb/nr -query ...
+#
+# In this case use nr (lower case to match the NCBI file naming) as the
+# unique id in the first column of blastdb_p.loc, giving an entry like
+# this:
+#
+# nr{tab}NCBI non-redundant (nr){tab}/data/blastdb/nr
+#
+# Alternatively, rather than a "live" mirror of the NCBI databases which
+# are updated automatically, for full reproducibility the Galaxy Team
+# recommend saving date-stamped copies of the databases. In this case
+# your blastdb_p.loc file should include an entry per line for each
+# version you have stored. For example:
+#
+# nr_05Jun2010{tab}NCBI NR (non redundant) 05 Jun 2010{tab}/data/blastdb/05Jun2010/nr
+# nr_15Aug2010{tab}NCBI NR (non redundant) 15 Aug 2010{tab}/data/blastdb/15Aug2010/nr
+# ...etc...
+#
+# See also blastdb.loc which is for any nucleotide BLAST database, and
+# blastdb_d.loc which is for any protein domains databases (like CDD).