comparison readme.rst @ 9:70170338b4f7 draft

Uploaded
author rnateam
date Sat, 14 Feb 2015 09:28:35 -0500
parents
children
comparison
equal deleted inserted replaced
8:78b364f5d93e 9:70170338b4f7
1
2
3 This package is a Galaxy workflow for BlockClust pipeline.
4
5
6 ======
7 Galaxy
8 ======
9
10 `Galaxy <http://galaxyproject.org/>`_ is an open, web-based platform for data intensive research.
11 All tools can be combined in workflows without any need of programming skills.
12 Furthermore the platform can be extended with more tools at any time.
13 Each tool has its own information about what it does and how the input is supposed to look like.
14 You can make data available for Galaxy by uploading local files or downloading online content.
15 Inputfiles, workflowsteps and results are stored in a history where you can view them or reaccess them later.
16 It is possible to share workflows and histories with other users or make the public available.
17 Saved workflows can be used with new input files or just to rerun an analyses which ensures repeatability.
18
19
20
21 Getting Started
22 ===============
23
24 BlockClust can be installed on all common Unix systems.
25 However, it is developed on Linux and I don't have access to OS X. You are welcome to help improving this documentation, just contact_ me.
26
27 For any additional information, especially cluster configuration or general Galaxy_ questions,
28 please have a look at the Galaxy Wiki.
29
30 - http://wiki.galaxyproject.org/
31
32 - http://wiki.galaxyproject.org/Admin/
33
34 - http://galaxyproject.org/search/web/
35
36 .. _contact: https://github.com/bgruening
37 .. _Galaxy: http://galaxyproject.org/
38
39 Prerequisites::
40
41 * Python 2.6 or 2.7
42 * standard C compiler, C++ and Fortran compiler
43 * Autotools
44 * CMake
45 * cairo development files (used for PNG depictions)
46 * python development files
47 * Java Runtime Environment (JRE, used by OPSIN and NPLS)
48
49 To install all of the prerequisites you can run the following command, depending on your OS:
50
51 - Debian based systems: apt-get install build-essential gfortran cmake mercurial libcairo2-dev python-dev
52 - Fedora: yum install make automake gcc gcc-c++ gcc-gfortran cmake mercurial libcairo2-devel python-devel
53 - OS X (MacPorts_): port install gcc cmake automake mercurial cairo-devel
54
55 .. _MacPorts: http://www.macports.org/
56
57
58 ===================
59 Galaxy installation
60 ===================
61
62
63 0. Create a sand-boxed Python using virtualenv_ (not necessary but recommended)::
64
65 wget https://raw.github.com/pypa/virtualenv/master/virtualenv.py
66 python ./virtualenv.py --no-site-packages galaxy_env
67 . ./galaxy_env/bin/activate
68
69 .. _virtualenv: http://www.virtualenv.org/
70
71
72 1. Clone the latest `Galaxy platform`_::
73
74 hg clone https://bitbucket.org/galaxy/galaxy-central/
75
76 .. _Galaxy platform: http://wiki.galaxyproject.org/Admin/Get%20Galaxy
77
78 2. Navigate to the galaxy-central folder and update it::
79
80 cd ~/galaxy-central
81 hg pull
82 hg update
83
84 This step is not necessary if you have a fresh checkout. Anyway, it is good to know ;)
85
86 3. Create folders for toolshed and dependencies::
87
88 mkdir ~/shed_tools
89 mkdir ~/galaxy-central/tool_deps
90
91 4. Create configuration file::
92
93 cp ~/galaxy-central/universe_wsgi.ini.sample ~/galaxy-central/universe_wsgi.ini
94
95 5. Open universe_wsgi.ini and change the dependencies directory::
96
97 LINUX: gedit ~/galaxy-central/universe_wsgi.ini
98 OS X: open -a TextEdit ~/galaxy-central/universe_wsgi.ini
99
100 6. Search for ``tool_dependency_dir = None`` and change it to ``tool_dependency_dir = ./tool_deps``, remove the ``#`` if needed
101
102 7. Remove the ``#`` in front of ``tool_config_file`` and ``tool_path``
103
104 8. (Re-)Start the galaxy daemon::
105
106 sh run.sh --reload
107
108 In deamon mode all logs will be written to main.log in your Galaxy Home directory. You can also use::
109
110 run.sh
111
112 During the first startup Galaxy will prepare your database. That can take some time. Have a look at the log file if you want to know what happens.
113
114 After launching galaxy is accessible via the browser at ``http://localhost:8080/``.
115
116
117
118 =======================
119 Tool Shed configuration
120 =======================
121
122 - Register a new user account in your Galaxy instance: Top Panel → User → Register
123 - Become an admin
124 - open ``universe_wsgi.ini`` in your favourite text editor (gedit universe_wsgi.ini)
125 - search ``admin_users = None`` and change it to ``admin_users = EMAIL_ADDRESS`` (your Galaxy Username)
126 - remove the ``#`` if needed
127 - restart Galaxy
128
129 ::
130
131 sh run.sh --reload
132
133
134 =======================
135 BlockClust installation
136 =======================
137
138 BlockClust will automatically download and compile all requirements,
139 like EDeN, samtools and so on. It can take up to 1-2 hours.
140
141
142 Installation via webbrowser
143 ===========================
144
145 - go to the `admin page`_
146 - select *Search and browse tool sheds*
147 - Galaxy test tool shed > Sequence Analysis > blockclust_workflow
148 - install
149
150 .. _admin page: http://localhost:8080/admin
151
152
153 ===============
154 Troubleshooting
155 ===============
156 You can navigate to the blockclust_workflow repository in your browser and repair manually:
157 Top Panel → Admin → Manage installed tool shed repositories → blockclust_workflow → Repository Actions → Repair repository
158
159 ------
160
161
162 On slow computers and during the compilation of large software libraries, like R,
163 the Tool Shed can run into a timeout and kills the installation.
164 That problem is known and should be fixed in the near future.
165
166 If you encouter a timeout or 'hung' during the installation you can increase the ``threadpool_kill_thread_limit`` in your universe_wsgi.ini file.
167
168
169 ------
170
171 **Database locking errors**
172
173 Please note that Galaxy per default uses a SQLite database. Sqlite is not intended for production use.
174 With multiple users or complex components, like that workflow, you will see database locking errors.
175 We highly recommend to use PostgreSQL for any kind of production system.
176
177
178 .. _Galaxy wiki: http://wiki.galaxyproject.org/
179
180
181 Workflows
182 =========
183
184 The BlockClust workflow is located in the `Tool Shed`::
185
186 http://toolshed.g2.bx.psu.edu/view/rnateam/blockclust_workflow
187
188 To import successfully installed the workflow to all your users you need to go to the admin panel, choose the worklow and import it.
189 For more information have a look at the Galaxy wiki::
190
191 http://wiki.galaxyproject.org/ToolShedWorkflowSharing#Finding_workflows_in_tool_shed_repositories
192
193 Please **note** that Galaxy per default uses a SQLite database. Sqlite is not intended for production use.
194 With multiple users or complex components, like that workflow, you will see database locking errors.
195 We highly recommend to use PostgreSQL for any kind of production system.
196
197
198
199 Sample Data
200 ===========
201
202
203
204 Citation
205 ========
206
207 If you use this workflow directly, or a derivative of it, or the associated
208 wrappers for Galaxy, in work leading to a scientific publication,
209 please cite:
210
211 Pavankumar Videm, Dominic Rose, Fabrizio Costa, and Rolf Backofen. "BlockClust: efficient clustering and classification of non-coding RNAs from short read RNA-seq profiles." Bioinformatics 30, no. 12 (2014): i274-i282.
212
213
214
215 Additional References
216 =====================
217
218
219
220 Availability
221 ============
222
223 This workflow is available on the main Galaxy Tool Shed:
224
225 http://toolshed.g2.bx.psu.edu/view/rnateam/blockclust_workflow
226
227 Development is being done on github:
228
229 https://github.com/bgruening/galaxytools/tree/master/workflows/blockclust
230
231
232 Dependencies
233 ============
234
235 These dependencies should be resolved automatically via the Galaxy Tool Shed:
236
237 * http://testtoolshed.g2.bx.psu.edu/view/iuc/package_samtools_0_1_19
238 * http://testtoolshed.g2.bx.psu.edu/view/iuc/package_r_3_0_1
239 * http://testtoolshed.g2.bx.psu.edu/view/iuc/msa_datatypes
240 * http://testtoolshed.g2.bx.psu.edu/view/iuc/package_infernal_1_1rc4
241 * http://testtoolshed.g2.bx.psu.edu/view/rnateam/blockbuster
242 * http://testtoolshed.g2.bx.psu.edu/view/bgruening/package_eden_1_1
243 * http://testtoolshed.g2.bx.psu.edu/view/iuc/package_mcl_12_135