GitHub - vika19/dimspy_workflow: conversion of galaxy workflow to CLI

Welcome to the NCGAS workflow conversion for dimspy!

1. Setting up your email

Please set up your email address. You can do this for all jobs at once by running:

find . -name Run* -exec sed -i 's/[email protected]/EMAIL/g' {} \;

where EMAIL is your email address.

what this command means:

find software to find files matching a pattern
. search within this entire folder and all subfolders
-name Run* search for files with names that begin with "Run", which is what I name all the job scripts.
-exec run this command for each file found
sed -i sed is a program to find and replace text, -i makes it write in place
's/A/B/g' substitute A for B throughout the file
{} ; the found file names will be replaced here

So, this line is replacing all instances of "[email protected]" with your actual email!

2. Please link your input file directory

ln -s /location/of/your/data input_data

where /location/of/your/data is the location of your data. This can be found by navigating to the folder with your data, and typing pwd, then enter. Then navigate back to the dimspy directory and run the ln. You MUST be in the dimspy folder when you run this command!

This will create a shortcut to your raw files in a directory that the workflow will find. Make sure you have a file named filename.txt in this folder that describes the files as before.

3. Check the samples.conf file!

This is the set up file for the jobs. The first couple lines require full paths - please update these to match where you are running the data. Running the command "pwd" in the same directory as samples.conf will give you the full path for your current location.

This file also allows you to edit different parameters that are used across the jobs at once. So if you have more replicates, make sure to change that line. If you want to adjust the ppm setting, change that line. These will be referenced by each job, so changes here will change across the full workflow.

4. Run it!

After your email is entered, your input data is linked in, and you have set up the config file, you are ready to run! Simply do the following:

cd 1a_ProcessScans

qsub RunProcessScans.sh

This will output a report to the reports folder, and automatically launch subsequent jobs. You can restart the pipeline at any point by going to the desired step's directory and submitting the job in that folder. It will then re-run all steps starting at that point automatically.

If you have questions, please feel free to email [email protected]!

WRITTEN 2021 by Sheri Sanders

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
1a_ProcessScans		1a_ProcessScans
1b_RepFilter		1b_RepFilter
2_AlignSamples		2_AlignSamples
3_BlankFilter		3_BlankFilter
4_SampleFilter		4_SampleFilter
5_MissingVals		5_MissingVals
6_GetPeaksAndConvert		6_GetPeaksAndConvert
LICENSE.md		LICENSE.md
reports		reports
README.md		README.md
samples.conf		samples.conf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Welcome to the NCGAS workflow conversion for dimspy!

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

vika19/dimspy_workflow

Folders and files

Latest commit

History

Repository files navigation

Welcome to the NCGAS workflow conversion for dimspy!

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages