ELPH
ELPH
ELPH is a bioinformatics program designed to perform Gibbs Sampling on DNA and protein sequence data in order to find patterns and motifs in the sequences. The program can handle as many as thousands of sequences at a time.
Using ELPH on RCC Resources
Serially Running ELPH
The gnu module is required to run ELPH. To load the module, use the command:
module load gnu
To run ELPH from the command line, follow the format elph [FILES] -[OPTIONS]. R
eplace the[FILES]
argument with your sequence file(s). A list of options can be found either by simply typing elph
with no arguments or on the user manual for ELPH.
As a short example, a very simple run with a test file in FASTA format could be run as follows:
elph TEST.fa LEN=10 -o OUTFILE.txt
Running ELPH in Parallel
ELPH can also be run in parallel, and requires the GNU OpenMPI module. To load the module, use the command:
module load gnu-openmpi
You can then run the program through submitting a Slurm script to HPC. Your script file must use the .sh extension. Below is an example script using the same run as the serial example:
#! /bin/bash
#SBATCH -J ELPH_TEST
#SBATCH -n 4
#SBATCH -p genacc_q
#SBATCH -t 00:10:00
#SBATCH --mail-type=ALL
module load gnu openmpi
elph TEST.fa LEN=10 -o OUTFILE.txt
For detailed usage documentation for ELPH, refer to the official website.