Software Category


ELPH is a bioinformatics program designed to perform Gibbs Sampling on DNA and protein sequence data in order to find patterns and motifs in the sequences. The program can handle as many as thousands of sequences at a time.

Using ELPH on RCC Resources

Serially Running ELPH 

The gnu module is required to run ELPH. To load the module, use the command: 

module load gnu

To run ELPH from the command line, follow the format elph [FILES] -[OPTIONS]. Replace the[FILES] argument with your sequence file(s). A list of options can be found either by simply typing elph with no arguments or on the user manual for ELPH.

As a short example, a very simple run with a test file in FASTA format could be run as follows:

elph TEST.fa LEN=10 -o OUTFILE.txt

Running ELPH in Parallel

ELPH can also be run in parallel, and requires the GNU OpenMPI module. To load the module, use the command: 

 module load gnu-openmpi 

You can then run the program through submitting a Slurm script to HPC. Your script file must use the .sh extension. Below is an example script using the same run as the serial example:  

#! /bin/bash

#SBATCH -n 4
#SBATCH -p genacc_q
#SBATCH -t 00:10:00
#SBATCH --mail-type=ALL

module load gnu openmpi
elph TEST.fa LEN=10 -o OUTFILE.txt

For detailed usage documentation for ELPH, refer to the official website