Nephele User Guide

How to submit a job

Thank you for choosing Nephele. Here you will find instructions on how to submit a successful job. If you would like to try submitting with a small dataset, below are the sample fastq files and a mapping file.

Note: These data are from a 2017 BioProject in NCBI entitled "Experimental Microbial Dysbiosis Does Not Promote Disease Progression in SIV-Infected Macaques" (

File (Type) Size Description Applicable pipelines
Sequences 72MB Contains paired-end data (forward and reverse) from 10 samples sequenced on an Illumina MiSeq Pre-process (QC), Analyze (Metagenomics and Viral)
Mapping File (Excel) 9KB Metadata file used in Nephele job submissions that describes samples, groups, etc. for analysis Pre-process (QC), Analyze (Metagenomics and Viral)
Biom 252KB Contains abundance and taxonomy assignments. It is generated by the analysis pipelines Downstream Analysis
Tree 25KB It is a rooted phylogenetic tree in newick format Downstream Analysis

You can follow along with this user guide using the sample data.

Step 1: Select analysis type.

Please select the analysis you are interested in. Nephele provides amplicon analysis (16S, ITS), WGS metagenomics analysis and Viral (SARS-CoV-2 mapping).

Screenshot of analysis type select panel with Amplicon and WGS type selection buttons
Step 2: Select data type (demultiplexed file only)

Nephele supports the following data types for analysis types.

  • 16S
    • demultiplexed paired end fastq (select this option if you are using the sample data)
    • demultiplexed single end fastq
  • ITS
    • As ITS supports only demultiplexed paired end fastq, the system will assume the data type and proceed to next page
  • WGS
    • demultiplexed paired end fastq
    • demultiplexed single end fastq

Select the type appropriate to your data (e.g. Paired End FASTQ) and click “Next”.

Step 3: Select upload method.

Nephele provides five methods to upload input files: my computer, Google Drive, BaseSpace, Globus and FTP. Upload from my computer allows you to upload files from your local computer. This method is suitable for up to 2GB per fastq or fastq.gz file per file. We recommend that you upload compressed .gz files for performance. Select this option if you are using a sample dataset with small files. If you have a large data set, we recommend opting to use Globus, FTP, Google Drive or BaseSpace method.

Screenshot of upload options panel with from local, Globus, Google Drive, BaseSpace and FTP buttons
  • Upload from my computer
    1. Click "+ Add files" and select fastq or fastq.gz files or Drag and drop the files.
    2. Click Start upload to start upload your input files.
      *Dependent on your network, the upload speed and total time can vary. We recommend stable and high-speed internet for this method.
    3. After completing the uploads, click Next.
    4. You can cancel the upload anytime by clicking the "Cancel upload" button.
    5. You can select all files by clicking "Select all."
    6. If you wish to delete files, you can delete them by clicking the "Delete selected" button.
    Screenshot of local upload panel with file uploads in progress
  • Google Drive: We recommend that you organize all of your input files in one folder ahead of time. During upload you will select the folder and all fastq files in that folder will be available for input. Proceed to the next step of uploading the mapping file. Nephele will proceed to import all the fastq files that correspond to the files indicated on the mapping file.
  • BaseSpace: Nephele will display all available Projects in BaseSpace. During upload you will select the Project you wish to use, Nephele will find all the fastq files associated with that Project. Proceed to the next step of uploading the mapping file. Nephele will proceed to import all the fastq files that correspond to the files indicated on the mapping file.
  • Globus (recommended for large data). Please see tutorial here.
  • Public FTP (recommended for large data)
    • If you are not familiar with using FTP, please see this guide.
      1. Upload your files to publicly accessible FTP.
      2. Copy the ftp URL. The URL should point to a location of a folder that contains fastq or fastq.gz files.
        Note: please make the folder with only files without nested folders.
      3. Select Upload via FTP from the upload options page
      4. Paste the URL in the text box.
        Note: please enter the URL as exactly it is. If there is a space in the folder name, you can put a space in the path.
      5. Click Upload.
    • Nephele FTP method can retrieve data from publicly available FTP that does not require a username or password. (Nephele FTP method does not support Google Drive or Box currently. Read more in the FAQ).
Step 4: Upload mapping files

After uploading your input files, Nephele pipelines require a mapping file. To upload your mapping file, simply click Browse, select your mapping file, and click Upload. Upload the sample mapping file (xx.xlsx).

Nephele validates your mapping file instantly so that you can correct any error interactively. In order to prevent any mapping file errors, we recommend that you start with mapping file templates provided on the map file upload page for the pipeline you're running.

Step 4.1: How to use Nephele interactive mapping file validation

If you have an error in your mapping file, for example, a typo in your fastq file name column, here is an example of how to correct it using the interactive error page.

  • On the validation error page, move your mouse over to the area that is highlighted in red.
  • This example shows that the file names in both ForwardFastqFile and the ReverseFastqFile column names are the same: "3_S3_L001_R1_001s.fastq". There is a typo on the ReverseFastqFile column.
    Screenshot of map file error page with two columns in first row highlighted and error displayed
  • Click on the ReverseFastqFile column and correct "3_S3_L001_R1_001s.fastq" to "3_S3_L001_R2_001s.fastq."
    Screenshot of map file error page with two columns in first row highlighted, editing second column
  • Click "Save & Retry."
  • The mapping file passes the validation and moves to the next page, Select Pipelines, automatically.

Notes on map file validation:

  1. "BarcodeSequence" and "LinkerPrimerSequence" columns are no longer required by Nephele pipelines. Please remove them if you are using older mapping files.
  2. The file names listed in the mapping file should exactly match the names of the files you have uploaded for analysis. If you have uploaded *.fastq.gz files, make sure to add the .gz extension to the file names in the mapping file. Do not include full file paths.
  3. Nephele 2 accepts tab-delimited text (.txt) files and Excel (.xlsx or .xls) files. Comma-separated (.csv) files are not supported.
Step 5: Select a pipeline

After uploading mapping file and pass the validation, you can select a pipeline. Simply click "Select" on the pipeline you wish to use. Select QIIME1 if you use the sample data.
You can learn more about pipelines in Nephele in the Nephele Pipelines section.

Screenshot of the pipeline selection page
Step 6: Update parameters (optional)

Once you select a pipeline, you will see a submission page. You can enter a description of your job so that you can identify different jobs easily.

Screenshot of Job Details tab of pipeline options page

On this page, there are different tabs such as "Pre-processing" and "Analysis" (dependent on the pipeline) which allow you to change the parameters the pipeline is run with. Don’t forget to have a look if you are interested in different options. See Nephele's Pipeline Section for more details regarding parameters.

Screenshot of Pre-processing tab of pipeline options page

Finally, click Validate and Submit!

Congratulations! You have submitted a job! You will receive a Pipeline Started email shortly.

How to resubmit your job

You can resubmit a job without uploading all of your input files again.

Step 1: Enter the Nephele JobID that you would like to resubmit.

Screenshot of Home page with job resubmission box highlighted

Step 2: As long as your job hasn't expired, your input data will be retrieved. The rest of steps are the same as submitting a job. You can re-upload a mapping file and change parameters for the resubmission.