
6.2 Activity - Kickstart Project Work
6.2.1 Learning Objectives
- Import an example SRA sequencing data file by accession id (e.g. SRR)
- Perform classification of nanopore-sequenced soil MAGs with GTDB-Tk
- Import an SRA dataset you would like to analyze for your research project
- Use a new Galaxy tool that we have not worked through in class
6.2.2 Activity 1 - Import an SRA file into Galaxy
6.2.2.1 Instructions
- Upload sequencing reads for accession id SRR29980924 in Galaxy using Faster Download and Extract Reads in FASTQ tool .
- Make your Galaxy history sharable.
6.2.2.2 Questions
1. Share your Galaxy history with SRR29980924fastq data you downloaded with Faster Download and Extract Reads in FASTQ tool. |
---|
Galaxy history Link: |
2. Find and record the following features of your downloaded dataset below. |
---|
File size: |
File format: |
Job Runtime (Wall Clock): |
First line of the sequencing file: |
3. Inspect SRR29980924 entry in NCBI https://www.ncbi.nlm.nih.gov/sra. Obtain the following sequencing information. |
---|
Organism: |
Instrument: |
Strategy: |
Run: |
6.2.3 Activity 2 – Taxonomically classify nanopore-soil-subset MAGs with GTDB-Tk in Galaxy
6.2.4 Activity 3 – Import an SRA dataset of interest for your research project
6.2.4.1 Instructions
Upload sequencing reads from a dataset of interest using the Faster Download and Extract Reads in FASTQ tool. Refer back to the table in the Activity: Possible Datasets for BioProjects with known long read metagenomics datasets.
6.2.4.2 Questions
1. Record the following features of your downloaded dataset below. |
---|
Accession ID: |
File size: |
Job Runtime (Wall Clock): |
2. Inspect your SRA entry in NCBI https://www.ncbi.nlm.nih.gov/sra. Obtain the following sequencing information. |
---|
Organism: |
Instrument: |
Strategy: |
Run: |
6.2.5 Activity 4 – Use a new Galaxy tool that may be useful for your project
6.2.5.1 Instructions
Find one new Galaxy tool that you think may be useful for your research project. A few possibilities to get you started include:
- Prokka Galaxy tutorial
- Bakta Galaxy Tutorial
- antiSMASH [Galaxy tutorial](http://training.galaxyproject.org/training-material/topics/genome-annotation/tutorials/secondary-metabolite-discovery/tutorial.html
- CheckM2 Galaxy tutorial
- Tools mentioned in tutorials like GTN: Bacterial Genome Annotation
- Tools used in workflows like https://nf-co.re/mag (not all tools will be available on Galaxy)
6.2.5.2 Questions
1. Briefly describe the tool e.g. how it works, what it takes as input, and what it produces as output. |
---|
2. Describe briefly any problems you had running the tool and troubleshooting steps you tried. |
---|
3. If you were able to get the tool to run, describe the results and whether they provide additional information for your project. |
---|