forked from AnnaSyme/testlab
-
Notifications
You must be signed in to change notification settings - Fork 1
Dev aina #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
tojoa10
wants to merge
94
commits into
AGRF:main
Choose a base branch
from
tojoa10:dev_aina
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Dev aina #1
Changes from all commits
Commits
Show all changes
94 commits
Select commit
Hold shift + click to select a range
6caa409
Fixed spelling and local rendering issue
tojoa10 44dfaf5
Added a file to gitignore
tojoa10 b109fcb
Add QIIME2 PCoA tool to beta diversity section
tojoa10 a725e8b
Add QIIME2 feature-table summarize tool to microbial section
tojoa10 114ec2c
Add Krona visualization tool
tojoa10 56dd0b2
Visualise microbial abundance using heatmaps and taxonomy bar plots
tojoa10 5681040
added picrust2(functional annotation
tojoa10 090d6f3
Add metadata filtering tool
tojoa10 c4a13b5
Add RNASeq section and update intro grid layout
tojoa10 e716ae9
Updated gitignore
tojoa10 b6fd5d1
Add RNASeq section
tojoa10 f4f03ed
Add RNASeq tools section with QC and alignment
tojoa10 063ec75
Add RNASeq gene quantification step (featureCounts)
tojoa10 9f3c9d8
Add RNASeq filtering step
tojoa10 0673b17
Add normalization step to RNASeq workflow
tojoa10 08e1600
correction
tojoa10 ba2a861
Add input details for STAR alignment tool
tojoa10 7fdeaa6
Add edgeR and DESeq2 inputs
tojoa10 15819b2
Add Metagenomics section to lab
tojoa10 725a726
Add assembly + QC
tojoa10 5d40048
Add binning to metagenomics workflow
tojoa10 343fb93
Add binning to metagenomics workflow
tojoa10 90882ef
Add taxonomic classification to metagenomics workflow
tojoa10 829ebc9
Complete metagenomics workflow with inputs, outputs and tool links
tojoa10 74f0c0f
Updated RNA-seq section with tool links and inputs.
tojoa10 135b3ee
Refined RNA-seq workflow and fixed visualization section.
tojoa10 cd5700c
Refine metagenomics workflow and add inputs
tojoa10 1a7ef45
Refine microbial workflow and fix tool links
tojoa10 0501ddf
Add microbial workflow with diversity analysis and tools
tojoa10 ca42029
added trim galore
tojoa10 2cff7ff
Added Sortmerna to remove rRNA reads.
tojoa10 c792cce
Added Samtools Sort
tojoa10 2ddb087
Added StringTie
tojoa10 328692b
GBS analysis using ustacks, cstacks, sstacks, tsv2bam, gstacks, and p…
tojoa10 8d53eb1
Added tutorial section for RNASeq with transcriptomics GTN link.
tojoa10 ba69e09
Improved RNASeq tools section with inline descriptions
tojoa10 c9d7481
Improved Metagenomics tools section with inline description
tojoa10 f60f61b
Add GBS section with STACKS workflow, inputs/outputs, and tutorials
tojoa10 d7877ad
updated intro.md
tojoa10 5a37b9a
Added Learn Galaxy section with training resources and tutorials
tojoa10 2c90d23
Updated microbial tutorial descriptions
tojoa10 0e46bdf
Updated RNA-seq tutorials description
tojoa10 7068b5c
Improved metagenomics tutorials description
tojoa10 6a8b86f
Add Bakta to metagenomics tools
tojoa10 acc95de
Add MAFFT, BLAST, and FastTree tools to microbial section
tojoa10 c3d7f8a
updated base.yml
tojoa10 4d3cd2e
Add RNASeq overview, outputs, and file descriptions
tojoa10 d237038
Add Metagenomics overview and structured output tables
tojoa10 2d05417
Added GBS (ddRADSeq) overview section with workflow description and o…
tojoa10 3ec900a
Refined RNA-seq overview formatting and consistency
tojoa10 d782397
Update GBS section with tools, outputs, and bcftools filtering
tojoa10 7af23e9
Add RNA-seq overview section
tojoa10 0bec3c3
Add metagenomics overview section
tojoa10 f66a798
Add GBS overview section
tojoa10 ab1c1d2
Clarify GBS tool input/output descriptions
tojoa10 dd8310c
Fix base.yml tracking and remove old_base.yml
tojoa10 af6585f
Add base.yml and update gitignore
tojoa10 652a6d0
Clarify GBS tool input and output descriptions
tojoa10 534998b
update gbs.yml
tojoa10 9027053
Improve microbial workflow descriptions and tool clarity
tojoa10 0a82f07
Improve RNA-seq section structure, descriptions, and tool links
tojoa10 5e3d1d7
Improve metagenomics workflow descriptions and tool organisation
tojoa10 0ed091f
updated RNA section
tojoa10 c402d5d
Improved GBS workflow descriptions by clarifying VCF outputs and vari…
tojoa10 28b44f3
Improve metagenomics workflow with taxonomic profiling support
tojoa10 7a9d813
Changed minor errors in microbial
tojoa10 dda258e
minor update
tojoa10 a64ed46
Corrected button link for metaSPAdes
tojoa10 cd87580
Improve GBS input file documentation
tojoa10 5dd0990
Create Help and Contact section
tojoa10 a160935
Clean up Import Data section
tojoa10 ccbbfe9
Simplify intro and footer content
tojoa10 51170f5
Remove old help section and contributor file
tojoa10 271a835
Update base.yml section references
tojoa10 6e5b0dc
Improve microbial tool descriptions and outputs
tojoa10 3702927
Improve RNA-seq tool descriptions and outputs
tojoa10 3e571de
Improve microbial section descriptions and outputs
tojoa10 0cfbc52
Improve metagenomics section descriptions and outputs
tojoa10 c772051
Improve GBS section descriptions and outputs
tojoa10 5586d58
Minor corrections in GBS section
tojoa10 44e6a65
Revise sequencing quality control section and update QC tools
tojoa10 5e7f7ff
Improve metagenomics tool descriptions and outputs
tojoa10 f5d98f7
Improve GBS tool descriptions and outputs
tojoa10 8228014
Improve import data section descriptions and links
tojoa10 80d4801
Improve quality control section descriptions and workflows
tojoa10 ce6fb27
Add support and service notice to landing page
tojoa10 78b9a55
Update Microbial Profiling section and workflow support guidance
tojoa10 de12506
Refine microbial profiling and QC content
tojoa10 bd5bf52
Split metagenomics assembly tools by read type
tojoa10 b0755eb
Refine RNA-seq workflow and replace featureCounts section
tojoa10 c967938
Update microbial section content
tojoa10 54dd6a2
Update AGRF Lab homepage links
tojoa10 1f5bae0
Update site navigation
tojoa10 67a6028
Add Genome Assembly section
tojoa10 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,4 @@ | ||
| python-3.14.3-amd64 (1).exe | ||
| venv | ||
| venv311 | ||
| .DS_Stores |
This file was deleted.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,43 +1,37 @@ | ||
| id: import | ||
| title: Import data | ||
|
|
||
| tabs: | ||
| - id: overview | ||
| title: Overview | ||
| heading_md: | ||
| heading_md: | ||
|
|
||
| content: | ||
|
|
||
| - title_md: Get AGRF data into Galaxy | ||
| description_md: | | ||
| * Find your fastq data at the [AGRF Hub](https://www.agrf.org.au/my-agrf-hub). | ||
| * Download this data to your computer. | ||
| * Upload data from your computer to Galaxy. | ||
| * Tip: only import the files that you need. | ||
| * Find your FASTQ data at the [AGRF Hub](https://www.agrf.org.au/my-agrf-hub). | ||
| * Download the required files to your computer. | ||
| * Upload data from your computer into Galaxy. | ||
| * Tip: only import the files that you need for your analysis. | ||
|
|
||
| *Note: Galaxy and the AGRF Hub are not configured for long-term data storage. Ensure raw and processed data are backed up to a secure archive.* | ||
|
|
||
| *Note: Galaxy and the AGRF Hub are not configured for long-term data storage. Ensure you save raw and processed data to a secure archive.* | ||
| button_md: Upload data | ||
| button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=upload1" | ||
|
|
||
| - title_md: Compress large files | ||
| description_md: | | ||
| * After uploading data, we recommend compressing <code>fastq</code> and <code>fasta</code> files. | ||
| * This will save you time and storage space. | ||
| * After uploading data, we recommend compressing <code>FASTQ</code> and <code>FASTA</code> files. | ||
| * Compression reduces storage usage and can improve upload and download performance. | ||
|
|
||
| button_md: Compress data | ||
| button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=https://toolshed.g2.bx.psu.edu/repos/iuc/compress_file" #does this need additional text eg %2Frepos% etc? | ||
| button_link: "https://usegalaxy.org.au/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fcompress_file%2Fcompress_file%2F0.1.0&version=latest" | ||
|
|
||
| # - title_md: Learn how to use Galaxy | ||
| # description_md: | | ||
| # * Galaxy Australia is free to use. | ||
| # * Get started with these user-friendly tutorials. | ||
| # * Galaxy Australia is free to use. | ||
| # * Get started with these user-friendly tutorials. | ||
| # | ||
| # button_md: Galaxy tutorials | ||
| # button_link: "https://training.galaxyproject.org/" | ||
|
|
||
| - id: help | ||
| title: Help | ||
| heading_md: | ||
| content: | ||
|
|
||
| - title_md: Galaxy Australia support | ||
| description_md: Request support, new tools, or additional storage. | ||
| button_md: Request support | ||
| button_link: /request/support | ||
| - title_md: AGRF support | ||
| description_md: Contact AGRF for more help with your data. | ||
| button_md: Contact AGRF | ||
| button_link: /request/support | ||
| # button_link: "https://training.galaxyproject.org/" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,167 @@ | ||
| id: gbs | ||
| title: GBS | ||
|
|
||
| tabs: | ||
| - id: overview | ||
| title: Overview | ||
| content: | ||
| - title_md: About the service | ||
| description_md: | | ||
| Genotyping-by-sequencing (GBS) is used to identify genetic variation across multiple samples by sequencing reduced representation regions of the genome generated through restriction enzyme digestion. | ||
|
|
||
| Library preparation is performed using a ddRAD-based protocol, which includes digestion of genomic DNA with restriction enzymes, ligation of barcoded adapters, size selection of fragments, and PCR amplification to generate sequencing-ready libraries. | ||
|
|
||
| Sequencing data is processed using the DeNovoGBS pipeline from NGSEP software. Raw sequencing reads are demultiplexed into individual samples based on barcode sequences. Reads are clustered based on sequence similarity, and variants are detected within each cluster using a probabilistic model. | ||
|
|
||
| Quality control metrics are generated throughout the analysis, including read counts per sample, clustering statistics, and similarity between samples. | ||
|
|
||
| The final outputs include variant calls, consensus sequences for each cluster, demultiplexed sequencing reads, and quality control visualisations, enabling downstream population genetics and diversity analyses. | ||
|
|
||
| - title_md: Sequencing Output | ||
| description_md: | | ||
| **Number of samples:** 95 samples following demultiplexing | ||
|
|
||
| Raw sequencing reads are provided as compressed FASTQ files for each individual sample after barcode-based demultiplexing. | ||
|
|
||
| - title_md: What files are included? | ||
| description_md: | | ||
|
|
||
| | Filename | Description | | ||
| |----------|------------| | ||
| | *.fastq.gz | Demultiplexed sequencing reads for each sample | | ||
| | *_variants.vcf | Variant calls across all samples | | ||
| | *.fa | Consensus sequences for each read cluster | | ||
| | QC figures (.jpg) | Quality control visualisations (read counts, clustering, IBS heatmap) | | ||
|
|
||
| - title_md: File formats used | ||
| description_md: | | ||
|
|
||
| | Type | Description | | ||
| |------|------------| | ||
| | .fastq.gz | Demultiplexed sequencing reads per sample | | ||
| | .vcf | Variant call format for genotype data | | ||
| | .fa | FASTA format for consensus sequences | | ||
| | .jpg | Quality control figures | | ||
|
|
||
| - id: tools | ||
| title: Tools | ||
| content: | ||
| subsections: | ||
| - id: stacks | ||
| title: STACKS workflow | ||
| content: | ||
|
|
||
| - title_md: <code>ustacks</code> - Build loci for each sample | ||
| description_md: | | ||
| Assemble short-read sequences into putative loci for each sample and identify SNPs within stacks of matching reads. This is the first step in the STACKS workflow and is used to build loci from raw GBS or ddRAD sequencing reads. | ||
|
|
||
| Output: Produces loci assemblies, SNP calls, haplotype information, and model files for each sample (.tags.tsv.gz, .snps.tsv.gz, .alleles.tsv.gz, .models.tsv.gz). | ||
| inputs: | ||
| - label: Demultiplexed FASTQ reads for each sample | ||
| datatypes: | ||
| - fastqsanger | ||
| outputs: | ||
| - label: Sample loci and SNP TSV files generated by ustacks | ||
| button_md: Run ustacks | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks_ustacks%2Fstacks_ustacks" | ||
|
|
||
| - title_md: <code>cstacks</code> - Create catalog of loci | ||
| description_md: | | ||
| Create a catalogue of consensus loci by combining loci identified across multiple samples. This catalogue acts as a reference set of loci for downstream matching and population analysis. | ||
|
|
||
| Output: Produces catalogue loci and SNP information shared across samples (.catalog.tags.tsv.gz, .catalog.snps.tsv.gz). | ||
| inputs: | ||
| - label: Sample loci files from ustacks (sample.tags.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| outputs: | ||
| - label: Catalog of loci and SNP TSV files generated by cstacks | ||
| button_md: Run cstacks | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks_cstacks%2Fstacks_cstacks" | ||
|
|
||
| - title_md: <code>sstacks</code> - Match samples to catalog | ||
| description_md: | | ||
| Match loci from individual samples to the catalogue of consensus loci generated by cstacks. This step identifies shared loci between samples and the catalogue for downstream population analysis. | ||
|
|
||
| Output: Produces catalogue matching files describing loci matches for each sample (.matches.tsv.gz). | ||
| inputs: | ||
| - label: Sample loci files from ustacks (sample.tags.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| - label: Catalog loci file from cstacks (catalog.tags.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| outputs: | ||
| - label: Matches-to-catalog TSV files generated by sstacks | ||
| button_md: Run sstacks | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks2_sstacks%2Fstacks2_sstacks" | ||
|
|
||
| - title_md: <code>tsv2bam</code> - Convert TSV to BAM | ||
| description_md: | | ||
| Convert STACKS locus information into BAM format while incorporating population map information. This step prepares aligned loci information for downstream genotype calling and population genetics analysis. | ||
|
|
||
| Output: Produces BAM alignment files and associated population mapping information (.bam). | ||
| inputs: | ||
| - label: Loci and polymorphism files from STACKS (*.tags.tsv.gz, *.snps.tsv.gz, *.alleles.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| - label: Catalog loci file from cstacks (catalog.tags.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| - label: Matches-to-catalog files from sstacks (sample.matches.tsv.gz) | ||
| datatypes: | ||
| - tabular | ||
| outputs: | ||
| - label: BAM alignment files generated by tsv2bam | ||
| button_md: Run tsv2bam | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks2_tsv2bam%2Fstacks2_tsv2bam" | ||
|
|
||
| - title_md: <code>gstacks</code> - Assemble loci and call variants | ||
| description_md: | | ||
| Assemble loci, align reads, and perform SNP and genotype calling across all samples in the population. This step generates population-level variant information from aligned STACKS data. | ||
|
|
||
| Output: Produces genotype calls, assembled loci, and population-level SNP information (.vcf, .tsv.gz). | ||
| inputs: | ||
| - label: BAM alignment files generated by tsv2bam | ||
| datatypes: | ||
| - bam | ||
| outputs: | ||
| - label: Variant calls (VCF) and assembled loci generated by gstacks | ||
| button_md: Run gstacks | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks2_gstacks%2Fstacks2_gstacks" | ||
|
|
||
| - title_md: <code>populations</code> - Population genetics analysis | ||
| description_md: | | ||
| Perform population genetics analysis and export filtered variant datasets for downstream analysis. This tool calculates population statistics, applies locus filtering, and prepares data for diversity and population structure analysis. | ||
|
|
||
| Output: Produces filtered SNP datasets, population statistics, and export files for downstream analysis (.vcf, .tsv, .structure, .phylip). | ||
| inputs: | ||
| - label: Variant calls and assembled loci generated by gstacks | ||
| datatypes: | ||
| - vcf | ||
| outputs: | ||
| - label: Population statistics and export files generated by populations | ||
| button_md: Run populations | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fstacks2_populations%2Fstacks2_populations" | ||
|
|
||
| - title_md: <code>bcftools filter</code> - Filter variant data | ||
| description_md: | | ||
| Filter variant call files (VCF) based on minor allele frequency (MAF), missing data thresholds, depth, and quality metrics. This step helps retain high-confidence variants for downstream population genetics and diversity analysis. | ||
|
|
||
| Output: Produces filtered variant datasets suitable for downstream analysis (.vcf). | ||
| inputs: | ||
| - label: Variant call file (VCF) generated by NGSEP, gstacks, or another variant calling workflow | ||
| datatypes: | ||
| - vcf | ||
| outputs: | ||
| - label: Filtered VCF file for downstream population analysis | ||
| button_md: Launch Tool | ||
| button_link: "{{ galaxy_base_url }}/?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fbcftools_filter%2Fbcftools_filter" | ||
|
|
||
| - id: tutorials | ||
| title: Tutorials | ||
| content: | ||
| - title_md: GBS analysis tutorials | ||
| description_md: Explore Galaxy Training Network tutorials for variant analysis workflows relevant to GBS and population genomics studies. | ||
| button_md: Tutorials | ||
| button_link: https://training.galaxyproject.org/training-material/topics/variant-analysis/ | ||
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similarly to tool inputs, please also add brief descriptions of tool output files and filenames
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Check the tool descriptions in each section to ensure they have output file descriptions