Skip to content

susanaguix/WEVTYTO

Repository files navigation

Here we present a tool to analyse enterovirus sequencing data. It was desinged focusing on wastewater samples and Oxford Nanopore data.

Briefly, VSEARCH tool is used to filter and cluster the generated raw reads and then perform a BLAST search against a custom reference database. A final Excel file per sample is generated containing information about the Enterovirus types found as well as its abundance.

ev_typing_environment.yml creates a Conda environment that contains all the required packages to run the pipeline

ev_typing_nix.py is the script to analyse those samples that were amplified following the Nix et.al., 2006 protocol targeting all Enterovirus.

ev_typing_shaw.py is the script to analyse those samples that were amplified following the Shaw et.al., 2020 protocol targeting Cluster C Enterovirus.

ev_reference_sequences.fasta.gz contains a FASTA file with reference sequences from all Enterovirus types. There are 6303 references. If preferred, a custom reference FASTA file can be used always keeping the same header structure as the provided file.

About

Wastewater Enterovirus Typing Tool

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Contributors

Languages