Base for the Viral Genomics and Bioinformatics Repository
We are pleased to announce the next Viral Genomics and Bioinformatics in collaboration with COVID-19 Genomics (COG) Train (COG-Train).
Viruses are a huge public health burden all around the world. In Asia, viral infections such as influenza, dengue fever, hand, foot and mouth disease, chikungunya, and viral hepatitis kill hundreds of thousands of people each year. Apart from health issues, emerging viral illnesses and epidemics have a global economic impact. According to the World Bank, the SARS-Cov-2 pandemic caused a more than $10 trillion deficit in the global economy in 2020-2021 alone. This demonstrates the impact of viral illnesses not only on human and animal health, but also on social and economic growth of the world.
Technological advances in science and research are making it possible to rapid detection, surveillance and management of viral diseases. In recent years, next generation sequencing (NGS) technologies have played an important role in the identification and classification of viruses, in the detection of drug resistance mutations and in the treatment and surveillance of viral diseases. Early identification of a virus and quick analysis of its genome will aid towards better treatment and help in controlling the disease spread. Along with the advances in the sequencing technologies, it is necessary to develop data management methods, computational analysis tools and human expertise to cope with the sequence deluge. The number of researchers able to interpret viral high throughput sequence data is currently limited.
The Viral Genomics and Bioinformatics course will teach participants how to analyse viral genome sequences and interpret genomic data obtained by high-throughput sequencing. The use of the command line to analyse viral NGS data, quality control of sequences, reference mapping and de novo assembly of viral genomes, pathogen identification from metagenomics data, and creating phylogenetic trees will be covered in the course.
Module 1 - Introduction to Linux + Module 2 - Linux text processing
PDF Manual - Introduction to Linux + Text processing
PDF Lectures - Introduction to Linux + Text processing
PDF Solutions - Introduction to Linux + Text processing
Module 3 - Introduction to NGS sequencing
Online Manual - Introduction to NGS
Module 4 - NGS file formats and data & QC
Online Manual - NGS data Quality Control QC
Module 5 - Introduction to BASH scripting
PDF Manual - Bash Scripting
PDF Answers - Bash Scripting.
Answer Scripts Folder - Bash Scripting
Module 6 - Reference Alignment
Online Manual - Reference Alignment
Module 7 - Coverage Plots and Stats
Online Manual - Coverage Plots and stats
Module 8 - Consensus & variant calling
Online Manual - Consensus & variant calling
Module 9 - De Novo assembly
PDF Manual - De Novo assembly
Module 10 - Pathogen sequence detection using metagenomics
Online Manual - Metagenomics
Module 11 - Introduction to phylogenetic methods
Online Manual - Phylogeny and Virus Typing
PDF Phylogeny Help Doc
Module 12 - SARS-CoV-2 workflows
Online Manual - SARS-CoV-2 Reference Alignment
Online Manual - SARS-CoV-2 Lineages and Mutations
Online Manual - SARS-CoV-2 Phylogenetics
Online Manual - SARS-CoV-2 Group Practical
Appendix
PDF - ALT Learning Outcomes Session 2 Slides
PDF - Answers ALT Learning Outcomes Exercise
Software list used in course Virtual Machine
Any reuse of the course materials, data or code is encouraged with due acknowledgement.
This work is licensed under a Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).