Pangolin lineage classifications to support access to and analysis of SARS-CoV-2 sequence data.

The Pango nomenclature, whose labels are called Pango lineages, is used by researchers and public health agencies worldwide to track the transmission and spread of SARS-CoV-2, including variants of concern. Running the Pangolin tool requires conda on a macOS or Linux system and FASTA-formatted sequence data. There are two methods for lineage assignment with Pango; within NCBI Virus we use the method based on PangoLEARN, in which a classification tree is used to group similar sequences.
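
Because the tool expects FASTA-formatted input, it can help to check a file's structure before submitting it. The following is a minimal sketch of such a check, not part of Pangolin or NCBI Virus; the function name `parse_fasta` and the set of accepted characters (IUPAC nucleotide codes plus the gap symbol) are assumptions made for illustration.

```python
from typing import Dict, Iterable

# Assumption: accept IUPAC nucleotide codes plus the gap character.
VALID_BASES = set("ACGTUNRYKMSWBDHV-")


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {header: sequence}, raising ValueError on malformed input."""
    records: Dict[str, str] = {}
    header = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A header line starts a new record.
            header = line[1:].strip()
            if not header:
                raise ValueError("empty FASTA header")
            if header in records:
                raise ValueError(f"duplicate header: {header}")
            records[header] = ""
        else:
            if header is None:
                raise ValueError("sequence data before first header")
            seq = line.upper()
            bad = set(seq) - VALID_BASES
            if bad:
                raise ValueError(f"unexpected characters in {header}: {sorted(bad)}")
            # Sequences may span several lines; join them.
            records[header] += seq
    return records
```

Since an open text file yields its lines when iterated, a file can be checked with `parse_fasta(open(path))`, where `path` is a placeholder for the location of the sequence file.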