Multiple sequence alignments of the ten largest contig clusters, generated with minimap2 by mapping all sequences in each cluster to the longest sequence (i.e. the cluster representative sequence from CD-HIT).

btESBL_contigclusters_msa_paf_files

Format

A named list of ten data frames, where each name is a cluster ID. Each data frame has the following columns:

qname

Query sequence name

qlen

Query sequence length, bases

qstart

Query start coordinate (0-based, closed)

qend

Query end coordinate (0-based, open)

strand

‘+’ if query and target are on the same strand; ‘-’ if on opposite strands

tname

Target sequence name

tlen

Target sequence length, bases

tstart

Target start coordinate on original strand

tend

Target end coordinate on original strand

nmatch

Number of matching bases in the mapping

alen

Number of bases, including gaps, in the mapping

mapq

Mapping quality (0-255, with 255 if missing)
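
Examples

The columns above can be combined into per-mapping summaries. The following is a minimal sketch, not part of the package: the helper name add_paf_stats and the toy values are hypothetical, and it assumes that qend is an exclusive (open) end coordinate, as in the PAF format, so that qend - qstart is the aligned query length.

```r
# Toy stand-in for one element of the list; all values are invented
# for illustration and do not come from the real data set.
paf <- data.frame(
  qname  = c("contig_a", "contig_b"),
  qlen   = c(1000L, 800L),
  qstart = c(0L, 100L),
  qend   = c(1000L, 700L),
  strand = c("+", "-"),
  tname  = c("rep_1", "rep_1"),
  tlen   = c(1200L, 1200L),
  tstart = c(50L, 300L),
  tend   = c(1050L, 900L),
  nmatch = c(990L, 540L),
  alen   = c(1000L, 600L),
  mapq   = c(60L, 60L),
  stringsAsFactors = FALSE
)

# Hypothetical helper: append two derived columns to a PAF-style data frame.
add_paf_stats <- function(df) {
  # Fraction of alignment columns (including gaps) that are matching bases
  df$identity <- df$nmatch / df$alen
  # Fraction of the query sequence covered by the mapping
  # (assumes qend is exclusive)
  df$qcov <- (df$qend - df$qstart) / df$qlen
  df
}

paf_stats <- add_paf_stats(paf)
```

If the data set is loaded, the same helper could presumably be applied to every cluster with lapply(btESBL_contigclusters_msa_paf_files, add_paf_stats), which keeps the cluster IDs as list names.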