Data: Contig cluster multiple sequence alignment .paf files — btESBL_contigclusters_msa_paf

Multiple sequence alignments of ten largest contig clusters, generated using minimap2, mapping all cluster sequences to the longest sequence (i.e. the cluster representative sequenc from cd hit).

btESBL_contigclusters_msa_paf_files

Format

A named list of ten data frames, where name is cluster id. Columns are:

qname: Query sequence name
qlen: Query sequence length, bases
qstart: Query start coordinate (0-based)
qend: Query end coordinate (0-based)
strand: ‘+’ if query/target on the same strand; ‘-’ if opposite
tname: Target sequence name
tlen: Target sequence length, bases
tstart: Target start coordinate on orignal strand
tend: Target end coordinate on orignal strand
nmatch: Number of matching bases in the mapping
alen: Number of bases, including gaps, in the mapping
mapq: Mapping quality (0-255, with 255 if missing)