Contig cluster assignments for all ESBL-gene-containing contigs, as determined by cd-hit

btESBL_contigclusters

Format

A tidy data frame with 714 rows and 10 variables:

id

Contig identifier

clstr_size

Number of contigs in cluster

length

Contig length, bases

clstr_rep

Whether this is the cd-hit cluster representative sample (1 = yes, 0 = no)

clstr_iden

Identity (%) of sample to cluster representative sample

clstr_cov

Coverage (%) of sample to cluster representative sample

gene

ESBL gene

clstr_name

cd-hit cluster identifier: gene.n, where n = 1, 2, ..., N and N is the total number of clusters for the given gene

lane

Unique sample-sequencing run ID

species

Species of sample
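
Example

The gene.n naming convention for clstr_name and the 1/0 coding of clstr_rep can be illustrated with a minimal Python sketch. The rows below are hypothetical and not taken from btESBL_contigclusters; only the column names come from the description above. The helper split_clstr_name is an illustrative name, and splitting on the last dot is an assumption made so that gene names containing dots would still parse.

```python
from collections import Counter

# Hypothetical rows for illustration only; not real records from the dataset.
rows = [
    {"id": "contig_a", "gene": "CTX-M-15", "clstr_name": "CTX-M-15.1", "clstr_rep": 1},
    {"id": "contig_b", "gene": "CTX-M-15", "clstr_name": "CTX-M-15.1", "clstr_rep": 0},
    {"id": "contig_c", "gene": "CTX-M-15", "clstr_name": "CTX-M-15.2", "clstr_rep": 1},
    {"id": "contig_d", "gene": "SHV-12", "clstr_name": "SHV-12.1", "clstr_rep": 1},
]


def split_clstr_name(clstr_name):
    """Split a 'gene.n' identifier into (gene, n), using the last dot."""
    gene, _, n = clstr_name.rpartition(".")
    return gene, int(n)


# Number of contigs per cluster (what clstr_size describes).
contigs_per_cluster = Counter(r["clstr_name"] for r in rows)

# Highest cluster index seen per gene, i.e. N in the gene.n convention.
clusters_per_gene = {}
for r in rows:
    gene, n = split_clstr_name(r["clstr_name"])
    clusters_per_gene[gene] = max(clusters_per_gene.get(gene, 0), n)

# Representative contig of each cluster (rows with clstr_rep == 1).
representatives = {r["clstr_name"]: r["id"] for r in rows if r["clstr_rep"] == 1}

print(contigs_per_cluster)
print(clusters_per_gene)
print(representatives)
```

On the real data, the per-cluster counts computed this way would be expected to match clstr_size only if every contig in a cluster appears as a row of the table.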