The four folders, whose names have "Mapped...seqs", contain amplicon sequences of the mapping parents, N. otophora TA3353, N. tomentsiformis TA3385, N. acuminata TA3460 and N. acuminata var. multiflora TA3461, which were used to identify CAPS/dCAPS. The folders "mappedCOSIIseqs" contain COSII marker sequences. “.txt” files contain two parental sequences and the COSII consensus sequence used for primer design (only the portion embraced by the primers). The sequences are named with marker name plus sequencing primer plus pedigree number, e.g. 1g13380R3353 for N. otophora, 3385 for N. tomentosiformis, 3460 for N. acuminata and 3461 for N. acuminata var. multiflora. “NNNNN” in the COSII consensus sequences indicates the predicted intron positions. “.aln” files are alignments of the corresponding “.txt” files, which show the conserved exon sequences and splicing sites. “.fasta” files are alignments of separated introns or exons based on the corresponding “.aln” files, which were then used to calculate SNP frequency between two parents of a population (see Tables 5 & 6). The folders “mappedNonCOSIIseqs” contain nonCOSII marker sequences, e.g. LPT5E7F1-3461 is the sequence of N. acuminata var. multiflora for marker cLPT5E7 by primer LPT5E7F1.