Read Me
Banana Data
Provided by NSIP, Aug 2022

This drive contains the following data from the ABBB banana project diversity study:

-----------------------

Banana_Data

>CombinedHapmap
    > unfilteredBananaHapmap.vcf.gv : This file contains the original, unfiltered and unprocessed call data for 763982 markers and 1041 individuals (including replicate individuals added by NSIP and by DArT).

>dataset1Fastqs
    > 848 files, one for each sequenced sample. Each target ID is a numeric 7-digit ID provided by DArT.

>dataset1KeyFiles
    > 4 key files corresponding to the Fastq files in dataset1

>dataset2Fastqs
    > 193 files, one for each sequenced sample. Each target ID is a numeric 7-digit ID provided by DArT.

>dataset2KeyFiles
    > 1 key file corresponding to the Fastq files in dataset2

>ProcessedMarkerSet
    >ABBB_Diversity_913i_56840m.csv : This file contains the processed marker set used for the diversity study. Replicate individuals were merged into a single call and are labeled as DART_ID $ DART_ID, listing the IDs of the individuals that were merged. Markers are subset to a high-quality set after ploidy-specific calling and quality control filtering. Calls are represented as "A", "B", "H" or "":
        "A" = homozygous for the reference allele (Cavendish genome sequence)
        "B" = homozygous for the alternate allele
        "H" = heterozygote
        ""  = missing (blank)
    >translationFile_56840m.csv : This file shows the reference and alternate alleles as nucleotides (ATCG format). A = Reference, B = Alternate
    >SampleTargetIDTranslation_1040i.csv : This file lists the unique tissue sample identifiers for each genotyped sample.
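
The merged-replicate labels and the "A"/"B"/"H"/blank coding described for ABBB_Diversity_913i_56840m.csv could be handled along the following lines. This is only a minimal sketch: the function names, the example IDs, and the numeric codes 0/1/2 are illustrative assumptions and are not part of the data set, and the column layout of the CSV is not described here, so the sketch works on plain strings rather than on the file itself.

```python
from typing import List, Optional

# Illustrative numeric codes (an assumption, not defined by the data set).
# They are ordinal labels only and do not claim a ploidy-specific dosage.
CALL_CODES = {"A": 0, "H": 1, "B": 2}


def split_merged_id(label: str) -> List[str]:
    """Split a merged-replicate label of the form 'DART_ID $ DART_ID' into IDs.

    A label without '$' is returned as a single-element list.
    """
    return [part.strip() for part in label.split("$") if part.strip()]


def encode_call(call: str) -> Optional[int]:
    """Map 'A'/'H'/'B' to 0/1/2 and a blank call to None (missing)."""
    call = call.strip()
    if call == "":
        return None
    if call not in CALL_CODES:
        raise ValueError(f"unexpected call: {call!r}")
    return CALL_CODES[call]


def missing_rate(calls: List[str]) -> float:
    """Return the fraction of blank (missing) calls in a list of calls."""
    if not calls:
        return 0.0
    return sum(1 for c in calls if c.strip() == "") / len(calls)


if __name__ == "__main__":
    # The IDs below are made up for illustration.
    print(split_merged_id("1234567 $ 7654321"))  # ['1234567', '7654321']
    print([encode_call(c) for c in ["A", "B", "H", ""]])  # [0, 2, 1, None]
    print(missing_rate(["A", "", "H", "B"]))  # 0.25
```

Blank calls are kept as None rather than being given a number, so that missing data cannot be mistaken for a genotype in later calculations.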