We are pleased to announce the release of version SL4.0 of the tomato genome and version ITAG4.0 of the annotation. You can find here more information about the assembly and annotation in the preprint on bioRxiv and slides. The genome data is available from our FTP site (Build SL4.0 and ITAG4.0), JBrowse, Apollo gene editor and Blast.

Please use the contact form to let us know if you notice any issues with SL4.0 or ITAG4.0.

Build highlights SL4.0
Annotation highlights ITAG4.0

   - Only 44Kb of N's (unknown bases) compared to 81.7Mb in SL3.0
   - Only 152 unplaced contigs in Chr 00 compared to 4,374 in SL3.0
   - Better annotation of repeat regions in SL4.0
   - 80X Pacbio coverage with RSII and Sequel (13kb read N50)
   - Canu assembly (N50 5.5 Mb) and Hi-C scaffolding (12 chromosomes and unplaced contigs)
   - Validated with Bionano optical maps and 10X linked reads

   - 34,075 protein coding genes in ITAG4.0
   - Functional descriptions assigned to 29,532 genes
   - ITAG4.0 has 4,794 novel genes
   - 29,281 genes preserved from ITAG2.3
   - 21,962 of 29,281 preserved genes have been updated
   - Most of the updated genes have extensions in the 5' and 3' UTRs