International Tomato Genome Sequencing Project

The International Tomato Genome Sequencing Project was begun in 2004 by an international consortium including participants from Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy and the United States. The initial approach was to sequence only the euchromatic sequence using a BAC-by-BAC approach, and in total more than 1,200 BACs have been sequenced. In 2009, a complementary whole-genome shotgun approach was initiated, which in conjunction with other data yielded high quality assemblies. The International Tomato Annotation Group (ITAG) annotates the genome builds generated by this combined sequencing approach.

Please note that all data is released under the data access agreement below.

Data access agreement 

PLEASE READ BEFORE ACCESSING THE PRE-PUBLICATION TOMATO GENOME SEQUENCE OR ANNOTATIONS: The International Tomato Genome Sequencing Consortium is pleased to make available a pre-publication draft assembly of the tomato genome for use by public and private research communities as a resource to enable plant biology discovery and improve the human condition through improved agriculture. This assembly was produced by the Dutch/French assembly team and includes both 454 data and Sanger sequence data (BAC-ends, fosmid-ends and Selected BAC Mixture sequences).

We caution you that the current assembly is a "work-in-progress" and as such is subject to modification prior to publication release (anticipated for mid-2011), some of which is likely to be substantial. Therefore we encourage you to carefully and independently validate any conclusions you may draw from this sequence. We will update this resource as improvements in the assembly are made. We welcome any feedback regarding your successes or that may assist us in improving the quality and accuracy of this sequence.

This pre-publication tomato genome data is made available with the understanding that users will respect the rights of those who contributed to this effort to describe the tomato genome in a peer-reviewed publication. This description includes whole genome level analyses on genes, gene families, repetitive sequences etc. We encourage you to review the NIH-NHGRI guidelines on distribution and use of pre-publication genome sequence at http://www.genome.gov/page.cfm?pageID=10506537. Any use of the tomato genome data prior to its publication should credit "The International Tomato Genome Sequencing Consortium". If you are uncertain about how to credit the use of the sequence or its appropriate use please do not hesitate to contact Joyce Van Eck.

Official annotation browse genome contigs and official annotations 
The official annotation for the tomato genome is provided by the International Tomato Annotation Group (ITAG), a multinational consortium, funded in part by the EU-SOL project.
ITAG2.3 annotation release  
ITAG2.3 Release: genomic annotations

ITAG Release 2.3 (2011-04-26) official annotations on the SL2.40 genome build by the International Tomato Annotation Group (ITAG).

Browse or
ITAG2.3 Release: protein annotations

ITAG Release 2.3 (2011-04-26) official annotations on the SL2.40 genome build by the International Tomato Annotation Group (ITAG).

Browse or
Bulk files
ITAG2 annotation release  
ITAG1 annotation release  
Tomato genome sequence builds  
ReleaseDateDescriptionAnnotationDownload
SL1.00Dec 2009Initial build, based on the Newbler assembler and containing only 454 sequencing data.ITAG1scaffolds
proteins
cds
SL1.03Jan 2010Like 1.00, but with additional 454 runs and improved contamination screen.Not annotatedscaffolds
cabog1.00Mar 2010All 454 data, bac end and fosmid end data, assembled using the CABOG assembler.Not annotatedscaffolds
SL1.50Apr 2010Includes all 454 data, bac ends, fosmid ends, polishing with Solexa and SOLiD data.Not annotatedscaffolds
SL2.00Jun 2010Release withdrawn.Not annotated-
SL2.10Jun 2010Additional scaffold merging using clone end sequences. Scaffolds placed and oriented using multiple physical maps, first release to include chromosome pseudomolecule sequences.Not annotatedscaffolds, chromosomes
SL2.30Aug 2010Integration and polishing of tomato BAC sequencesmoved to SL2.31scaffolds, chromosomes
SL2.31Nov 2010Mask a small number of contaminated regions. Base-compatible with SL2.30.ITAG2scaffolds, chromosomes
SL2.40Jan 2011Small amount of additional contamination removal. Regularize gap sizes to comply with GenBank policies.ITAG2.3scaffolds, chromosomes
Assembly issues

If in the course of your work you find errors or other issues with the tomato genome assemblies, please report them using one of the following links:

Clone sequences  
Other tomato genome pages on SGN  
Tomato sequencing tools elsewhere on the web  
Publications 
Something wrong? Report a problem