This unigene build has unigene_build_id=42 with date: 17-07-2008. Method: - Get all the Nicotiana tabacum mRNA processed sequences from the SGN database (239172 sequences). - Preclustering: SelfblastN of this sequences dataset and precluster the sequences with match length > 30 bp and percentage length > 90% using a SGN script (precluster.pl) - Clustering: Clustering of the sequences using CAP3 program with the follow arguments -e 5000 -p 90 -d 10000 -b 60. Statistics: - Members: 239761. - Unigenes: 84602. - Singlets: 53906. - Contigs: 30696.