## #This is a report table with the different datasets used in the last two years. The table is organized into fours columns: # - First: Library_shortname # - Second: Data source (file_type-file_origin) # - Third and the rest: Datasets with the version and the number os sequences. # # So it means that the second dataset have libraries with a version1 (It is the first time that was used in the dataset) # and version2 (new version of the sequence dataset, with different preprocessing parameters). ## +--------------------+-----------------------------------+-------------------+ | | | First_Dataset | | LIBRARY_SHORTNAME | DATA_SOURCE +-------------------+ | | | 2008-08-27 - seqN | +--------------------+-----------------------------------+-------------------+ | j001 | genbank_sequence-download_dbEST | version1 - 8,325 | | Nb_MixTis1 | genbank_sequence-download_dbEST | version1 - 18,710 | | Nb_Trich_1 | genbank_sequence-download_dbEST | version1 - 6,472 | | SAL_AGN | fasta_files-download_from_TGI* | version1 - 4,274 | | SAL_CAN | fasta_files-download_from_TGI* | version1 - 561 | | SAL_UKA | fasta_files-download_from_TGI* | version1 - 8,228 | | SAL_US | fasta_files-download_from_TGI* | version1 - 8,083 | | vlGB_o61mRNA | genbank_sequence-download_nr | version1 - 378 | +--------------------+-----------------------------------+-------------------+ * TGI (Tobacco Genome Initiative): www.tobaccogenome.org