There are 3 different datasets of proteins for an unigene build. + Proteins predicted by longest6frame (a SGN script that translate the sequence in the 6 ORF and get the longest) + Proteins predicted by estscan (http://www.ch.embnet.org/software/ESTScan2.html) + Proteins preferred (for each unigene, compare both methods and get the longest protein) For each dataset, exists two files (cds and protein). Also is provided a version of the preferred protein dataset with annotations compatibles with ProteinPilot program. ---------------------------------------- Report: ---------------------------------------- Files: * cds sequences: - cds fasta file: Petunia_hybrida_cds_predicted_by_estscan.v1.fasta - number of sequences: 4945 - total bases: 2142960 - average sequences length: 433 - maximum sequence length: 1911 - minimum sequence length: 51 * protein sequences: - protein fasta file: Petunia_hybrida_protein_predicted_by_estscan.v1.fasta - number of sequences: 4945 - total aminoacids: 709857 - average sequences length: 143 - maximum sequence length: 636 - minimum sequence length: 16 * cds sequences: - cds fasta file: Petunia_hybrida_cds_predicted_by_longest6frame.v1.fasta - number of sequences: 5251 - total bases: 2086221 - average sequences length: 397 - maximum sequence length: 1809 - minimum sequence length: 69 * protein sequences: - protein fasta file: Petunia_hybrida_protein_predicted_by_longest6frame.v1.fasta - number of sequences: 5251 - total aminoacids: 693657 - average sequences length: 132 - maximum sequence length: 603 - minimum sequence length: 23 * cds sequences: - cds fasta file: Petunia_hybrida_cds_predicted_by_preferred.v1.fasta - number of sequences: 5135 - total bases: 2271087 - average sequences length: 442 - maximum sequence length: 1911 - minimum sequence length: 69 * protein sequences: - protein fasta file: Petunia_hybrida_protein_predicted_by_preferred.v1.fasta - number of sequences: 5135 - total aminoacids: 753917 - average sequences length: 146 - maximum sequence length: 636 - minimum sequence length: 23 ----------------------------------------