In the next folders are included the genomic data for Nicotiana tomentosiformis published by Sierro, et al. 2013 in Genome Biology. assembly: includes a file, called Ntom_ASAG01.fa.gz, with the genome sequences in FASTA format (compressed using gzip) for the N.tomentosiformis genome scaffolds. annotation: contains the FASTA files for the N.tomentosiformis genome, transcripts and proteins, and the gff3 files for the gene models predicted by Cufflinks, and BLAT mappings with tomato (ITAG 2.4), potato (PGSC) and arabidopsis thaliana (TAIR10). Ntom.fasta.tar.gz -> All annotation FASTA files compressed Ntom_ASAG01_NTOM.cds.fna -> coding sequences (CDS) Ntom_ASAG01_NTOM.mrna.fna -> mRNA (includes UTR) Ntom_ASAG01_NTOM.proteins.faa -> protein sequences Ntom.gff3.tar.gz -> All gff3 files compressed Ntom_ASAG01_NTOM_rnaseq.gff3 -> Cufflinks gene models Ntom_ASAG01_itag_2.4_.80-80.gff3 -> BLAT mapping of tomato genes (ITAG 2.3) on N.tomentosiformis Ntom_ASAG01_pgsc.80-80.gff3 -> BLAT mapping of potato genes (PGSC) on N.tomentosiformis Ntom_ASAG01_tair10.80-60.gff3 -> BLAT mapping of A. thaliana genes (TAIR 10) on N.tomentosiformis ===================================================== ASSEMBLY INFO ===================================================== BioProject: PRJNA182501 Assembly: GCA_000390325 Level: scaffolds Taxid: 4098 WGS: ASAF00000000 Submission: Registration date: 10-May-2013 Philip Morris International R&D Download Link: ftp://ftp.ncbi.nlm.nih.gov/genbank/genomes/Eukaryotes/plants/Nicotiana_tomentosiformis/Ntom_v01/ ===================================================== STATS ===================================================== Total sequence length 1,688,312,294 bp (1.7 Gb) Total assembly gap length 45,733,186 bp Gaps between scaffolds 0 Number of scaffolds 159,548 sequences Longest scaffold length 789,565 bp Min. scaffold length 200 bp Average scaffold size 10,582 bp Scaffold L90 11,543 bp Scaffold N90 24,457 sequences Scaffold L50 82,593 bp Scaffold N50 5,852 sequences Number of contigs 215,609 sequences Contig L50 34,051 bp ===================================================== Raw reads are available through SRA Database: Submission: ERA214730 Link: http://www.ncbi.nlm.nih.gov/sra?term=ERP002502 Sierro, N. et al. Reference genomes and transcriptomes of Nicotiana sylvestris and Nicotiana tomentosiformis. Genome. Biol. 14, R60 (2013).