Transcriptome Data Assembly and Gene Function Annotation of MangoFruits
We performed transcriptome sequencing of mixed RNA isolated from fruit sample for ‘Zill’ mango (Mangifera indica Linn) including pericarp and pulp during different development stage using Illumina RNA-seq technology.RNA-seq generated 68 419 722 sequence reads encompassing 6 157 744 980 total nucleotides (nt),each sequence read averaging 90 bp in length.All the sequence read datasets were deposited at the NCBI Sequence Read Archive (SRA) (GenBank accession SRP035450).All high-quality reads were assembled into 124 002 contigs.these contigs were further assembled into 54 207 unigene with a mean size of 838 bp.A total of 42 515 (78.43%)unigenes were annotated using public protein databases with a cut-off E-value above 10-5.out of these,35 198 unigenes and 14 619 unigenes were assigned to Gene Ontology terms and Clusters of Orthologous groups.Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 23 741 (43.79%) unigenes which were mapped to 128 pathways,which including Metabolic pathways,Biosynthesis of secondary metabolites,Plant-pathogen interaction,Plant hormone signal transduction,Phenylpropanoid biosynthesis,Starch and sucrose metabolism,Flavonoid biosynthesis.This study will lay a foundation for further research on functional gene cloning,gene expression analysis and molecular marker development in mango.