Publication:
A gene atlas for human pathogenic fungi.

dc.contributor.authorKeatinge Clay, Oliver
dc.contributor.corporatenameUniversidad Colegio Mayor de Nuestra Señora del Rosario (Bogotá, Colombia)spa
dc.coverage.projectdates2013-2017spa
dc.date.accessioned2020-10-23T13:26:32Z
dc.date.accessioned2020-12-17T23:52:18Z
dc.date.available2020-10-23T13:26:32Z
dc.date.available2020-12-17T23:52:18Z
dc.date.issued2017-05-19
dc.description.abstractIn recent years there has been a dramatic increase in the number of fully sequenced and annotated unicellular fungi. Many of these fungi are human pathogenic fungi, and a major motivation for their sequencing was a need to ultimately have a reliable set of protein-coding genes, and their DNA and encoded protein sequences. Unfortunately, in a number of the medically important fungal species and their sequenced strains, the quality of the genes' protein-coding sequences, and the automated predictions of exactly where they are located on the chromosomal sequence or contig (coordinates of start, stop, and exon/intron boundaries) are not yet satisfactory, for a large percentage of the genes. Consistency checks are needed, which may then suggest re-annotation, and in some cases resequencing or imputing of unidentified/missing nucleotides in order to reveal the true gene structure. Two very recent advances make this an opportune moment to fill the remaining knowledge gap in a systematic way. The first is that in some parts of the fungal tree, there is already a high phylogenetic density of fully sequenced, annotated genomes, allowing alignment of genes and imputation with high confidence. The second is that affordable Illumina next-generation sequencing (NGS), delivering paired short reads, has now increased its read lengths to above 100 bp as well as the quality of the reads (i.e., ease of NGS-only assembly and accuracy) and the throughput one can expect at a reasonable cost. We propose to construct a sustainable, expandable, human-curatable and easily modifiable prototype database system to hold fungal gene sets (alignments of orthologous or homologous coding sequences) for the nuclear and mitochondrial protein-coding genes of species sequenced by our group and by other sequencing centers. For the prototype we will construct and test by ongoing use in this project, we propose to populate the database with the human pathogenic and other fungi of the Onygenales order, which includes important (primary) pathogens endemic in South and/or North America such as the dimorphic fungi Histoplasma capsulatum, Coccidioides spp., Paracoccidioides brasiliensis, and Blastomyces dermatitidis; as an 'outgroup' we will also include another clinically important group within the ascomycetes, such as the genus Aspergillus. The protein-coding gene sets of these taxa will be carefully corrected and curated, by imputing unidentified/missing nucleotides via alignment and re-annotation where this is possible, and by sequencing 10 previously sequenced or unsequenced strains where the initial results indicate a need for higher sequence quality, or for complementing with additional species/strains in regions of the fungal tree. We will also use this work to bring the gene sets and approximately 10,000 expected alignments to the level of a gene atlas: for the selected (and later for additional) human pathogenic fungi, we will have curated descriptions of the roles of the genes (orthology classes) and associated metadata on them. The design, platform and database structure of the system will be consciously chosen to allow, as far as possible, efficient ongoing human curation that can be sustained also after the end of the 3 year duration of the project.spa
dc.format.extent34 páginas.spa
dc.identifier.instnameColcienciasspa
dc.identifier.reponameRepositorio Colcienciasspa
dc.identifier.repourlhttp://colciencias.metabiblioteca.com.cospa
dc.identifier.urihttps://colciencias.metadirectorio.org/handle/11146/39122
dc.language.isospaspa
dc.relation.ispartofseriesInforme;
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.rights.accessrightshttp://purl.org/coar/access_right/c_abf2spa
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.rights.accessrightshttp://purl.org/coar/access_right/c_abf2spa
dc.rights.creativecommonshttps://creativecommons.org/licenses/by/4.0/spa
dc.subject.proposalBioinformaticsspa
dc.subject.proposalFungal pathogensspa
dc.subject.proposalGenome informaticsspa
dc.subject.proposalMicrobial genomicsspa
dc.subject.proposalOnygenalesspa
dc.titleA gene atlas for human pathogenic fungi.spa
dc.typeInforme de investigaciónspa
dc.type.coarhttp://purl.org/coar/resource_type/c_93fcspa
dc.type.contentTextspa
dc.type.driverinfo:eu-repo/semantics/reportspa
dc.type.redcolhttps://purl.org/redcol/resource_type/PIDspa
dc.type.versioninfo:eu-repo/semantics/submittedVersionspa
dc.type.versionhttp://purl.org/coar/version/c_71e4c1898caa6e32spa
dc.type.versioninfo:eu-repo/semantics/submittedVersionspa
dc.type.versionhttp://purl.org/coar/version/c_71e4c1898caa6e32spa
dcterms.audienceEstudiantes, Profesores, Comunidad científica colombiana, etc.spa
dspace.entity.typePublication
oaire.awardnumber122256934875spa
oaire.funderidentifier.colciencias40-2013
oaire.fundernameDepartamento Administrativo de Ciencia, Tecnología e Innovación [CO] Colcienciasspa
oaire.fundingstreamPrograma Nacional en Ciencias Básicasspa
oaire.objetivesTo provide the biomedical and biological research communities with a sustainable database and system for fungal genes (prototype 'gene atlas'), built around a core of consistently and effectively structured, annotated, relationally linked and human curated protein-coding gene/protein alignments of human pathogenic fungi from the order Onygenales as well as from another ascomycete taxon such as Aspergillus, that can later be extended to include further human pathogenic fungi.spa

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
122256934875.pdf
Size:
11.19 MB
Format:
Adobe Portable Document Format
Description:
Informe Final
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
14.45 KB
Format:
Item-specific license agreed upon to submission
Description:
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:

Collections