Core A
Core A: Administrative and Organizational
This Administrative Core (Core A) will have two major functions. First, it will provide administrative support and intellectual enrichment for the investigators in this program. Due to the fact that the Program includes 5 senior investigators from two institutions (UC San Diego and the Scripps Research Institute) belonging to several departments: Pediatrics, Medicine, Neuroscience (School of Medicine at UCSD), Computer Science (UCSD) and the Scripps Research Institute.
Additionally, three different species will be used for this work (Human, Mouse, Zebrafish) and work accomplished at four different campuses (UCSD School of Medicine, UCSD Cancer Center, UCSD Computer Sciences, Scripps Research Institute). Because of this and the multi- disciplinary nature of the Program, a central administrative core is essential. The Core will work closely with the administration of the two major institutions, as well as administrators, business managers, and scientists in each department involved.
The Administrative Core will assist individual investigators in budgeting and will coordinate travel, purchasing, meetings and seminars.
The second function of the Administrative Core is to provide and encourage intellectual collaboration between members of the Program Project, faculty within the University (who are not members of this Program Project), and outside consultants. The Core will achieve these objectives in two ways:
- We will have a monthly two-hour seminar for all investigators in this Program. Research progress will be presented by one of the Program's investigators during each seminar. This will average about 10-12 seminars per year and about 2-3 per Project/year. We will also have the Directors of Cores B and C present new data generation and analytical techniques, especially in the rapidly moving field of genomics and bioinformatics.
- The Program Project plans to have outside experts visit each year as consultants, at least one of the Project groups. The role of the consultant will be to work directly with his/her host, to advise regarding their progress, to suggest possible new directions for both the individual Projects and potentially the overall Program.
In addition, we propose also to have a mid-term whole day symposium during which we invite consultants whose work is on some aspects of the research of each Project. During these workshops, each of the PIs presents their progress over the preceding 2 years and the outside experts are asked to critically evaluate the progress of each project.
Leader: Joseph G. Gleeson M.D.Professor of Neurosciences and Pediatrics
UC San Diego
Dr. Gleeson's Lab Website
Core B
Core B: High-Throughput Sequencing Genomics
The amazing advances brought fonward by the completion of the Human Genome Project, new sequencing technologies and new methodologies to extract specific fragments of genomic DNA (gDNA), have now made it possible to sequence the "exome" in an individual patient in a relatively short time-frame (whole exome sequencing, WES). The Core PI and Co-PI are thoroughly expert in the design and implementation of next-generation sequencing (NGS) experiments, and will establish and operate the NGS Core B to generate WES data across each of the proposed Projects, involving human, mouse and zebrafish.
It has long been appreciated that the vast majority of alleles with strong effect are located in the exome, which constitutes just 1% of most vertebrate genomes. The methodologies to capture and sequence the exome in humans was the first to emerge and is transforming the way scientists approach genetic disease. Methodologies to capture and sequence the mouse and zebrafish exomes will be available shortly. In this Core, we will bring these technologies to bear on structural brain diseases (SBDs) across the evolutionary spectrum. We have been successful at generating WES data that produces 95% target bases at >10X coverage. For dominant disease, the ability to systematically identify heterozygous variants is limited by this coverage, but for recessive disease, this hurdle is easily overcome. Because recessive disease due to homozygous mutations in humans, mouse and zebrafish is the focus of this application, we will be extraordinarily well-powered to identify causative mutations in these species using this approach.
An important aspect of Core B is the close ties that will develop not just to the Scientific Projects but also with Core C (Bioinformatics Core) and Core A (Administrative Core). Core C will develop and utilize new software that is specifically devised for identifying homozygous potentially deleterious sequence variants (PDSVs) in the data from Core B. Core A will support the technological infrastructure of both Core B and Core C. Projects I, II and III will be well-positioned to uncover new mechanisms of SBDs, and translate these into new discoveries about underlying mechanisms.
Director: Kelly Frazer, Ph.D.Co-Director: Olivier Harismendy, PhD
Division of Genome Information Sciences
Department of Pediatrics
UC San Diego
Core C
Core C: Bioinformatics Genomics
Bioinformatics is the application of statistics and computer science to the field of molecular biology. It has emerged as a field unto itself, as the datasets that are generated by modern biomedical researchers easily exceeds what can be directiy analyzed. Core C will work with the data generated from massive parallel sequencing from human, mouse and zebrafish, to extract variants that are potential to cause disease.
The PIs of Cores A, B and C have worked together extensively in the past, and have an established track record of productivity in the area of next generation sequencing (NGS) data analysis. Dr. Bafna has worked broadly in bioinformatics and genomics in the development computational methodologies employing novel algorithms and statistical techniques for NGS datasets. We envision that the WES data generated from Core B will be delivered to Core C for extraction of the potentially deleterious sequence variants (PDSVs), which will be delivered back to each of the Projects for segregation analysis and further validation. This will be accomplished by developing the four key pipelines of Core C:
- WES data tracking and storage pipeline
- WES data analysis pipeline
- Mutation identification pipeline
- Comparative genomics pipeline
The analysis of WES datasets is presented in this application as a series of filters that is applied to the primary sequence to extract all relevant variants, and then apply a heuristic ranking strategy to detect the PDSVs mostly likely associated with the phenotype. The output of this FILTER and PRIORITIZE programs are then reported as both SNPs and INDELs in a ranked fashion, for later validation and segregation testing. Further analysis will help uncover the contribution of these genes to common disease as well as genome- wide gene-gene interactions using other software we have developed. We are also well-positioned to take full advantage of the 3^" generation DNA sequencers, and are excited that UCSD will serve as one of the national HHMI PacBio Sequencing Centers. These tools, together with the outstanding and unique human and animal resources, will make for a powerful combination to investigate new causes of structural brain disorders.
Leader: Vineet Bafna, PhDProfessor, Computer Science and Engineering
UC San Diego
Dr. Bafna's Research Website