
via Johns Hopkins University
Creation of Johns Hopkins-led team allows worldwide scientific collaboration for studies of human genetics and health
Harnessing the power of genomics to find risk factors for major diseases or search for relatives relies on the costly and time-consuming ability to analyze huge numbers of genomes. A team co-led by a Johns Hopkins University computer scientist has leveled the playing field by creating a cloud-based platform that grants researchers easy access to one of the world’s largest genomics databases.
Known as AnVIL (Genomic Data Science Analysis, Visualization, and Informatics Lab-space), the new platform gives any researcher with an Internet connection access to thousands of analysis tools, patient records, and more than 300,000 genomes. The work, a project of the National Human Genome Institute, appears today in Cell Genomics.
“AnVIL is inverting the model of genomics data sharing, offering unprecedented new opportunities for science by connecting researchers and datasets in new ways and promising to enable exciting new discoveries,” said project co-leader Michael Schatz, Bloomberg Distinguished Professor of computer science and biology at Johns Hopkins.
Michael Schatz
Bloomberg Distinguished Professor
Typically, genomic analysis starts with researchers downloading massive amounts of data from centralized warehouses to their own data centers, a process that is not only time-consuming, inefficient, and expensive, but also makes collaborating with researchers at other institutions difficult. Genetic risk factors for ailments such as cancer or cardiovascular disease are often very subtle, so researchers must analyze thousands of patients’ genomes to discover new associations. The raw data for a single human genome comprises about 40GB, so downloading thousands of genomes to conduct such research can take takes several days to several weeks.
“AnVIL will be transformative for institutions of all sizes, especially smaller institutions that don’t have the resources to build their own data centers. It is our hope that AnVIL levels the playing field, so that everyone has equal access to make discoveries,” Schatz said.
In addition, studies requiring the integration of data collected at multiple institutions means each institution must download its own copy while ensuring that patient-data security is maintained. This challenge is expected to become even greater in the future, as researchers embark on ever-larger studies requiring the analysis of hundreds of thousands to millions of genomes at once.
“Connecting to AnVIL remotely eliminates the need for these massive downloads and saves on the overhead,” Schatz says. “Instead of painfully moving data to researchers, we allow researchers to effortlessly move to the data in the cloud. It also makes sharing datasets much easier so that data can be connected in new ways to find new associations, and it simplifies a lot of computing issues, like providing strong encryption and privacy for patient datasets.”
AnVIL also provides researchers with several major analysis tools, including Galaxy, developed in part at Johns Hopkins, along with other popular tools such as R/Bioconductor, Jupyter notebooks, WDLs, Gen3, and Dockstore to support both interactive analysis and large-scale batch computing. Collectively, these tools allow researchers to tackle even the largest studies without having to build out their own computing environments.
Researchers from all over the world currently use the AnVIL platform to study a variety of genetic diseases, including autism spectrum disorders, cardiovascular disease, and epilepsy. Schatz’s team, part of the Telomere-to-Telomere Consortium, used it to reanalyze thousands of human genomes with the new reference genome to discover more than 1 million new variants.
Already, the AnVIL team has collected petabytes of data (1 petabyte equals one million GB) from several of the largest NHGRI projects, including hundreds of thousands of genomes from the Genotype-Tissue Expression, Centers for Mendelian Genetics, and Centers for Common Disease Genomics projects, with plans to host many more projects in the near future.
Original Article: New cloud-based platform opens genomics data to all
More from: Johns Hopkins University | Broad Institute | Harvard University | Vanderbilt University | University of Chicago | Oregon Health & Science University | Yale School of Medicine | University of California Santa Cruz | Roswell Park Comprehensive Cancer Center| Pennsylvania State University | City University of New York | Carnegie Institution for Science | Washington University in St. Louis
The Latest on: Genomics databases
- Mining museums’ genomic treasureson May 28, 2022 at 4:48 am
Genomic analysis of blood samples from deer mice preserved at the Museum of Southwestern Biology at the University of New Mexico and the Museum of Texas Tech University helped scientists figure out ...
- SARS-CoV-2 genome analysis could boost disease surveillanceon May 27, 2022 at 6:31 am
Continued monitoring of the evolution of the virus that causes COVID-19 as the pandemic progresses could strengthen disease surveillance systems and aid preparedness for new variants, a study suggests ...
- 54gene Signs MoU with ANSTS to Announce Landmark SEN-GENOME Genetics Research Studyon May 26, 2022 at 4:03 pm
Kindly share this post54gene, the health technology company advancing African genomics research for improved global health outcomes, has partnered on a new Senegal research study SEN-GENOME, which ...
- New Data to be Shared at ASCO 2022 Underscore Clinical Utility of Decipher Prostate Genomic Classifieron May 26, 2022 at 2:12 pm
(Nasdaq: VCYT) today announced that new data from a large, population-based study reinforce the clinical utility of the Decipher Prostate genomic classifier. The findings, which will be shared for the ...
- VIDEO: Genomics education for nurses can improve communication with patientson May 26, 2022 at 12:17 pm
In this video, Jeramy Tabuzo, BSN, RN, OCN, discussed a recent Oncology Nursing Society survey that evaluated literacy and knowledge deficits in genomics in cancer care among oncology nurses. In ...
- Has your food been chemically altered? New database of 50,000 products provides answerson May 26, 2022 at 7:25 am
In a paper published Tuesday in Nature Food, Giulia Menichetti, senior research scientist at Northeastern's Network Science Institute, demonstrates that the concentrations of different nutrients in ...
- Dante Labs highlights advancements to its drug discovery development program, demonstrating the value of genomic data to drug discoveryon May 26, 2022 at 4:30 am
Company announces rebranding to Dante Genomics to reflect its broader global efforts – Dante drug discovery program Dante drug discovery program NEW YORK, May 26, 2022 (GLOBE NEWSWIRE) -- Dante ...
- Elon Musk Speculates About Storing All Human DNA in Databaseon May 25, 2022 at 1:04 pm
In the early 2000s, scientists from the Human Genome Project announced a breakthrough: they sequenced the complete human reference genome, including all three billion DNA letter, a scientific ...
- What genetic sequencing can reveal about the secret lives of beeson May 23, 2022 at 2:43 pm
The Beenome100 Project aims to sequence dozens of species of native bees to help answer questions about bee biology, behavior and much more.
- Cost of genomic revolution may be more ethical than economicon May 20, 2022 at 7:01 am
A database of genomes, matched to living people, would be a boon for medical research. The fruits of that research, in turn, would make those genomes more useful to their owners as time goes on.
via Bing News
The Latest on: Genomics databases
via Google News
Add Comment