Computational solutions to large-scale data management and analysis

scientific article (publication date: September 2010)

Computational solutions to large-scale data management and analysis is …
instance of (P31): scholarly article (Q13442814)

External links:
Dimensions Publication ID (P6179): 1034410093
OpenCitations bibliographic resource ID (P3181): 4519573
PMC publication ID (P932): 3124937
PubMed publication ID (P698): 20717155
ResearchGate publication ID (P5875): 45695183

author (P50): Garry P. Nolan (Q89180933)
author name string (P2093):
Eric E Schadt
Lawrence Lee
Michael D Linderman
Jon Sorenson
cites work (P2860):
A network view of disease and compound screening (Q44111868)
Up in a cloud? (Q51757985)
A Survey of General-Purpose Computation on Graphics Hardware (Q56019490)
Third-generation sequencing fireworks at Marco Island (Q84199345)
High-throughput Bayesian Network Learning using Heterogeneous Multicore Computers (Q88802795)
High-throughput sequence alignment using Graphics Processing Units (Q21284218)
A human gut microbial gene catalogue established by metagenomic sequencing (Q24618931)
Variations in DNA elucidate molecular networks that cause disease (Q24622333)
Infernal 1.0: inference of RNA alignments (Q24646921)
Accelerating molecular dynamic simulation on graphics processing units (Q24654186)
Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays (Q28263829)
Real-time DNA sequencing from single polymerase molecules (Q28301519)
Mapping the genetic architecture of gene expression in human liver (Q28472693)
Bacterial community variation in human body habitats across space and time (Q29547432)
Genetics of gene expression and its effect on disease (Q29614591)
Genetic mapping in human disease (Q29614943)
BLAST: at the core of a powerful and diverse set of sequence analysis tools (Q29615883)
A general framework for weighted gene co-expression network analysis (Q29617580)
VertNet: a new model for biodiversity data sharing (Q30000978)
CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units (Q30488136)
Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks (Q31159122)
A Bayesian partition method for detecting pleiotropic and epistatic eQTL modules (Q33525507)
Direct detection of DNA methylation during single-molecule, real-time sequencing (Q33888756)
Direct sequencing of the human microbiome readily reveals community differences (Q33965950)
Accelerating molecular dynamic simulation on the cell processor and Playstation 3. (Q34794279)
Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges (Q34810712)
Searching for SNPs with cloud computing (Q34964688)
Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry (Q34992218)
CloudBurst: highly sensitive read mapping with MapReduce (Q37193342)
Cloud computing: a new business paradigm for biomedical information sharing (Q37588826)
language of work or name (P407): English (Q1860)
main subject (P921): data management (Q1149776)
publication date (P577): 2010-09-01
published in (P1433): Nature Reviews Genetics (Q1071824)
title (P1476): Computational solutions to large-scale data management and analysis

Reverse relations

cites work (P2860)
Q28744052 'Sciencenet'--towards a global search and share engine for all scientific knowledge
Q28607528 A Framework for Global Collaborative Data Management for Malaria Research
Q37508553 A cloud-based workflow to quantify transcript-expression levels in public cancer compendia
Q30720812 A comparison study of succinct data structures for use in GWAS.
Q58185784 A data-driven framework for archiving and exploring social media data
Q34349116 A glimpse into past, present, and future DNA sequencing.
Q50875340 A literature mining-based approach for identification of cellular pathways associated with chemoresistance in cancer
Q33849686 A performance/cost evaluation for a GPU-based drug discovery application on volunteer computing
Q50104991 A roadmap towards personalized immunology.
Q24289222 A scalable method for molecular network reconstruction identifies properties of targets and mutations in acute myeloid leukemia
Q35006338 A sea of biosynthesis: marine natural products meet the molecular age.
Q57210666 A survey and evaluation of Web-based tools/databases for variant analysis of TCGA data
Q34557864 A survey of tools for variant analysis of next-generation genome sequencing data
Q34155502 Accelerating translational research by clinically driven development of an informatics platform--a case study
Q34361062 Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis
Q39168493 Analyzing large datasets with bootstrap penalization
Q47290661 Analyzing large scale genomic data on the cloud with Sparkhit
Q28076074 Applying computation biology and "big data" to develop multiplex diagnostics for complex chronic diseases such as osteoarthritis
Q33945185 BRISK—research-oriented storage kit for biology-related data
Q30451663 Behavioral barcoding in the cloud: embracing data-intensive digital phenotyping in neuropharmacology
Q89770267 Behind the Scenes of Successful Research in Emergency Medicine: Nine Tips for Junior Investigators
Q34233227 Benchmarking undedicated cloud computing providers for analysis of genomic datasets
Q87686345 Big Data Provenance: Challenges, State of the Art and Opportunities
Q30365708 Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks. Contribution of the IMIA Primary Healthcare Working Group.
Q90096114 Big Data and Artificial Intelligence Modeling for Drug Discovery
Q57002530 Big data, but are we ready?
Q31013317 BioDB extractor: customized data extraction system for commonly used bioinformatics databases
Q91992463 Bioinformatics Workflows With NoSQL Database in Cloud Computing
Q30578803 Bioinformatics clouds for big data manipulation
Q28731508 Bioinformatics tools and database resources for systems genetics analysis in mice--a short review and an evaluation of future needs
Q38062889 Biological network analysis: insights into structure and functions
Q34013580 Biomedical cloud computing with Amazon Web Services
Q45770499 BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters
Q31026807 Breast Imaging in the Era of Big Data: Structured Reporting and Data Mining
Q31063860 Can data repositories help find effective treatments for complex diseases?
Q34537690 Cancer genomic research at the crossroads: realizing the changing genetic landscape as intratumoral spatial and temporal heterogeneity becomes a confounding factor
Q34017084 Characterizing genetic interactions in human disease association studies using statistical epistasis networks
Q34005928 CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing
Q83387272 Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology
Q28742669 Community-driven computational biology with Debian Linux
Q35678244 Compression of Large genomic datasets using COMRAD on Parallel Computing Platform
Q35809331 Computational tools for discovery and interpretation of expression quantitative trait loci
Q30622838 DDBJ read annotation pipeline: a cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data
Q61660750 Data Management Experiences and Best Practices from the Perspective of a Plant Research Institute
Q27009050 Data management strategies for multinational large-scale systems biology projects
Q28607400 Data management, documentation and analysis systems in radiation oncology: a multi-institutional survey
Q40966285 Decoding the immune response to successful influenza vaccination.
Q36688894 DemaDb: an integrated dematiaceous fungal genomes database
Q37927182 Detecting unknown sequences with DNA microarrays: explorative probe design strategies
Q50609941 Detrending moving average algorithm: Frequency response and scaling performances
Q38145438 Developing translational research infrastructure and capabilities associated with cancer clinical trials.
Q39519619 Disease gene prioritization using network and feature
Q38941374 Dissecting the phenotypic components of crop plant growth and drought responses based on high-throughput image analysis.
Q37544055 Efficient denoising algorithms for large experimental datasets and their applications in Fourier transform ion cyclotron resonance mass spectrometry
Q57002849 Enabling cloud bursting for life sciences within Galaxy
Q41780136 Enabling large-scale biomedical analysis in the cloud
Q37270318 Enzyme reaction annotation using cloud techniques
Q35027011 Expanding roles in a library-based bioinformatics service program: a case study
Q34569390 Expanding the boundaries of local similarity analysis
Q21045422 Flow cytometry bioinformatics
Q34253645 Fractal MapReduce decomposition of sequence alignment
Q24612983 From RNA-seq reads to differential expression results
Q26830861 Genetic data and electronic health records: a discussion of ethical, logistical and technological considerations
Q34571172 Genetics and primary care: where are we headed?
Q38021828 Genome Research in the Cloud
Q38490268 Genome-wide identification of cancer-related polyadenylated and non-polyadenylated RNAs in human breast and lung cell lines.
Q28608430 Genomics Virtual Laboratory: A Practical Bioinformatics Workbench for the Cloud
Q38115030 Genomics and transcriptomics in drug discovery
Q30712554 Heart beats in the cloud: distributed analysis of electrophysiological 'Big Data' using cloud computing for epilepsy clinical research
Q28657443 Imaging informatics: essential tools for the delivery of imaging services
Q59302773 Improving Environmental Scanning Systems Using Bayesian Networks
Q46557322 Improving data mining strategies for drug design
Q38578341 In silico ADME/T modelling for rational drug design.
Q31144037 In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data
Q38092764 Information engineering infrastructure for life sciences and its implementation in China
Q37970704 Insights into antibiotic resistance through metagenomic approaches.
Q28538643 Integrated Bio-Search: challenges and trends for the integration, search and comprehensive processing of biological information
Q62776588 Interoperable and scalable data analysis with microservices: Applications in Metabolomics
Q64113894 Interoperable and scalable data analysis with microservices: Applications in Metabolomics
Q35502599 Introduction to bioinformatics: sequencing technology
Q37550841 Investigating Mutations to Reduce Huntingtin Aggregation by Increasing Htt-N-Terminal Stability and Weakening Interactions with PolyQ Domain
Q28681150 KNODWAT: a scientific framework application for testing knowledge discovery methods for the biomedical domain
Q37701987 Knowledge discovery by accuracy maximization.
Q34994748 LAILAPS: the plant science search engine
Q52716996 Landscape of Actionable Genetic Alterations Profiled from 1,071 Tumor Samples in Korean Cancer Patients.
Q34782992 Lessons learned from implementing a national infrastructure in Sweden for storage and analysis of next-generation sequencing data
Q38183344 Locus-specific databases in cancer: what future in a post-genomic era? The TP53 LSDB paradigm
Q57089715 Marine ecosystem acoustics (MEA): quantifying processes in the sea at the spatio-temporal scales on which they occur
Q27692567 Mechanisms of drug resistance in kinases
Q46011126 Metabolomic Modularity Analysis (MMA) to Quantify Human Liver Perfusion Dynamics.
Q37502143 Mixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants
Q40882296 Monte Carlo simulation of photon migration in a cloud computing environment with MapReduce
Q38410699 Multi-scale genetic dynamic modelling II: application to synthetic biology: an algorithmic Markov chain based approach
Q37973257 NEW: network-enabled wisdom in biology, medicine, and health care
Q35761806 NGS technologies for analyzing germplasm diversity in genebanks.
Q35789407 Needs Assessment for Research Use of High-Throughput Sequencing at a Large Academic Medical Center
Q41747887 Next-Generation Sequencing: The Translational Medicine Approach from "Bench to Bedside to Population".
Q33565443 Next-generation sequencing: from understanding biology to personalized medicine
Q36457381 Now and next-generation sequencing techniques: future of sequence analysis using cloud computing.
Q28731358 Opportunities and Challenges for the Life Sciences Community
Q38675644 Optimizing drug development in oncology by clinical trial simulation: Why and how?
Q47975495 Organellar Omics-A Reviving Strategy to Untangle the Biomolecular Complexity of the Cell
Q34377131 P4 medicine: how systems medicine will transform the healthcare sector and society
Q36642804 PD-1 Blockade Expands Intratumoral Memory T Cells
Q38078204 Pediatric systems medicine: evaluating needs and opportunities using congenital heart block as a case study
Q38206642 Principles and methods of integrative genomic analyses in cancer
Q27323553 Profiling animal toxicants by automatically mining public bioassay data: a big data approach for computational toxicology
Q21284302 QMachine: commodity supercomputing in web browsers
Q33751206 Ray Meta: scalable de novo metagenome assembly and profiling.
Q37101303 Relational Network for Knowledge Discovery through Heterogeneous Biomedical and Clinical Features
Q26749385 Review of Developments in Electronic, Clinical Data Collection, and Documentation Systems over the Last Decade - Are We Ready for Big Data in Routine Health Care?
Q86411837 Role of high-throughput sequencing in oncology
Q28276638 SCALCE: boosting sequence compression algorithms using locally consistent encoding
Q34369145 SIMPLEX: cloud-enabled pipeline for the comprehensive analysis of exome sequencing data
Q33742951 SUPERFAMILY 1.75 including a domain-centric gene ontology method
Q28660188 Satellite remote sensing, biodiversity research and conservation of the future
Q38757199 Scale-up/Scale-down of microbial bioprocesses: a modern light on an old issue
Q38274975 Single-cell and multivariate approaches in genetic perturbation screens
Q36240390 Statistical Approaches for Gene Selection, Hub Gene Identification and Module Interaction in Gene Co-Expression Network Analysis: An Application to Aluminum Stress in Soybean (Glycine max L.).
Q56284351 Statistical Inference, Learning and Models in Big Data
Q37141192 Stem cell systems informatics for advanced clinical biodiagnostics: tracing molecular signatures from bench to bedside
Q36877024 Strand-Specific RNA-Seq Provides Greater Resolution of Transcriptome Profiling.
Q37232564 Systems biological approaches to measure and understand vaccine immunity in humans
Q34892932 Systems biology of asthma and allergic diseases: a multiscale approach
Q21032488 Systems immunology of human malaria
Q37894331 TP53 Mutations in Human Cancer: Database Reassessment and Prospects for the Next Decade
Q30856212 The Cancer Genomics Hub (CGHub): overcoming cancer through the power of torrential data
Q57139707 The HTPmod Shiny application enables modeling and visualization of large-scale biological data
Q36193716 The Power of Boolean Implication Networks
Q37911523 The Top Five “Game Changers” in Vaccinology: Toward Rational and Directed Vaccine Development
Q35188878 The human condition: an immunological perspective
Q28300132 The origin of the Haitian cholera outbreak strain
Q38067181 To milliseconds and beyond: challenges in the simulation of protein folding
Q33851072 Tools for managing and analyzing microarray data
Q28648412 Toward a Literature-Driven Definition of Big Data in Healthcare
Q33992866 Toward real-time Monte Carlo simulation using a commercial cloud computing infrastructure
Q34174942 Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference
Q94335553 Towards reproducible computational drug discovery
Q40010897 Ultrafast and scalable cone‐beam CT reconstruction using MapReduce in a cloud computing environment
Q35134186 Unifying immunology with informatics and multiscale biology.
Q43435937 Using molecular profiled human tissue to accelerate drug discovery
Q26765778 Utilizing electronic health records to predict acute kidney injury risk and outcomes: workgroup statements from the 15(th) ADQI Consensus Conference
Q33637552 VCGDB: a dynamic genome database of the Chinese population
Q37593301 Viral Phylogenomics Using an Alignment-Free Method: A Three-Step Approach to Determine Optimal Length of k-mer
Q30587418 Visualizing time-related data in biology, a review
Q34306646 Volcano plots in analyzing differential expressions with mRNA microarrays.
Q34619146 Why is it so difficult to data mine relevant genome-scale biomarkers?
Q38621817 iCAVE: an open source tool for visualizing biomolecular networks in 3D, stereoscopic 3D and immersive 3D.