event-icon
Description

Sanford Imagenetics is a population scale precision medicine initiative that aims to integrate genetics into the primary care of its patients. Integrating genomic data (genome) with the clinical data (phenome) of our patients while being able to create and run collaborative data science products that can harness the power of big data analytics is a challenge. We are working with Databricks and AWS to integrate these data in a HIPAA compliant cloud data-science platform which allows us to conduct large-scale analyses on these datasets.

Describe the new knowledge and additional skills the participant will gain after attending your presentation.: Most healthcare systems have a data warehouse in place, yet integrating this DW with a genomics data platform is not a common task. I will cover some of the challenges we encountered in the journey to set up this environment and some lessons learned. Participants will also learn some of the performance gains we enjoy from using Apache Spark-based ETL and data science pipelines.

Authors:

Murat Sincan (Presenter)
Sanford Health

Presentation Materials:

Tags