Skip to main content
  • Idoia Ochoa - Genomic data compression, processing, and analysis

  • Tuesday, April 18, 2017 2:00 PM - 3:00 PM EDT
    LWSN 3102 (A&B)
    Purdue University

    Abstract:
    Recent technological advances have led to a drastic reduction in the cost of genome sequencing, resulting in the accumulation of an unprecedented amount of highly distributed and heterogenous genomic data. My research is dedicated to identifying and addressing the challenges arising in the context of such data. This undertaking - which combines tools from information and coding theory, statistics, and machine learning - includes the design and deployment of new algorithms for coping with the distribution and storage of the data, for facilitating its access and queries, and for improving the analysis and inference performed on it.

    The talk will focus on some of my work in this area, geared toward alleviating the requirements for storage of the data, and improving its analysis. I will present some of our results on lossless compression of genomic data, as well as lossy compressors and denoisers of the quality scores that come with the sequencing data. In some cases lossy compression results not only in substantial storage savings, but also boosted performance when inference is done on the reconstructed data. Conversely, denoising the quality scores can result in substantial compression gains.

    Bio:
    Idoia Ochoa is currently an assistant professor in the Electrical and Computer Engineering department at the University of Illinois at Urbana-Champaign. She received her PhD from the Electrical Engineering department at Stanford University in 2016, where she was advised by Prof. Tsachy Weissman. She received her M.Sc. from the same department in 2012. Prior to Stanford, she graduated with B.Sc. and M.Sc. degrees in Telecommunication Engineering (Electrical Engineering) from the University of Navarra, Spain, in 2009. She has conducted internships at Google and Genapsys, and served as technical consultant for the HBO show "Silicon Valley". Her research interests include compression and coding, bioinformatics, information theory, machine learning, communications and signal processing. She is recipient of the Stanford Graduate Fellowship, La Caixa Graduate Fellowship, and an award for excellence from the Basque Government.



    Associated Files

    Event Flyer (PDF)