Skip to main content
MSU CSE Colloquium Series 2014-2015: Heng Huang Title:  Large-Scale Structured Sparse Learning with Applications in Imaging Genomics

Dr. Heng Huang
Computer Science
University of Texas at Arlington

Time: Monday, Feb 16, 2015, 10am
Location: EB 3105

Abstract:

Sparsity is one of the intrinsic properties of real-world data, thus the sparse learning has recently emerged as a powerful tool to obtain models of high-dimensional data with high degree of interpretability at low computational cost, and provide great opportunities to analyze the big, complex, and diverse datasets. By enforcing properly designed structured sparsity, we can integrate the specific data/feature structures and domain knowledge into the machine learning models to simplify data models and discover predictive patterns in data analytics. Data science research is accelerating the translation of biological and biomedical data to advance the detection, diagnosis, treatment and prevention of diseases, including the recently announced BRAIN (Brain Research through Advancing Innovative Neurotechnologies) Initiative. To address the challenging problems in current big data mining, we proposed several novel large-scale structured sparse learning models for multi-dimensional data integration, heterogeneous multi-task learning, group/graph structured data analysis, and longitudinal feature learning. We applied our new structured sparse learning models to analyze the multi-modal neuroimaging and genome-wide array data in Imaging Genomics and discover the phenotypic and genotypic biomarkers to characterize the neurodegenerative process in the progression of Alzheimer’s disease and other complex brain disorders. We also utilized our new machine learning models to analyze the Electronic Medical Records for predicting the heart failure patients’ readmission and drug side effects, detect the multi-dimensional biomarkers in The Cancer Genome Atlas (TCGA) research, and identify the brain circuitry patterns in Human Connectome.

Biography:

Heng Huang is an Associate Professor of Computer Science and Engineering (CSE) at University of Texas at Arlington (UTA), director of Data Science Lab in CSE at UTA. Dr. Huang received the PhD degree in Computer Science at Dartmouth College in 2006 and then joined UTA as an assistant professor. His research areas include machine learning, big data mining, bioinformatics, health informatics, neuroinformatics, computer vision, and computational sustainability. Dr. Huang has published more than 100 papers in top-tier conferences and journals, such as RECOMB, ISMB, NIPS, ICML, KDD, IJCAI, AAAI, CVPR, ICCV, SIGIR, Bioinformatics, IEEE Trans. on Medical Imaging, Medical Image Analysis, etc. He is leading multiple NSF funded projects on big data mining, machine learning, imaging genomics, electronic medical records mining and privacy-preserving, computational biology, smart healthcare, cyber physical system, and also industry (e.g. Con Edison in New York City) funded projects on computational sustainability, smart metering and smart grid.

Host:

Dr. Joyce Chai