Category Archives: Uncategorized

Our recent work on transfer learning for error estimation has been featured on ACM Tech News & DOE Office of Science website

This week’s ACM Tech News has featured our recent work on Transfer Learning for Bayesian Error Estimation (TL-BEE):

https://technews.acm.org/archives.cfm?fo=2022-03-mar/mar-11-2022.html

This study has also received spotlight on the Department of Energy (DOE), Office of Science website:

https://www.energy.gov/science/office-science

The work has been recently published in Cell Press Patterns, and the full article can be accessed at the link below:

Omar Maddouri, Xiaoning Qian, Francis J. Alexander, Edward R. Dougherty, Byung-Jun Yoon, “Robust Importance Sampling for Error Estimation in the Context of Optimal Bayesian Transfer Learning,” Patterns, 2022, DOI: https://doi.org/10.1016/j.patter.2021.100428

Omar Maddouri’s recent work on transfer learning for Bayesian error estimation featured on Texas A&M College of Engineering website

Recent work by Omar Maddouri, currently a Ph.D. candidate in the BioMLSP lab, on transfer learning for Bayesian error estimation has been featured in an article entitled “Doctoral student offers new insight into machine-learning error estimation”, which has been published on the Texas A&M College of Engineering website.

The article can be found at: https://engineering.tamu.edu/news/2022/03/ecen-doctoral-student-offers-new-insight-into-machine-learning-error-estimation.html

Our paper on error estimation via optimal Bayesian transfer learning has now been published in Patterns

Our recent study on Bayesian error estimation via optimal Bayesian transfer learning has been published in Patterns, a premium open access journal from Cell Press that publishes ground-breaking original research across the full breadth of data science.

Omar Maddouri, Xiaoning Qian, Francis J. Alexander, Edward R. Dougherty, Byung-Jun Yoon, “Robust Importance Sampling for Error Estimation in the Context of Optimal Bayesian Transfer Learning,” Patterns, DOI:https://doi.org/10.1016/j.patter.2021.100428.

Continue reading

Finding robust biomarkers for more accurate and reproducible disease diagnosis/prognosis

Identifying robust diagnostic/prognostic biomarkers from gene expression data that can lead to accurate and reproducible predictions is a challenging problem.

In our recent paper “Deep graph representations embed network information for robust disease marker identification”, we show that deep graph representations using graph convolutional networks (GCNs) can identify effective markers that significantly improve the predictive performance and their reproducibility across different datasets and platforms.

Omar Maddouri, Xiaoning Qian, Byung-Jun Yoon, “Deep graph representations embed network information for robust disease marker identification,” Bioinformatics, btab772, https://doi.org/10.1093/bioinformatics/btab772.

Continue reading

Our NeurIPS 2021 paper on efficient active learning for Gaussian process classification is now online

We are happy to announce that our NeurIPS 2021 paper entitled “Efficient Active Learning for Gaussian Process Classification by Error Reduction” is now available in OpenReview.net: https://openreview.net/pdf?id=UK15Hj9qX6I

Guang Zhao, Edward Dougherty, Byung-Jun Yoon, Francis Alexander, Xiaoning Qian, “Efficient Active Learning for Gaussian Process Classification by Error Reduction,” Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Dec. 6 – 14, 2021.

Continue reading

Post-doc position available in Scientific Machine Learning (SciML)

We are happy to announce the availability of a new post-doc position in areas relevant to Scientific Machine Learning (SciML).

The position resides in the Applied Mathematics Group of the Computational Science Initiative (CSI) at Brookhaven National Laboratory (BNL), and the post-doctoral researcher will work with Dr. Byung-Jun Yoon and Dr. Nathan Urban on a project focused on scientific data reduction.

We invite outstanding candidates to apply for a post-doctoral research associate position in applied mathematics, machine learning, and scientific computing. This position offers a unique opportunity to conduct research in emerging interdisciplinary research problems at the intersection of applied mathematics, machine learning, and high-performance computing (HPC) with applications in diverse scientific domains of interest to BNL and the Department of Energy (DOE).

Topics of specific interest include:

  • optimal decision-guided data reduction
  • feature extraction/engineering in high-dimensional compositional workflows that involve machine learning (ML) models
  • Bayesian inference and uncertainty quantification in scientific ML models
  • learning/optimization of low-dimensional latent feature spaces for ML surrogates.

The position includes access to world-class HPC resources, such as the BNL Institutional Cluster and DOE leadership computing facilities. Access to these platforms will allow computing at scale and will ensure that the successful candidate will have the necessary resources to solve challenging DOE problems of interest.

This program provides full support for a period of two years at CSI with possible extension. Candidates must have received a doctorate (Ph.D.) in applied mathematics, statistics, computer science, or a related field (e.g., mathematics, engineering, operations research, physics) within the past five years. This post-doc position presents a unique chance to conduct interdisciplinary collaborative research in BNL programs with a highly competitive salary.

For further information, please visit: [post-doc position announcement]

For inquiries, contact Dr. Byung-Jun Yoon or Dr. Nathan Urban.

KBTX news on scientific data reduction: is having more data always better for achieving scientific goals?

KBTX has featured Dr. Yoon and his team’s research project on scientific data reduction in their recent news.

“The team says people tend to think the more data someone has, the better it is for achieving their goal, which is not always the case. That’s why they’re working on a mechanism that can get rid of the unnecessary data without compromising what’s needed.”

Andy Krauss, Reporter – KBTX news

Further details can be found at the link below:

https://www.kbtx.com/2021/10/07/texas-am-researchers-awarded-24-million-grant-efficiently-reduce-size-data-sets/

SLATE Future Tense features Dr. Yoon’s research on scientific data reduction

Future Tense is a partnership of Slate, New America, and Arizona State University that examines emerging technologies, public policy, and society, exploring how emerging technologies will change the way we live.

Future Tense has recently posted an article entitled “Why the Department of Energy Is Spending Millions to Get Rid of Scientific Data”, in which they featured Dr. Yoon and his team’s research project on objective-driven reduction of scientific data, recently funded by the U.S. Department of Energy (DOE). The news article can be accessed at:

https://slate.com/technology/2021/10/big-data-information-overload-department-of-energy.html

What is Future Tense?

“A partnership of Slate, New America, and Arizona State University, Future Tense explores how emerging technologies will change the way we live. The latest consumer gadgets are intriguing, but we focus on the longer-term transformative power of robotics, information and communication technologies, synthetic biology, augmented reality, space exploration, and other technologies. Future Tense seeks to understand the latest technological and scientific breakthroughs, and what they mean for our environment, how we relate to one another, and what it means to be human. Future Tense also examines whether technology and its development can be governed democratically and ethically. Future Tense asks these questions in daily commentary published on Slate and through public events featuring conversations with leading scientists, technologists, policymakers, and journalists.”

Quoted from Future Tense website. For further info, visit: https://slate.com/future-tense

TRIMER – a modeling and simulation framework for integrated analysis of transcription regulation and metabolic regulation

Our recent collaborative work on integrated modeling of transcription regulation and metabolic regulation has been published in iScience, an open access interdisciplinary journal from Cell Press.

Puhua Niu, Maria J. Soto, Byung-Jun Yoon, Edward R. Dougherty, Francis J. Alexander, Ian Blaby, Xiaoning Qian, “TRIMER: Transcription Regulation Integrated with MEtabolic Regulation,” iScience, doi:10.1016/j.isci.2021.103218.

In this work, we have developed a modeling and simulation pipeline enabling the analysis of Transcription Regulation Integrated with MEtabolic Regulation: TRIMER. TRIMER utilizes a Bayesian network (BN) inferred from transcriptome data to model the transcription factor regulatory network (TRN). TRIMER then infers the probabilities of the gene states relevant to the metabolism of interest, and predicts the metabolic fluxes and their changes that result from the deletion of one or more transcription factors. In this study, we demonstrate that TRIMER yields accurate predictions of how the knockout of one or more TFs affect the metabolic outcomes, outperforming existing state-of-the-art approaches based on both simulated as well as real experimental data from gene knockout experiments.

Overview of the TRIMER modeling and simulation pipeline.

iScience is a new open-access journal from Cell Press that provides a platform for original research in the life, physical, and earth sciences. The primary criterion for publication in iScience is a significant contribution to a relevant field combined with robust results and underlying methodology. The advances appearing in iScience include both fundamental and applied investigations across this interdisciplinary range of topic areas.

Dr. Yoon’s DOE-funded project on scientific data reduction featured on TAMU ECE website

Dr. Yoon’s project on “Objective-Driven Data Reduction for Scientific Workflows” recently funded by the U.S. Department of Energy (DOE), Advanced Scientific Computing Research (ASCR), has been featured in a recent article posted on the Texas A&M Department of Electrical and Computer Engineering website.

The full article can be accessed at the link below:

DOE-funded research project to efficiently reduce massive scientific data