Stephen Chen L&S Math & Physical Sciences

Predicting Plant Gene Expression Levels Using Deep Neural Networks

Since proteins carry out the majority of work in living cells, how a cell operates is heavily determined by the relative abundances of different proteins at any given time. These relative abundances are largely determined by gene expression levels: the extent to which a gene is “activated.” Predicting gene expression levels from noncoding DNA sequence is a major unsolved problem in computational biology. Recently, machine learning (ML) has emerged as one of the most effective tools for this task. While significant work has been done with human genomes, much work remains to be done with plant genomes. This project aims to use cutting-edge machine learning models, particularly convolutional and transformer-based neural networks, to accurately predict the activity levels of plant genes based solely on their DNA sequences. An effective tool for predicting these gene expression levels in plants could have unique applications for agriculture, renewable energy, and carbon capture.

Message To Sponsor

Dear Anselm, I am incredibly grateful for the opportunity to pursue my research on gene expression in plants thanks to your sponsorship! This summer has been an amazing opportunity to dive into the fast-moving intersection of computational biology and machine learning. This fellowship has enabled me to learn so much as a scholar and has solidified my love for scientific research and has inspired me to pursue further studies after college. Your support has meant a lot in shaping my academic journey and future goals. Thank you! Stevie
Profile image of Stephen Chen
Major: Applied Mathematics, Computer Science
Mentor: David Savage
Sponsor: Anselm MPS
Back to Listings
Back to Donor Reports