I am an EDCB (Computational and Quantitative Biology) PhD student at the École polytechnique fédérale de Lausanne and NeuroRestore, under the guidance of Gregoire Courtine, Jordan Squair and Michael Skinnider. Being a huge science nerd, I can talk with someone for hours regarding topics ranging from neuroscience to quantum mechanics. My passion for science stems from my obsession to understand and quantify everything, as well as influences from great philosophers and science popularizers such as Sagan and Feynman.
One of the most fascinating fields to me is biology – how billions of seemingly random biomolecules can come together to construct these intricately complicated yet elegant designs we call life. This ignited my passion for computational biology research.
I am also a fervent advocate of integrating machine learning (ML) into computational biology; the design of ML algorithms are highly apt for the high-dimensional complexity of biological systems. Along my research journey, I have been fortunate enough to produce several publications (2 first-author papers) during my Bachelor's degree and worked on multiple publications (2 first author-papers) in my PhD so far.
EDCB PhD, Doctoral Assistant
February 2022 -
Visiting Student Research Collaborators (VSRC)
July 2024 -
Bachelor of Science (Hons) • Major in Computational Biology
Interdisciplinary Special Programme in Science
August 2017 - May 2021
Exchange Semester
January 2020 - April 2020
Machine Learning (Tensorflow, Pytorch), Artificial Intelligence, Regression Analysis, Statistical Analysis, Image Analysis, Computer Vision, Bioinformatics, Natural Language Processing, Data Structures & Algorithms, Big Data, Database Systems, Graph Theory
Cell culture, Optogenetics, Western Blotting, DNA-sequencing, RNA-sequencing
Python, Java, R, C++, Matlab, Hadoop, Scala, SQL, HTML, Javascript
Here, I attempt to understand the molecular and cellular profiles of the spinal cord using a combination of machine learning algorithms and single-cell sequencing technologies. I mainly work in developing single-cell sequencing analysis frameworks and applying these methods to datasets to identify cells and genes relating to SCI and are responsible for neuro-rehabilitation and neuro-regeneration. The work at Prof Courtine's lab is on the realm of science-fiction and is extremely fascinating to me (NeuroRestore) -- the ability to restore locomotion following SCI. I am really looking forward to help further this cause!
Here, I continued my work from my final year thesis while I awaited the start of my PhD. I compared patient-specific differential gene expression analysis against conventional all-patients differential gene expression analysis. We discovered that by aggregating patient-specific results, we discovered subgroup-specific differentially expressed (DE) genes that are missed out in the conventional analysis. Here, I learnt alot from my supervisors, Prof Chow, Prof Greg, and Prof Roger, as well as my direct mentor, Ah Jung, who has been a wonderful and patient mentor to me. I really enjoyed the project as we attempted to challenge conventional bioinformatic methods and showed an alternative that can help push the agenda of precision medicine. The work here resulted in a paper in BMC Cancer, where I am a co-first author.
For my Final Year Thesis, I worked at the Genome Institute of Singapore under the supervision of Prof Roger Foo and Prof Greg Tucker Kellogg. The project focuses on using multi-omics data (mainly transcriptomics and metabolomics) data to provide a quantitative assessment of inter- and intra-tumor heterogeneity of HCC tumours. The underlying hypothesis is to investigate if HBV affects the tumor heterogeneity of HCC. With this project, I had the freedom to explore different methodologies on my own, which includes VAE, PCA, CNN, and multiple available clustering algorithms. I also analyzed the tumor heterogeneity at other levels (epigenomics, genomics) and examined integrated analysis of the multi-omics data. Concurrently, I also worked in other projects relating to genomics in HCC. The year of work I did here contributed to 2 papers, 1 published and 1 in review.
This project happened in a peculiar time of COVID-19. The project at Boston changed to remote work and I had to re-adjust my schedule to fit the team at Harvard. However, the experience to work with Prof Winston and his team is an enriching one. The project was to prioritize disease-related miRNAs based on gene set analysis of RNA-sequencing data with the subsequent network analysis of the gene sets. My contribution was to develop a prioritization pipeline and test the pipeline on existing datasets such as TCGA data. The work is a manuscript in review (where I am the second author), and was also presented at the Intelligent Systems for Molecular Biology Conference 2020.
Any computational biologist will be drawn to a world-renowned institute such as UCSC Genome Institute. Having used some of their tools such as the UCSC Genome Browser, I wanted the opportunity to go and experience research. Fortunately, I managed to secure a slot at the Institute during my exchange. The project focuses on deciphering the nanopore sequencing signal features utilized by commercial deep learning models (Bonito) for the base-calling.
I came across Prof Noam’s lab when I was looking for some research experience in functional genomics. After talking to Prof Noam online, I was convinced that I had to go to his lab and I flew to Israel armed only with my passion for research. I was tasked with overcoming the low interpretability in the deep learning classification of cancer genomics. The model used is a QRNN model and the input data is raw FASTA sequences. As it was my novice experience in machine learning, I spent a lot of time catching up with the project. Eventually, I adopted the gradient class activation mapping (GCAM) technique and attributed biological significance to the model. My time at this lab was one of my favourites and I learned a lot from Prof Noam, in both research and Israeli culture. The work here laid the foundation of a chapter in his book, Deep Sequencing Data Analysis, where I authored the chapter.
I had the opportunity to work at Prof George’s lab as a UROPS student. The project focuses on the cholinergic modulation on the claustrum, a part of the brain whose function eluded neuroscientists since its discovery. As the lab does not have any computational biologists, I spent a lot of time reading up literature on my own. Another source of inspiration was my direct mentor, Aditya Nair (who is a graduate student at Caltech now), and we would explore theoretical and computational neuroscience concepts during our weekly meetings. Those meetings were always intriguing and I recalled looking forward to them every week. We settled on the idea of building a firing-rate network model using electrophysiological data collected from claustral neurons in brain slices. By approximating the cholinergic input as a gain-control function, we showed that cholinergic input toggle network encoding efficiencies of different subpopulations in the claustrum. The model built here is now in a paper in PNAS where I am a key contributing author.
I had my first taste of research at Prof KL’s lab and I was captivated by the process of scientific research. Although computational biology is not part of Prof KL’s expertise, he encouraged me to explore any methodology that I find interesting. He would constantly push me to read more literature and provide constructive criticism to my ideas. In his lab, I found my love for scientific research. The project I did focused on using random-walk-with-restart algorithm on a protein-protein interaction network built from the STRING database. The approach allowed us to determine molecular links between Parkinson’s disease and Type 2 diabetes, two co-related diseases that have yet to have any explicit overlapping mechanisms.