Publications
Gene Annotation
Improving Gene Annotation with Machine Learning
Advances in genome sequencing have greatly enhanced the study of organisms, but interpreting genomic variation remains challenging, especially in genome annotation, which models how genes are transcribed and translated. 🧬 We refined a machine …
PlantCaduceus
Revolutionizing Cross-Species Plant Genomics 🌿
PlantCaduceus is an advanced platform designed to model plant genomes across species at single-nucleotide resolution. By leveraging pre-trained DNA language models, PlantCaduceus aims to capture the evolutionary conservation of plant genomes, enabling …
DNA and lncRNA triplexes
TRIPBASE
Long non-coding RNAs (lncRNAs) are defined as RNA sequences longer than 200 nt that lack protein-coding capacity. These lncRNAs participate in various biological mechanisms and are abundant across diverse species. There is well-documented evidence that lncRNAs can …