Diogo Ribeiro

Researcher in Bioinformatics

Follow me on GitHub Diogo Ribeiro

About me 👋

I’m a scientist, with 5 years experience as a postdoctoral researcher at the Systems and Population Genetics Group at the University of Lausanne, Switzerland. I have led a research project funded by the European Commission centered on understanding why neighbouring genes are often found co-expressed and what are the molecular and disease-state repercussions of this co-expression. In 2018, I obtained a PhD in Bioinformatics and Genomics from the Aix-Marseille University (France) while working at the TAGC unit, under the supervision of Dr. Christine Brun. Prior to my PhD, I worked as bioinformatician in renowned research institutes (Sanger Institute, UK & Instituto Gulbenkian de Ciência, Portugal).

I have solid experience in functional genomics and analysing big data (e.g. UK Biobank whole genome sequencing on 500k samples), as well as technical skills such as programming and creating scientific databases (e.g. MoonDB, LoCOP). As a main author of several peer-reviewed publications, I have good experience in writing scientific articles, as well as research grants. I have worked both independently and as part of a team, including supervising students.

In my free time you will find me immersed in nature, exploring all the possible trails, only stopping for some bird watching. Currently, I’m enjoy hiking in the lovely Swiss Alps. When the weather takes me indoors, I enjoy playing guitar and tennis.

Interests 💭

My interests are broad and mixed between the understanding of fundamental biology and leveraging computational and technological advances to answer biological questions.

  • Genetics & Evolution
  • Functional genomics & RNA biology
  • High-throughput technologies (e.g. sequencing, multi-omics)
  • Machine learning & scientific programming

Experience & Education 🎓

Postdoctoral researcher (2019-2024)

  • Where: Department of Computational Biology, University of Lausanne, Switzerland
  • Project: Marie-Curie Postdoctoral Research Fellow, leading a project on studying the effect of genetic variants in gene expression, complex traits and diseases
  • Supervisor: Olivier Delaneau (Systems and Population Genetics Group)
  • Keywords: human genetics, genetic variations, gene expression, eQTL analysis, GWAS
  • Internship student supervision: Chaymae Ziyani: Exploring gene expression regulation by enhancers through single cell analysis
  • Master student co-supervision: Yami Arizmendi: Molecular features impacting local gene co-expression
  • Teaching: Python practicals, Biostatistics with R (Bachelor level) / Gene expression, eQTL and GWAS analysis (Master level)

Lavaux Switzerland

PhD in Bioinformatics (2015-2018)

  • Where: Aix-Marseille University, France. Carried out at the TAGC laboratory, INSERM, France.
  • Thesis title: Discovery of the role of protein-RNA interactions in protein multifunctionality and cellular complexity
  • Supervisors: Christine Brun (TAGC, Marseille, France), Gian Gaetano Tartaglia (CRG, Barcelona, Spain)
  • Keywords: protein-RNA interactions, network biology, protein multifunctionality, 3’UTR and lncRNA function, database development
  • Master student co-supervision: Adrien Teixeira (first year): Predicting 3’UTRs scaffolding protein complexes, Paul De Boissier (first year): Large-scale inference of small ORF interactions
  • Teaching: Bioinformatics (3rd year Bachelor students), Informatics for biology (1st year Bachelor students)

Cassis France

Bioinformatician (2014-2015)

  • Where: Wellcome Sanger Institute, Cambridge, UK
  • Supervisors: Matthew Berriman, Avril Coghlan (Parasite Genomics group)
  • Keywords: comparative genomics, de novo genome assembly, gene discovery, gene function annotation, SNP calling, parasitism

Bioinformatician (2012-2013)

  • Where:Instituto Gulbenkian de Ciência (IGC), Oeiras, Portugal
  • Supervisor: Alekos Athanasiadis (Protein-Nucleic Acid Interactions group)
  • Keywords: RNA editing, machine learning, genetic algorithms, evolution

Guincho Portugal

Master in Bioinformatics (2010-2012)

  • Where: University of Lisbon, Portugal
  • Thesis title: Studies on the Influence of A to I RNA Editing on Protein Evolution, carried out at IGC, Portugal

Bachelor in Biology (2006-2010)

Caparica Portugal


Publications 📜

See publications on Google scholar

First-author publications

  1. Ribeiro DM, Delaneau O. (2023) Non-coding rare variant associations with blood traits on 166,740 UK Biobank genomes. BioRxiv. 💬 Exploiting the 200k whole genome sequences from the UK Biobank identify rare variants associated to >50 blood traits, leveraging genomic information linking genes to regulatory regions.
  2. Hofmeister R*, Ribeiro DM*, Rubinacci S*, Delaneau O. (2023) Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank. Nature Genetics. (*co-first authors) 💬 Phasing of the UK Biobank WGS and WES data for ~150K individuals using SHAPEIT5. I participated in phasing quality-control and identifying compound heterozygous genes, showing that these are depleted for essential genes.
  3. Ribeiro DM, Ziyani C, Delaneau O. (2022) Shared regulation and functional relevance of local gene co-expression revealed by single cell analysis. Communications Biology.
    💬 Work using single cell data to measure gene co-expression, comparing bulk and single cell-derived co-expressed gene pairs. Exploiting multimodal single cell data (SHARE-seq data) to link thousands of co-active genes and enhancers, showing that co-expressed genes often share the same enhancers.
  4. Ribeiro DM, Rubinacci S, Ramisch A, Hofmeister RJ, Dermitzakis ET, Delaneau O. (2021) The molecular basis, genetic control and pleiotropic effects of local gene co-expression. Nature Communications.
    💬 This work describes a novel approach to identify neighbouring genes that are co-expressed with each other across 49 human tissues, by levaraging RNA-seq datasets performed on hundreds of individuals (i.e. GTEx). As gene co-expression can stem from the genes having common regulatory elements, we identified thousands of genetic variants that are near to the co-expressed genes and may jointly affect their expression. Interestingly, these variants often overlapped enhancers and associated with multiple human traits and diseases. The main results of this study are available on LoCOP DB.
  5. Ribeiro DM, Prod’homme A, Teixeira A, Zanzoni A, Brun C. (2020) The role of 3′UTR-protein complexes in the regulation of protein multifunctionality and subcellular localization. Nucleic Acids Research.
    💬 This was the final publication stemming from my PhD work. In a very innovative approach, we combined protein-3’UTR and protein-protein interactions into a joint network and predicted the effect of 3’UTRs in affecting the protein cellular localization. This was based in a novel finding that during protein translation, 3’UTRs may facilitate the formation of protein complexes which affect the cellular localization of the nascent protein. For the first time, we predicted that this phenomenon may be common, particularly for proteins which perform multiple unrelated cellular functions (e.g. moonlighting proteins).
  6. Ribeiro DM, Briere G, Bely B, Spinelli L, Brun C. (2018) MoonDB 2.0: an updated database of extreme multifunctional and moonlighting proteins. Nucleic Acids Research.
    💬 In this work I developed a scientific database, MoonDB 2.0, a reiteration of a previous MoonDB database. This database contains computational predictions and manually curated multifunctional and moonlighting proteins, i.e. proteins which are (often entirely unexpected) functions. This database includes proteins from Human, Mouse, Yeast, Drosophila and Worm reference species.
  7. Ribeiro DM, Zanzoni A, (…), Tartaglia GG, Brun C. (2018) Protein complex scaffolding predicted as a prevalent function of long non-coding RNAs. Nucleic Acids Research.
    💬 This was the first publication of my PhD work. This highly innovative work entailed a large-scale analysis of the largest protein-RNA interaction network at the time, an often understudied subject. The creation of such a large network was possible due to the use of catRAPID, a computational prediction software. We used this network to gain insights into the functional of lncRNAs (which is generally unknown), based on their association to proteins (for which we know the function).
  8. Ribeiro DM, Coghlan A, Harsha B, Berriman M. (2018) Identification of lineage-specific gene family expansions in a database of gene families. Protocol Exchange (not peer-reviewed)
    💬 This is a protocol publication (not a research article) describing several metrics developed to identify gene family expansions in a lineage of species within a larger set of species. This protocol was used for the 50 Helminth Genomes Project at the Sanger Institute.

    Last-author publications

  9. Ziyani C, Delaneau O, Ribeiro DM. (2023) Multimodal single cell analysis infers widespread enhancer co-activity in a lymphoblastoid cell line Communications Biology. 💬 Pioneering work exploiting the multimodal single cell SHARE-seq dataset to identify enhancer-enhancer associations based on their co-activity across cells. Enhancers targeting the same gene are often co-active and have binding sites for the same transcription factors.

    Co-authored publications

  10. Avalos D, Rey G, Ribeiro DM, Ramisch A, Dermitzakis ET, Delaneau O. (2023) Genetic variation in cis-regulatory domains suggests cell type-specific regulatory mechanisms in immunity Communications Biology.
  11. Hofmeister RJ, Rubinacci S, Ribeiro DM, Buil A, Kutalik Z, Delaneau O. (2022) Parent-of-Origin inference for biobanks Nature Communications.
  12. Rubinacci S, Ribeiro DM, Hofmeister RJ, Delaneau O. (2021) Efficient phasing and imputation of low-coverage sequencing data using large reference panels Nature Genetics.
  13. Porcu E, (…), Ribeiro DM, (…), Kutalik Z. (2021) Differentially expressed genes reflect disease-induced rather than disease-causing changes in the transcriptome Nature Communications.
  14. Zanzoni A, Ribeiro DM, Brun C. (2019) Understanding protein multifunctionality: from short linear motifs to cellular functions. Cellular and Molecular Life Sciences.(book chapter)
  15. Zanzoni A, Spinelli L, Ribeiro DM, Tartaglia GG, Brun C. (2019) Post-transcriptional regulatory patterns revealed by protein-RNA interactions. Scientific Reports.
  16. International Helminth Genomes Consortium. (2018) Comparative genomics of the major parasitic worms. Nature Genetics.
  17. Protasio AV, Dongen S, Collins J, Quintais L, Ribeiro DM, (…), Berriman m. (2017) MiR-277/4989 regulate transcriptional landscape during juvenile to adult transition in the parasitic helminth Schistosoma mansoni. PLoS Negl Trop Dis.
  18. Hunt VL, Tsai IJ, (…), Ribeiro DM, (…), Berriman M. (2016) The genomic basis of parasitism in the Strongyloides clade of nematodes. Nature Genetics.
  19. Bennett HM, Mok HP, (…), Ribeiro DM, (…), Berriman M. (2014) The genome of the sparganosis tapeworm Spirometra erinaceieuropaei isolated from the biopsy of a migrating brain lesion. Genome Biology.

Awards & Funding


Contact 🌍

🏢 Department of Computational Biology, Genopode, UNIL-Sorge, 1015 Lausanne, Switzerland

✉️ diogo.am.ribeiro@gmail.com

🐤 Twitter

🔗 LinkedIN